CommunityData:Message Walls: Difference between revisions

From CommunityData
(101 intermediate revisions by 4 users not shown)


* Notes on Wikia Dumps [[CommunityData:Wikia Dumps]]
* Notes on the code [[CommunityData:Message Walls Code]]
* Notes on the code -- Now with a diagram! [[CommunityData:Message Walls Code]]


= Robustness Checks =
* Pre-period matching placebo test
* Normal placebo test


= Task Management =  
==Future Tasks==
* Scrape admin and bot edits using a script from Mako


==Next Steps (June 20th)==
==Overview ==
* (Nate) Improve wiki list by identifying wikis that turn off the feature without turning on first (Done)
* (Nate) Get <strike>muppet wiki</strike> Dr. Horrible Wiki edit weeks for Sneha (Done)
* (Salt or Nate) Do brute-force mapping using revision IDs and hashes of the revision text
* (Sneha) Will play with muppet wiki data
* (Sneha) create list of subsetting characteristics for study


==Next Steps (June 13th)==
(Updated March 15th)


* Build a new dataset of dumps of the ~4800 wikis (Salt/Nate) (May take more than a week to generate all the new dumps)
===Get missing wikis===
*'''ASAP''' Need to use wikilist3.csv to determine which wikis we don't have - Salt (with Mako's help)
* Build a msgwall version of the build_edit_weeks file from the anon_edits paper (Nate)
*'''ASAP''' Download the rest and put them through wikiq and build edit weeks - Salt (with Mako's help)


* Do analysis of alt history wiki and update (Sneha)
===Analysis===
* Another meeting with full team to go over the results and try to make sense of them (after Sneha takes a first stab)
* Determine any other models we want to run


* Create list of criteria to identify wikis we want to use in this study (Sneha)
===Writing===
* Switch from Haythornthwaite to Reader to Leader framing (Sneha)
* knitr integration (Sneha + Nate)
* plots (Salt)
* Better pictures of message walls (Sneha)
* Better explanations of why talk pages suck (Sneha)
* Zotero streamlining


== Next Steps (June 6th)==  
== Archive ==


* Identify list of Wikis we will analyze from the tsv file. 
* [[/Archived_tasks|Past next steps]]
 
* Attempt to obtain a good dump for each of these wikis. See [[CommunityData:Wikia Dumps]] for information.
 
** This may depend on mapping between the URLs in the tsv file and the dumps. Consider using HTTP redirects from the URL under <siteinfo>. 
* Modify Wikiq to give an error message if the closing </mediawiki> tag is missing.
 
* Sneha to take a look at the althistory data from Nate. 
 
* Nate will write a version of build_edit_weeks for the message wall project
 
* Check back next meeting Tuesday (June 13th)
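
The check for a missing closing </mediawiki> tag mentioned above could look roughly like the sketch below. It is not Wikiq's actual code: the function name has_closing_mediawiki_tag and the tail_bytes parameter are made up for illustration, and it assumes an uncompressed XML dump on disk (a compressed dump would need to be decompressed first).

```python
import os


def has_closing_mediawiki_tag(path, tail_bytes=1024):
    """Return True if the file's last non-whitespace bytes are </mediawiki>.

    Hypothetical helper: only the tail of the file is read, so the check
    stays cheap even for very large dumps.
    """
    with open(path, "rb") as f:
        f.seek(0, os.SEEK_END)
        size = f.tell()
        # Jump to at most tail_bytes before the end of the file.
        f.seek(max(0, size - tail_bytes))
        tail = f.read()
    return tail.rstrip().endswith(b"</mediawiki>")
```

A caller could print an error and skip the dump whenever this returns False, which would flag truncated downloads before they reach the edit-weeks step.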

Latest revision as of 18:31, 14 June 2018

Useful Resources

Robustness Checks

  • Pre-period matching placebo test
  • Normal placebo test

Task Management

Overview

(Updated March 15th)

Get missing wikis

  • ASAP Need to use wikilist3.csv to determine which wikis we don't have - Salt (with Mako's help)
  • ASAP Download the rest and put them through wikiq and build edit weeks - Salt (with Mako's help)
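
One possible way to work out which wikis are still missing is sketched below. The layout of wikilist3.csv is not described on this page, so the url_column name is an assumption, as are the function name missing_wikis and the idea that dumps on hand are identified by the same URL strings as the list.

```python
import csv


def missing_wikis(wikilist_path, dumps_on_hand, url_column="url"):
    """Return wikis listed in the CSV that have no dump yet, in file order.

    Hypothetical helper: url_column is an assumed column name, and
    dumps_on_hand is any iterable of identifiers for dumps we already have.
    """
    seen = {name.strip().lower() for name in dumps_on_hand}
    missing = []
    with open(wikilist_path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            url = (row.get(url_column) or "").strip().lower()
            if url and url not in seen:
                missing.append(url)
                seen.add(url)  # avoid reporting the same wiki twice
    return missing
```

The returned list could then be handed to whatever script downloads the dumps, keeping the comparison step separate from the download step.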

Analysis

  • Another meeting with full team to go over the results and try to make sense of them (after Sneha takes a first stab)
  • Determine any other models we want to run

Writing

  • Switch from Haythornthwaite to Reader to Leader framing (Sneha)
  • knitr integration (Sneha + Nate)
  • plots (Salt)
  • Better pictures of message walls (Sneha)
  • Better explanations of why talk pages suck (Sneha)
  • Zotero streamlining

Archive