Difference between revisions of "CommunityData:Message Walls"

From CommunityData
 
(27 intermediate revisions by 2 users not shown)
Line 3: Line 3:
 
* Notes on Wikia Dumps [[CommunityData:Wikia Dumps]]
 
* Notes on Wikia Dumps [[CommunityData:Wikia Dumps]]
 
* Notes on the code -- Now with a diagram! [[CommunityData:Message Walls Code]]
 
* Notes on the code -- Now with a diagram! [[CommunityData:Message Walls Code]]
 +
 +
= Robustness Checks =
 +
* Pre-period matching placebo test
 +
* Normal placebo test
  
 
= Task Management =  
 
= Task Management =  
Line 8: Line 12:
 
==Overview ==
 
==Overview ==
  
(last updated December 5)
+
(Updated March 15th)
  
===Data collection & analysis===
+
===Get missing wikis===
*Get missing wikis:
+
*'''ASAP''' Need to use wikilist3.csv to determine which wikis we don't have - Salt (with Mako's help)
** Salt will use wikiList.3.csv to find which wikis we don't have.
+
*'''ASAP''' Download the rest and put them through wikiq and build edit weeks - Salt (with Mako's help)
** Salt will email Mako the list, and he will ask the Spaniards whether they have them
 
** Collect any others we need (Salt, Mako)
 
*By Dec 6: Gather data for all wikis that made the switch within the first wave of migrations to msg walls (including re-running wikiq, new wikilist, and so on.) (Nate)
 
** Partition Danny Horn csv into train and test sets (Nate) ['''done''']
 
*Write down preliminary inclusion criteria for analysis (Sneha, Nate) ['''done''']
 
*By Dec 12: Implement current inclusion criteria for analysis on training set ['''done''']
 
** Update existing READMEs, and write codebook describing all variables in detail (Sneha) ['''done''']
 
* By Jan 26 - Debug editweeks code/identify source of pre-cutoff edits
 
* By Feb 15 - Get initial results
 
  
 
+
===Analysis===
* Early April - run analysis on test set
+
* Another meeting with full team to go over the results and try to make sense of them (after Sneha takes a first stab)
 +
* Determine any other models we want to run
  
 
===Writing===
 
===Writing===
*By Dec 15 - Convert draft of framing, literature review and methodology sections to ACM format (Sneha)
+
* Switch from Haythornwaite to Reader to Leader framing (Sneha)
*Week of Jan 8 - Check-in meeting to discuss preliminary results (Sneha)
+
* knitr integration (Sneha + Nate)
*By Jan 15 - Draft results section of the paper, identify changes to be made to framing/intro based on results (Sneha)
+
* plots (Salt)
*By Jan 31 - Complete first draft of paper (Sneha)
+
* Better pictures of message walls (Sneha)
* April 16 - CSCW abstract + metadata deadline
+
* Better explanations of why talk pages suck (Sneha)
* April 19 - CSCW submission deadline
+
* Zotero streamlining
 
 
==Next Steps (Nov 30)==
 
*(Nate) Update code diagram to say we're using wikiList.3.csv
 
*(Nate) Write python code to create training set
 
*(Nate and Salt) Link missing dumps to online dump sources
 
*(Sneha) Write R code for analyzing data at the edit weeks level
 
  
=== Archive ===
+
== Archive ==
  
 
* [[/Archived_tasks|Past next steps]]
 
* [[/Archived_tasks|Past next steps]]

Latest revision as of 20:31, 14 June 2018

Useful Resources[edit]

Robustness Checks[edit]

  • Pre-period matching placebo test
  • Normal placebo test

Task Management[edit]

Overview[edit]

(Updated March 15th)

Get missing wikis[edit]

  • ASAP Need to use wikilist3.csv to determine which wikis we don't have - Salt (with Mako's help)
  • ASAP Download the rest and put them through wikiq and build edit weeks - Salt (with Mako's help)

Analysis[edit]

  • Another meeting with full team to go over the results and try to make sense of them (after Sneha takes a first stab)
  • Determine any other models we want to run

Writing[edit]

  • Switch from Haythornwaite to Reader to Leader framing (Sneha)
  • knitr integration (Sneha + Nate)
  • plots (Salt)
  • Better pictures of message walls (Sneha)
  • Better explanations of why talk pages suck (Sneha)
  • Zotero streamlining

Archive[edit]