CommunityData:Message Walls: Difference between revisions
From CommunityData
Line 15: | Line 15: | ||
** Salt will email Mako the list, and he will ask the Spaniards whether they have them | ** Salt will email Mako the list, and he will ask the Spaniards whether they have them | ||
** Collect any others we need (Salt, Mako) | ** Collect any others we need (Salt, Mako) | ||
*By Dec 6: Gather data for all wikis that made the switch within the first wave of migrations to msg walls (including re-running wikiq, new wikilist, and so on.) | *By Dec 6: Gather data for all wikis that made the switch within the first wave of migrations to msg walls (including re-running wikiq, new wikilist, and so on.) (Nate) | ||
** Partition Danny Horn csv into train and test sets | ** Partition Danny Horn csv into train and test sets (Nate) | ||
*Write down preliminary inclusion criteria for analysis | *Write down preliminary inclusion criteria for analysis (Sneha) | ||
*By Dec 12: Implement current inclusion criteria for analysis on training set, ensure all variables are built as intended | *By Dec 12: Implement current inclusion criteria for analysis on training set, ensure all variables are built as intended (Sneha) | ||
*Dec 12 - 22: Complete initial version of results | *Dec 12 - 22: Complete initial version of results (Sneha) | ||
*Dec 22 - Jan 5th: Generate models that we plan to report | *Dec 22 - Jan 5th: Generate models that we plan to report (Sneha) | ||
* Early April - run analysis on training set | * Early April - run analysis on training set | ||
Revision as of 00:27, 6 December 2017
Useful Resources
- Notes on Wikia Dumps CommunityData:Wikia Dumps
- Notes on the code -- Now with a diagram! CommunityData:Message Walls Code
Task Management
Overview
(last updated December 5)
Data collection & analysis
- Get missing wikis:
- Salt will use wikiList.3.csv to find which wikis we don't have.
- Salt will email Mako the list, and he will ask the Spaniards whether they have them
- Collect any others we need (Salt, Mako)
- By Dec 6: Gather data for all wikis that made the switch within the first wave of migrations to msg walls (including re-running wikiq, new wikilist, and so on.) (Nate)
- Partition Danny Horn csv into train and test sets (Nate)
- Write down preliminary inclusion criteria for analysis (Sneha)
- By Dec 12: Implement current inclusion criteria for analysis on training set, ensure all variables are built as intended (Sneha)
- Dec 12 - 22: Complete initial version of results (Sneha)
- Dec 22 - Jan 5th: Generate models that we plan to report (Sneha)
- Early April - run analysis on training set
Writing
- By Dec 15 - Convert draft of framing, literature review and methodology sections to ACM format
- Week of Jan 8 - Check-in meeting to discuss preliminary results
- By Jan 15 - Draft results section of the paper, identify changes to be made to framing/intro based on results
- By Jan 31 - Complete first draft of paper
- April 16 - CSCW abstract + metadata deadline
- April 19 - CSCW submission deadline
Next Steps (Nov 30)
- (Nate) Update code diagram to say we're using wikiList.3.csv
- (Nate) Write python code to create training set
- (Nate and Salt) Link missing dumps to online dump sources
- (Sneha) Write R code for analyzing data at the edit weeks level