Editing CommunityData:Message Walls
From CommunityData
Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.
The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.
Latest revision | Your text | ||
Line 3: | Line 3: | ||
* Notes on Wikia Dumps [[CommunityData:Wikia Dumps]] | * Notes on Wikia Dumps [[CommunityData:Wikia Dumps]] | ||
* Notes on the code -- Now with a diagram! [[CommunityData:Message Walls Code]] | * Notes on the code -- Now with a diagram! [[CommunityData:Message Walls Code]] | ||
= Task Management = | = Task Management = | ||
Line 14: | Line 10: | ||
(Updated March 15th) | (Updated March 15th) | ||
*Get missing wikis: | |||
*'''ASAP''' Need to use wikilist3.csv to determine which wikis we don't have | **'''ASAP''' Need to use wikilist3.csv to determine which wikis we don't have | ||
*'''ASAP''' Download the rest and put them through wikiq and build edit weeks | **'''ASAP''' Download the rest and put them through wikiq and build edit weeks | ||
=== | *Analysis | ||
* | **'''March 22:''' Determine models to report | ||
* | **'''March 25:''' Clean up code and change to knitrable format | ||
**'''April 10th:''' Run the analysis on the rest of the data set | |||
*Writing | |||
**'''By April 1st:''' have a full draft of the paper ready for collaborative editing | |||
**'''First week of April:''' Revise paper, present at Seattle meetup for comments/suggestions | |||
(last updated December 5) | |||
===Data collection & analysis=== | |||
*Get missing wikis: | |||
** Salt will use wikiList.3.csv to find which wikis we don't have. | |||
** Salt will email Mako the list, and he will ask the Spaniards whether they have them | |||
** Collect any others we need (Salt, Mako) | |||
*By Dec 6: Gather data for all wikis that made the switch within the first wave of migrations to msg walls (including re-running wikiq, new wikilist, and so on.) (Nate) | |||
** Partition Danny Horn csv into train and test sets (Nate) ['''done'''] | |||
*Write down preliminary inclusion criteria for analysis (Sneha, Nate) ['''done'''] | |||
*By Dec 12: Implement current inclusion criteria for analysis on training set ['''done'''] | |||
** Update existing READMEs, and write codebook describing all variables in detail (Sneha) ['''done'''] | |||
* By Jan 25 - Debug editweeks code/identify source of pre-cutoff edits (Nate) | |||
* By Jan 25 - Start writing analysis code | |||
* By Feb 15 - Get initial results, make headway on written draft | |||
* Early April - run analysis on test set | |||
===Writing=== | ===Writing=== | ||
* | *By Dec 15 - Convert draft of framing, literature review and methodology sections to ACM format (Sneha) | ||
* | *Week of Jan 8 - Check-in meeting to discuss preliminary results (Sneha) | ||
* | *By Jan 15 - Draft results section of the paper, identify changes to be made to framing/intro based on results (Sneha) | ||
* | *By Jan 31 - Complete first draft of paper (Sneha) | ||
* | * April 16 - CSCW abstract + metadata deadline | ||
* April 19 - CSCW submission deadline | |||
==Next Steps (Nov 30)== | |||
*(Nate) Update code diagram to say we're using wikiList.3.csv | |||
*(Nate) Write python code to create training set | |||
*(Nate and Salt) Link missing dumps to online dump sources | |||
*(Sneha) Write R code for analyzing data at the edit weeks level | |||
== Archive == | === Archive === | ||
* [[/Archived_tasks|Past next steps]] | * [[/Archived_tasks|Past next steps]] |