CommunityData:Message Walls: Difference between revisions
From CommunityData
Line 19: | Line 19: | ||
*Generate variables for newcomer edits to ns0, ns3 and ns1200 separately (Nate) | *Generate variables for newcomer edits to ns0, ns3 and ns1200 separately (Nate) | ||
*Add variable for number of newcomers who make an edit in two consecutive weeks (Nate) | *Add variable for number of newcomers who make an edit in two consecutive weeks (Nate) | ||
*Fix negative age in weeks bug (Nate) | |||
*Fix sessions.ns1201/sessions.ns1202 bug (Nate) | |||
===Analysis=== | ===Analysis=== |
Revision as of 18:13, 22 March 2018
Useful Resources
- Notes on Wikia Dumps CommunityData:Wikia Dumps
- Notes on the code -- Now with a diagram! CommunityData:Message Walls Code
Task Management
Overview
(Updated March 15th)
Get missing wikis
- ASAP Need to use wikilist3.csv to determine which wikis we don't have
- ASAP Download the rest and put them through wikiq and build edit weeks
Dataset construction
- Fix non-reverted edits bug (Nate)
- Add session variables to wiki weeks (Nate)
- Generate variables for newcomer edits to ns0, ns3 and ns1200 separately (Nate)
- Add variable for number of newcomers who make an edit in two consecutive weeks (Nate)
- Fix negative age in weeks bug (Nate)
- Fix sessions.ns1201/sessions.ns1202 bug (Nate)
Analysis
- Figure out what's causing the spike in total edits around the transition date - DONE
- Run a version of the analyses that takes into account whether newcomers ever edited
- Use unique editors as an outcome variable - DONE
- By March 22:
- Run negative binomial versions of models on Hyak
- Determine models to report
- By March 25:
- Clean up code and change to knitrable format
- On April 10th:
- Run the analysis on the test dataset
Writing
- By April 1st: have a full draft of the paper ready for collaborative editing
- First week of April: Revise paper, present at Seattle meetup for comments/suggestions
- April 16: CSCW abstract + metadata deadline
- April 19: CSCW submission deadline