CommunityData:Message Walls

Revision as of 02:27, 6 December 2017

Useful Resources

Task Management

Overview

(last updated December 5)

Data collection & analysis

  • Get missing wikis:
    • Salt will use wikiList.3.csv to find which wikis we don't have.
    • Salt will email Mako the list, and Mako will ask the Spaniards whether they have the missing wikis.
    • Collect any others we need (Salt, Mako)
  • By Dec 6: Gather data for all wikis that made the switch to message walls within the first wave of migrations (including re-running wikiq, new wikilist, and so on) (Nate)
    • Partition Danny Horn CSV into train and test sets (Nate)
  • Write down preliminary inclusion criteria for analysis (Sneha)
  • By Dec 12: Implement current inclusion criteria for analysis on training set and ensure all variables are built as intended (Sneha)
  • Dec 12 - 22: Complete initial version of results (Sneha)
  • Dec 22 - Jan 5th: Generate models that we plan to report (Sneha)
  • Early April - run analysis on training set
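
The "get missing wikis" step above amounts to a set difference between the wiki list and the wikis we already hold. A minimal sketch, assuming wikiList.3.csv has a column identifying each wiki; the column name `url`, the helper `missing_wikis`, and the sample rows are hypothetical, since the real file layout is not recorded on this page:

```python
import csv
import io


def missing_wikis(wikilist_rows, available, key="url"):
    """Return identifiers that appear in the wiki list but not in `available`.

    `key` is a hypothetical column name; adjust it to the real CSV header.
    """
    have = {name.strip().lower() for name in available}
    missing = []
    for row in wikilist_rows:
        ident = row[key].strip()
        if ident.lower() not in have:
            missing.append(ident)
    return missing


# In-memory stand-in for wikiList.3.csv (made-up rows for illustration only).
sample = io.StringIO("url\nalpha.example.org\nbeta.example.org\ngamma.example.org\n")
rows = list(csv.DictReader(sample))
print(missing_wikis(rows, available=["alpha.example.org"]))
```

For the real file, the rows would come from `csv.DictReader(open("wikiList.3.csv", newline=""))`, and `available` from wherever the existing dumps are catalogued.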

Writing

  • By Dec 15 - Convert draft of framing, literature review and methodology sections to ACM format
  • Week of Jan 8 - Check-in meeting to discuss preliminary results
  • By Jan 15 - Draft results section of the paper, identify changes to be made to framing/intro based on results
  • By Jan 31 - Complete first draft of paper
  • April 16 - CSCW abstract + metadata deadline
  • April 19 - CSCW submission deadline

Next Steps (Nov 30)

  • (Nate) Update code diagram to say we're using wikiList.3.csv
  • (Nate) Write Python code to create training set
  • (Nate and Salt) Link missing dumps to online dump sources
  • (Sneha) Write R code for analyzing data at the edit weeks level
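
One possible shape for the training-set code mentioned above is a seeded random split at the wiki level, so that the same partition can be regenerated later. This is only a sketch under stated assumptions: the function name `split_train_test`, the 30% test fraction, and the seed are illustrative choices, not decisions recorded on this page.

```python
import random


def split_train_test(items, test_fraction=0.5, seed=0):
    """Shuffle a sorted copy of `items` with a fixed seed; return (train, test)."""
    if not 0 < test_fraction < 1:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = sorted(items)  # sort first so the input order does not affect the split
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Made-up wiki names standing in for the rows of the migration CSV.
wikis = [f"wiki{i}" for i in range(10)]
train, test = split_train_test(wikis, test_fraction=0.3, seed=42)
print(len(train), len(test))
```

Splitting by wiki rather than by edit keeps all observations from one wiki on the same side of the partition, and fixing the seed means the test set can stay untouched until the final analysis.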

Archive