CommunityData:Message Walls: Difference between revisions
From CommunityData
Groceryheist (talk | contribs) No edit summary |
|||
Line 17: | Line 17: | ||
==Next Steps (June 13th)== | ==Next Steps (June 13th)== | ||
* Build a new dataset of dumps of the ~4800 wikis (Salt/Nate) | * Build a new dataset of dumps of the ~4800 wikis (Salt/Nate) (May take more than a week to generate all the new dumps) | ||
* Build a msgwall version of the build_edit_weeks file from the anon_edits paper (Nate) | * Build a msgwall version of the build_edit_weeks file from the anon_edits paper (Nate) | ||
Line 24: | Line 24: | ||
* Create list of criteria to identify wikis we want to use in this study (Sneha) | * Create list of criteria to identify wikis we want to use in this study (Sneha) | ||
==Future Tasks== | |||
* Scrape admin and bot edits using a script from Mako |
Revision as of 19:25, 13 June 2017
Next Steps (June 6th)
- Identify list of Wikis we will analyze from the tsv file.
- Attempt to obtain a good dump for each of these wikis. See CommunityData:Wikia Dumps for information.
- This may depend on mapping between the urls in the tsv file and the dumps. Consider using HTTP redirects from the url under <siteinfo>.
- Modify Wikiq to give an error message if the closing </mediawiki> tag is missing.
- Sneha to take a look althistory data from Nate.
- Nate will write a version of build_edit_weeks for the message wall project
- Check back next meeting Tuesday (June 13th)
Next Steps (June 13th)
- Build a new dataset of dumps of the ~4800 wikis (Salt/Nate) (May take more than a week to generate all the new dumps)
- Build a msgwall version of the build_edit_weeks file from the anon_edits paper (Nate)
- Do analysis of alt history wiki and update (Sneha)
- Create list of criteria to identify wikis we want to use in this study (Sneha)
Future Tasks
- Scrape admin and bot edits using a script from Mako