Editing CommunityData:Message Walls
From CommunityData
The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.
Latest revision | Your text | ||
Line 2: | Line 2: | ||
* Notes on Wikia Dumps [[CommunityData:Wikia Dumps]] | * Notes on Wikia Dumps [[CommunityData:Wikia Dumps]] | ||
* Notes on the code | * Notes on the code [[CommunityData:Message Walls Code]] | ||
= Task Management = | = Task Management = | ||
==Future Tasks== | |||
* Scrape admin and bot edits using a script from Mako | |||
* Check that dumps, even if valid xml, have message wall data. | |||
== | == Next Steps (June 27th) == | ||
* Take a look namespaces 1200-1202 to understand what they mean. | |||
* (Sneha) create list of subsetting characteristics (inclusion criteria for Wikis) for study. | |||
* Download wikis available on Special:statistics. | |||
* Request new dumps for missing wikis. | |||
=== | ==Next Steps (June 20th)== | ||
* | * (Nate) Improve wiki list by identifying wikis that turn off the feature without turning on first (Done) | ||
* | * (Nate) Get <strike>muppet wiki</strike> Dr. Horrible Wiki edit weeks for Sneha (Done) | ||
* (Nate) Do brute force mapping using revision ids and and hashing texts (Done) | |||
* (Sneha) Will play with Dr. Horrible data (Done) | |||
* (Sneha) create list of subsetting characteristics for study | |||
== | ==Next Steps (June 13th)== | ||
* Build a new dataset of dumps of the ~4800 wikis (Salt/Nate) (May take more than a week to generate all the new dumps) | |||
* | |||
* Build a msgwall version of the build_edit_weeks file from the anon_edits paper (Nate) | |||
* | |||
* Do analysis of alt history wiki and update (Sneha) | |||
* [[/ | * Create list of criteria to identify wikis we want to use in this study (Sneha) | ||
== Next Steps (June 6th)== | |||
* Identify list of Wikis we will analyze from the tsv file. | |||
* Attempt to obtain a good dump for each of these wikis. See [[CommunityData:Wikia Dumps]] for information. | |||
** This may depend on mapping between the urls in the tsv file and the dumps. Consider using HTTP redirects from the url under <siteinfo>. | |||
* Modify Wikiq to give an error message if the closing </mediawiki> tag is missing. | |||
* Sneha to take a look althistory data from Nate. | |||
* Nate will write a version of build_edit_weeks for the message wall project | |||
* Check back next meeting Tuesday (June 13th) |