CommunityData:Message Walls: Difference between revisions

From CommunityData
m (Groceryheist moved page Message Walls to CommunityData:Message Walls: This is a research project page)
(Update next steps based on new knowledge.)
Line 1: Line 1:
== Next Steps ==


* Identify list of Wikis we will analyze from the tsv file. 


== Next Steps ==
* Attempt to obtain a good dump for each of these wikis. See [[CommunityData:Wikia Dumps]] for information.


* Talk to Mako about dumps. Find a good dump. If Mako doesn't know, download a fresh dump from Wikia.  
** This may depend on mapping between the urls in the tsv file and the dumps. Consider using HTTP redirects from the url under <siteinfo>.  
   
   
* Test wikiq / pythonMediaWikiUtilities on good dump.
* Modify Wikiq to give an error message if the closing </mediawiki> tag is missing.  


* Send extracted tsv file to Sneha for preliminary analysis.
* Sneha to take a look althistory data from Nate.


* Check back next meeting Tuesday (May 30th)
* Check back next meeting Tuesday (May 30th)

Revision as of 04:29, 24 May 2017

Next Steps

  • Identify list of Wikis we will analyze from the tsv file.
    • This may depend on mapping between the urls in the tsv file and the dumps. Consider using HTTP redirects from the url under <siteinfo>.
  • Modify Wikiq to give an error message if the closing </mediawiki> tag is missing.
  • Sneha to take a look althistory data from Nate.
  • Check back next meeting Tuesday (May 30th)