DS4UX (Spring 2016)/Day 5 coding challenge: Difference between revisions

From CommunityData
No edit summary
No edit summary
Line 18: Line 18:
<div style="clear:both;"></div>
<div style="clear:both;"></div>


Each of the challenges this week will ask you to modify and work with code in the [[Community Data Science Course (Spring 2015)/Wikipedia API projects|Wikipedia API projects]] which you should have installed and begun working with in class.
Each of the challenges this week will ask you to modify and work with code in the [[DS4UX (Spring 2016)/Wikipedia API|Wikipedia API projects]] which you should have installed and begun working with in class.


As always, it's not essential that you solve or get through all of these — I'm not grading your answers on these. That said, being able to work through at least many of them is a good sign that you have mastered the concepts for the week. It is always fine to collaborate or work together on these problem sets. The only thing I ask is that you do not broadcast answers before Sunday at midnight on Canvas.
As always, it's not essential that you solve or get through all of these — I'm not grading your answers on these. That said, being able to work through at least many of them is a good sign that you have mastered the concepts for the week. It is always fine to collaborate or work together on these problem sets. The only thing I ask is that you do not broadcast answers before Sunday at midnight on Canvas.

Revision as of 02:56, 25 April 2016

This page is a work in progress.

Each of the challenges this week will ask you to modify and work with code in the Wikipedia API projects which you should have installed and begun working with in class.

As always, it's not essential that you solve or get through all of these — I'm not grading your answers on these. That said, being able to work through at least many of them is a good sign that you have mastered the concepts for the week. It is always fine to collaborate or work together on these problem sets. The only thing I ask is that you do not broadcast answers before Sunday at midnight on Canvas.

Challenges

  1. Save the revision metadata printed in wikipedia1-2.py (i.e., the material already being printed out) to a file called "wikipedia_revisions.tsv".
  2. Print out the revision ids and edit summaries (i.e., comment) of each revision for the article on Python.
  3. Find out what other data or metadata you can print out for a revision for an article.
  4. Which article is in more categories? Python (programming language) or Python (genus)?
  5. Find out how many revisions to the article on "Python (programming language)" were made by user "Peterl"? How about "Hfastedge"?
  6. How would you use the API to find out how many revisions/edits the user "Jtmorgan" has made to Wikipedia?
  7. Can you build a list of all of the articles edited by "Jtmorgan"? What is the article with the longest title that user Jtmorgan has edited?
  8. How many edits to the article "Python (programming language)" where made in 2015?