Community Data Science Course (Spring 2015)/Day 4 Coding Challenges

Each of the challenges this week will ask you to modify and work with code in the Wikipedia API projects which you should have installed and begun working with in class.

As always, it's not essential that you solve or get through all of these; I'm not grading your answers. That said, being able to work through most of them is a good sign that you have mastered the concepts for the week. It is always fine to collaborate or work together on these problem sets. The only thing I ask is that you do not broadcast answers on Canvas before Sunday at midnight.

Challenges

  1. Save the revision metadata printed in wikipedia1-2.py to a file called "wikipedia_revisions.tsv".
  2. In addition to revision ids, print out the edit summary (i.e., comment) of each revision for the article on Python.
  3. Which article is in more categories? "Python (programming language)" or "R (programming language)"?
  4. How many revisions to the article on "Python (programming language)" were made by user "Peterl"? How about "Hfastedge"?
  5. How many edits to the article "Python (programming language)" were made in 2014?
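As a starting point for challenges 1, 4, and 5, here is a minimal sketch of how a list of revisions could be saved and counted once you have it. It assumes each revision is a dictionary with "revid", "timestamp", "user", and "comment" keys, which is the shape the API returns when you request those properties; the helper names and the sample data are made up for illustration and are not taken from the class code.

```python
import csv
from collections import Counter


def write_revisions_tsv(revisions, path):
    """Write revision dictionaries to a tab-separated file with a header row."""
    fields = ["revid", "timestamp", "user", "comment"]
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f, delimiter="\t")
        writer.writerow(fields)
        for rev in revisions:
            # .get() covers revisions where a field is hidden or missing
            writer.writerow([rev.get(field, "") for field in fields])


def count_by_user(revisions):
    """Return a Counter mapping each username to its number of revisions."""
    return Counter(rev.get("user", "") for rev in revisions)


def count_in_year(revisions, year):
    """Count revisions whose timestamp (e.g. 2014-03-01T12:00:00Z) falls in year."""
    prefix = str(year) + "-"
    return sum(1 for rev in revisions
               if rev.get("timestamp", "").startswith(prefix))


# Made-up sample data in the assumed shape (not real Wikipedia revisions)
SAMPLE_REVISIONS = [
    {"revid": 1, "timestamp": "2013-12-31T23:59:00Z",
     "user": "ExampleUserA", "comment": "first edit"},
    {"revid": 2, "timestamp": "2014-01-01T00:01:00Z",
     "user": "ExampleUserB", "comment": "fix typo"},
    {"revid": 3, "timestamp": "2014-06-15T10:30:00Z",
     "user": "ExampleUserA", "comment": "add section"},
]

print(count_by_user(SAMPLE_REVISIONS)["ExampleUserA"])
print(count_in_year(SAMPLE_REVISIONS, 2014))
```

To use this on real data, you would replace SAMPLE_REVISIONS with the revisions you collect from the API and call write_revisions_tsv(revisions, "wikipedia_revisions.tsv").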
Here's a much more complicated challenge, but a fun one that you know enough to solve:
Check out the game Catfishing, which shows you categories and has you guess an article. Write a version that uses the Wikipedia API. For example, pick 5 articles and write a program that randomly shows the categories for one of those articles and asks you to guess the article. Read the guess with input() and let the user know if they got it right or wrong!
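One possible shape for such a game is sketched below; treat it as a rough outline rather than the intended solution. It uses only the standard library instead of the class code, the function names and the User-Agent string are made up for this sketch, and it asks the API for the categories of a page with action=query and prop=categories. The fetch and ask parameters are there only so the game logic can be tried without a network connection or a keyboard.

```python
import json
import random
import urllib.parse
import urllib.request

API_URL = "https://en.wikipedia.org/w/api.php"


def fetch_categories(title):
    """Return the category titles for one article (first batch only)."""
    params = {
        "action": "query",
        "prop": "categories",
        "titles": title,
        "cllimit": "max",
        "format": "json",
    }
    url = API_URL + "?" + urllib.parse.urlencode(params)
    # Placeholder User-Agent for this sketch; put your own contact info here
    request = urllib.request.Request(
        url, headers={"User-Agent": "catfishing-exercise/0.1 (course exercise)"})
    with urllib.request.urlopen(request) as response:
        data = json.load(response)
    categories = []
    for page in data["query"]["pages"].values():
        for category in page.get("categories", []):
            categories.append(category["title"])
    return categories


def is_correct(guess, title):
    """Compare the guess to the title, ignoring case and outer whitespace."""
    return guess.strip().lower() == title.lower()


def play_round(titles, fetch=fetch_categories, ask=input, rng=random):
    """Show the categories of a randomly chosen article and check one guess."""
    title = rng.choice(titles)
    for category in fetch(title):
        print(category)
    guess = ask("Which article is this? ")
    correct = is_correct(guess, title)
    print("Right!" if correct else "Wrong, it was " + title)
    return correct
```

Calling play_round with a list of five article titles of your choosing plays a single round. Note that some categories contain the article's own name, so you may want to filter those out before printing.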