Editing Community Data Science Workshops (Fall 2015)/Day 2 projects/Wikipedia
Latest revision | Your text | ||
Line 1: | Line 1: | ||
[[File:Wikipedia.png|right|250px]] | [[File:Wikipedia.png|right|250px]] | ||
__NOTOC__ | __NOTOC__ | ||
== Building a Dataset using the Wikipedia API == | |||
In this project, we will explore a few ways to gather data using the Wikipedia API. Once we've done that, we will extend this code to create our own datasets of Wikipedia edits or other data that we might be able to use to ask and answer questions in the final session. | In this project, we will explore a few ways to gather data using the Wikipedia API. Once we've done that, we will extend this code to create our own datasets of Wikipedia edits or other data that we might be able to use to ask and answer questions in the final session. | ||
== Goals == | === Goals === | ||
* Get set up to build datasets with the Wikipedia API | * Get set up to build datasets with the Wikipedia API | ||
Line 11: | Line 12: | ||
* Create a few collections of different types of data from Wikipedia that you can do research with in the final section | * Create a few collections of different types of data from Wikipedia that you can do research with in the final section | ||
== Download and test the Wikipedia | === Download and test the Wikipedia project === | ||
If you are confused by these steps, go back and refresh your memory with the [[Community Data Science Workshops ( | If you are confused by these steps, go back and refresh your memory with the [[Community Data Science Workshops (Spring 2015)/Day 0 setup and tutorial|Day 0 setup and tutorial]] and [[Community Data Science Workshops (Spring 2015)/Day 0 tutorial|Day 0 tutorial]] | ||
(Estimated time: 10 minutes) | (Estimated time: 10 minutes) | ||
* [[Community Data Science Workshops (Spring 2015)/Day 2 Projects/Wikipedia/Windows setup|Windows]] | |||
* [[Community Data Science Workshops (Spring 2015)/Day 2 Projects/Wikipedia/OS X setup|OS X]] | |||
=== Example topics we might cover in the lecture === | |||
== Example topics we might cover in the | |||
* explain the [http://www.mediawiki.org/wiki/API:Main_page MediaWiki] API, which also exists on other wikis | * explain the [http://www.mediawiki.org/wiki/API:Main_page MediaWiki] API, which also exists on other wikis | ||
Line 57: | Line 31: | ||
* get the content of the main page http://en.wikipedia.org/w/api.php?format=json&action=query&titles=Main%20Page&prop=revisions&rvprop=content | * get the content of the main page http://en.wikipedia.org/w/api.php?format=json&action=query&titles=Main%20Page&prop=revisions&rvprop=content | ||
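The main page query above can also be assembled from Python rather than typed by hand. The following is a minimal sketch, not the workshop's own project code: the function names <code>build_query_url</code>, <code>extract_content</code>, and <code>fetch_page_content</code> are hypothetical, the <code>https</code> endpoint and the User-Agent string are assumptions, and reading the text from the <code>"*"</code> key assumes the API's older default JSON layout for this exact query.

```python
import json
from urllib.parse import quote, urlencode
from urllib.request import Request, urlopen

# Assumption: the same endpoint as the URL in the list above, but over https.
API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def build_query_url(title):
    """Build the query URL that asks for the latest revision content of a page."""
    params = {
        "format": "json",
        "action": "query",
        "titles": title,
        "prop": "revisions",
        "rvprop": "content",
    }
    # quote_via=quote encodes spaces as %20, matching the URL shown above.
    return API_ENDPOINT + "?" + urlencode(params, quote_via=quote)


def extract_content(data):
    """Pull the wikitext out of a decoded JSON response.

    Assumes the older default layout, where pages are keyed by page id and
    the revision text is stored under the "*" key.
    """
    pages = data["query"]["pages"]
    page = next(iter(pages.values()))
    return page["revisions"][0]["*"]


def fetch_page_content(title):
    """Download and decode the response (needs network access)."""
    # The User-Agent value is a placeholder chosen for this sketch.
    request = Request(
        build_query_url(title),
        headers={"User-Agent": "cdsw-example/0.1"},
    )
    with urlopen(request) as response:
        data = json.loads(response.read().decode("utf-8"))
    return extract_content(data)


if __name__ == "__main__":
    print(build_query_url("Main Page"))
```

Calling <code>fetch_page_content("Main Page")</code> should return the wikitext of the main page when a network connection is available. Building the parameters as a dictionary makes it easy to swap in a different title or <code>prop</code> value when trying other queries from the API Sandbox.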
== Resources == | === Resources === | ||
* [https://en.wikipedia.org/w/api.php?action=help&modules=query API documentation for the query module] | * [https://en.wikipedia.org/w/api.php?action=help&modules=query API documentation for the query module] | ||
* [https://en.wikipedia.org/wiki/Special:ApiSandbox API Sandbox] | * [https://en.wikipedia.org/wiki/Special:ApiSandbox API Sandbox] | ||
* [[Sample | * [[Sample API queries]] | ||
[[Category: | [[Category:Fall_2015_series]] |