Editing Human Centered Data Science (Fall 2019)/Assignments
From CommunityData
Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.
The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.
Latest revision | Your text | ||
Line 309: | Line 309: | ||
* if a country has 10 articles about politicians, and 2 of them are FA or GA class articles, then the percentage of high-quality articles would be 20%. | * if a country has 10 articles about politicians, and 2 of them are FA or GA class articles, then the percentage of high-quality articles would be 20%. | ||
==== | ==== Tables ==== | ||
The tables should be pretty straightforward. Produce four tables that show: | |||
#10 highest-ranked countries in terms of number of politician articles as a proportion of country population | |||
#10 lowest-ranked countries in terms of number of politician articles as a proportion of country population | |||
#10 highest-ranked countries in terms of number of GA and FA-quality articles as a proportion of all articles about politicians from that country | |||
#10 lowest-ranked countries in terms of number of GA and FA-quality articles as a proportion of all articles about politicians from that country | |||
Embed them in the Jupyter notebook. | |||
Embed | |||
''Reminder:'' you will find the list of geographic regions, which countries are in each region, and total regional population in the raw <tt>WPDS_2018_data.csv</tt> file. See "Cleaning the data" above for more information. | ''Reminder:'' you will find the list of geographic regions, which countries are in each region, and total regional population in the raw <tt>WPDS_2018_data.csv</tt> file. See "Cleaning the data" above for more information. |