Editing Data Into Insights (Spring 2021)/Final project

From CommunityData

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 6: Line 6:
You all have different interests and I want you to be able to tell a story about something that interests you. So, the first step is to '''identify a dataset that you want to work on and to brainstorm at least three questions that you would like to explore''' in the dataset.
You all have different interests and I want you to be able to tell a story about something that interests you. So, the first step is to '''identify a dataset that you want to work on and to brainstorm at least three questions that you would like to explore''' in the dataset.


I strongly recommend that you find a dataset that is already organized and fairly well-cleaned. In other words, I don't recommend doing something that will require merging data from multiple datasets, gathering data from APIs, etc. (although talk to me if you have a great idea that you think you can pull off!). In general, you should be looking for data that's in CSV format (there are some even better formats like `.feather` or `.RData` but CSV is usually as good as it gets).
I strongly recommend that you find a dataset that is already organized and fairly well-cleaned. In other words, I don't recommend doing something that will require merging data from multiple datasets, gathering data from APIs, etc. (although talk to me if you have a great idea that you think you can pull off!)


=== Datasets ===
=== Datasets ===


There are lots of places to find data. Here are a few lists but feel free to identify your own:
There are lots of places to find data. Here are a few lists:


* [https://datavizm20.classes.andrewheiss.com/resource/data/ Andrew Heiss's list of lists of data]
* [https://datavizm20.classes.andrewheiss.com/resource/data/ Andrew Heiss's list of lists of data]
Line 16: Line 16:
* [https://www.gapminder.org/data/ Gapminder world development data]
* [https://www.gapminder.org/data/ Gapminder world development data]
* [http://www.gss.norc.org/get-the-data General Social Survey data]
* [http://www.gss.norc.org/get-the-data General Social Survey data]
* [https://www.pewresearch.org/download-datasets/ Pew Research Center data]
* [https://www.pewresearch.org/download-datasets/ Pew Rsearch Center data]
* [https://www.kaggle.com/datasets Kaggle Datasets]


Due date: March 23
Due date: March 23
Line 26: Line 25:


Write a short proposal (~1 page) explaining what you have done, why you think it is interesting, and your plan for analysis moving forward.
Write a short proposal (~1 page) explaining what you have done, why you think it is interesting, and your plan for analysis moving forward.
Due date: April 15


== Step 3: Write a rough draft ==
== Step 3: Write a rough draft ==
Line 36: Line 33:


Note that at this point it is not unlikely that you will learn that your original story is not true. In that case, tell us the story of why it isn't true!
Note that at this point it is not unlikely that you will learn that your original story is not true. In that case, tell us the story of why it isn't true!


=== Deliverables ===
=== Deliverables ===


For this project, I want you to be able to practice crafting both of the major kinds of data stories: static reports and presentations. Sometimes, these can be quite similar, and can use the same figures to tell the same story in the same way. On the other hand, you may also want to take advantages of different affordances that each medium offers. For example, in a static report much of the narrative has to occur in the text, while presentations can do things like build up a visualization one piece at a time.
There are two deliverables for this data story:
 
# An R Markdown presentation with ~3,000 words of text, knitted into a web page
There are two deliverables for your rough draft and your final project:
# An R Markdown report with ~3,000 words of text, knitted into a Word Doc file
# An ~8 minute recorded slide presentation. This can also use R Markdown if you want, or you can use something like Google Slides, Power Point, etc.
# An ~8 minute recorded slide presentation. This can also use R Markdown if you want, or you can use something like Google Slides, Power Point, etc.


Each deliverable should include 3-5 visualizations created by you, each of which helps to tell a story and make an argument.
These presentations should include 3-5 visualizations created by you, each of which helps to tell a story and make an argument.


You will create a rough draft of each of these deliverables and will be randomly assigned to a partner. On April 29, we will use our Thursday sessions to provide feedback.  
You will create a rough draft of each of these deliverables and will be randomly assigned to a partner. On April 29, we will use our Thursday sessions to provide feedback.  
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see CommunityData:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)