Editing Community Data Science Course (Spring 2023)
From CommunityData
The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.
Latest revision | Your text | ||
Line 4: | Line 4: | ||
:'''Instructors:''' [https://mako.cc/academic/ Benjamin Mako Hill] / [mailto:makohill@uw.edu makohill@uw.edu] and [https://kayleachampion.com/ Kaylea Champion] / she / [mailto:kaylea@uw.edu kaylea@uw.edu] (TA: you can call me Kaylea) | :'''Instructors:''' [https://mako.cc/academic/ Benjamin Mako Hill] / [mailto:makohill@uw.edu makohill@uw.edu] and [https://kayleachampion.com/ Kaylea Champion] / she / [mailto:kaylea@uw.edu kaylea@uw.edu] (TA: you can call me Kaylea) | ||
:'''Course Website''': We will use Canvas for [https://canvas.uw.edu/courses/1633288/assignments assignments] and [https://canvas.uw.edu/courses/1633288/announcements announcements]. Everything else will be linked on this page. | :'''Course Website''': We will use Canvas for [https://canvas.uw.edu/courses/1633288/assignments assignments] and [https://canvas.uw.edu/courses/1633288/announcements announcements]. Everything else will be linked on this page. | ||
:'''Course Playlist''': [https://open.spotify.com/playlist/4iM6wz4GYRX7O2S7Z2oyXj?si=638c49b13277429e&pt=3f46c0ca5049a1ced44ae09173e28d7b Feel free to add what's moving you lately] | :'''Course Playlist''': [https://open.spotify.com/playlist/4iM6wz4GYRX7O2S7Z2oyXj?si=638c49b13277429e&pt=3f46c0ca5049a1ced44ae09173e28d7b Feel free to add what's moving you lately] | ||
:Course Catalog Description (from '''Communication Leadership):''' | :Course Catalog Description (from '''Communication Leadership):''' | ||
Line 58: | Line 58: | ||
=== Microsoft Teams === | === Microsoft Teams === | ||
We'll be using Microsoft Teams for class chat. | We'll be using Microsoft Teams for class chat. More details are {{forthcoming}} soon! | ||
=== Open Labs === | === Open Labs === | ||
Line 71: | Line 71: | ||
=== Office Hours === | === Office Hours === | ||
Mako will not be holding additional office hours but is generally available. You can set up a 30 minute minute meeting by adding something to [https://mako.cc/calendar/ his online calendar] which has a meeting signup link attached to it. | Mako will not be holding additional office hours but is generally available. You can set up a 30 minute minute meeting by adding something to [https://mako.cc/calendar/ his online calendar] which has a meeting signup link attached to it. | ||
== Assignments == | == Assignments == | ||
Line 115: | Line 113: | ||
Your proposal should frame your final analysis, but it's also a chance to "sanity check" your plan. I will give you feedback on these proposals and suggest changes or modifications that are more likely to make them successful or compelling. I will also work with you to make sure that you have the resources and support necessary to carry out your project successfully. | Your proposal should frame your final analysis, but it's also a chance to "sanity check" your plan. I will give you feedback on these proposals and suggest changes or modifications that are more likely to make them successful or compelling. I will also work with you to make sure that you have the resources and support necessary to carry out your project successfully. | ||
Be as specific as possible about the data available on the sources you've chosen. I expect that you will have written at least some of the final code that you will use in this course. | Be as specific as possible about the data available on the sources you've chosen. I expect that you will have written at least some of the final code that you will use in this course. Identify the documentation and the API endpoints where required. If there are libraries that you think may help with access, note them. | ||
=== Final Project === | === Final Project === | ||
Line 123: | Line 119: | ||
For your final project, I expect you to build on the first two assignments to describe what they have done and what you have found. I'll expect every student to give both: | For your final project, I expect you to build on the first two assignments to describe what they have done and what you have found. I'll expect every student to give both: | ||
# A short presentation ( | # A short presentation (7 minutes) | ||
# A final report that is not more than 4500 words (~18 pages) | # A final report that is not more than 4500 words (~18 pages) | ||
Line 132: | Line 128: | ||
==== Final Presentation ==== | ==== Final Presentation ==== | ||
:'''Presentation Date:''' At some point during the beginning of the last week of the quarter (the week of May 29). Details will be {{forthcoming}}. | |||
:'''Turning in:''' Details are {{tentative}} but we will very likely ask you upload a video to Canvas and will provide information and links on how to do this. | |||
Your presentation should provide the teaching team and your classmates with a very clear idea of what to expect in your final paper. However, don't treat it as a comprehensive overview of your paper: I would rather you tell a subset of the story well than the whole story in a rushed fashion. For instance, you can give a completely successful presentation by describing the motivation and walking through one plot in your paper. I'm going to give you all at least a paragraph of feedback after your talk. This will be an opportunity for me to see a preview of your paper and give you a sense for what I think you can improve. It's to your advantage to both give a compelling talk and to give me a sense for your project. | |||
The current plan is: | |||
;Length: All presentations will need to be '''a maximum of | ;Length: All presentations will need to be '''a maximum of 7 minutes long'''. Do not exceed 7 minutes and 0 seconds; don't edit your video to speed it up if you go too long. Concisely communicating an idea in the time allotted is an important skill in its own right. | ||
;Slides: You are encouraged to use slides for your talk. Please keep in mind that your slides are meant to be additive, not a teleprompter. | ;Slides: You are encouraged to use slides for your talk. Please keep in mind that your slides are meant to be additive, not a teleprompter. | ||
; | ;Modality: We will not meet during the final week due to the Memorial Day holiday. As a result, we plan to conduct these virtually. | ||
==== Final Paper (and Code!) ==== | ==== Final Paper (and Code!) ==== | ||
:'''Due Date:''' Thursday June 8 at 11:59 p.m. | :'''Due Date:''' Thursday June 8 at 11:59 p.m. | ||
:'''Turning in:''' Turn in a PDF of your paper to [https://canvas.uw.edu/courses/1633288/assignments/8231894 the Canvas dropbox]; | :'''Turning in:''' Turn in a PDF of your paper to [https://canvas.uw.edu/courses/1633288/assignments/8231894 the Canvas dropbox]; Details on how to turn in your code will be {{forthcoming}}. | ||
Your final project should include detailed information on: | Your final project should include detailed information on: | ||
Line 177: | Line 158: | ||
Keep in mind that most stories on Five Thirty Eight are under 1000 words and I'm giving up to 4,500 words to show me what you've learned. As a result, you should do ''more'' than FiveThirtyEight does in a single story. You can ask and answer more questions, you can provide more background, context, and justification, you can provide more details on your methods and data sources, you can show us more graphs, you can discuss the implications of your findings more. Use the space I've given you to show off what you've done and what you've learned! | Keep in mind that most stories on Five Thirty Eight are under 1000 words and I'm giving up to 4,500 words to show me what you've learned. As a result, you should do ''more'' than FiveThirtyEight does in a single story. You can ask and answer more questions, you can provide more background, context, and justification, you can provide more details on your methods and data sources, you can show us more graphs, you can discuss the implications of your findings more. Use the space I've given you to show off what you've done and what you've learned! | ||
Finally, you should also share with me the full Python source code you used to collect the data as well as the data set itself. Your code | Finally, you should also share with me the full Python source code you used to collect the data as well as the data set itself. Your code along will not form a large portion of your final grade. Rather, I will focus on the degree to which you have been successful at answering the ''substantive'' questions you have identified. | ||
Visualization is critical to storytelling, so 25% of your grade for this project will be determined by the visualizations and tables in your report. Good visualizations should "stand alone" and motivate the core results in your paper all by themselves. A good question to keep in mind is "could I tell this story with the visualizations and a tweet?" | Visualization is critical to storytelling, so 25% of your grade for this project will be determined by the visualizations and tables in your report. Good visualizations should "stand alone" and motivate the core results in your paper all by themselves. A good question to keep in mind is "could I tell this story with the visualizations and a tweet?" | ||
Line 188: | Line 169: | ||
== Grades == | == Grades == | ||
Assignments will accrue to your final grade in the following way: | Assignments will accrue to your final grade in the following way: | ||
Line 228: | Line 207: | ||
* Be ready to start on Week 1 coding challenges. If all goes well, you'll start on them! | * Be ready to start on Week 1 coding challenges. If all goes well, you'll start on them! | ||
''' | '''Problem set answers:''' | ||
* [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=f85d134d-fc3a-4cc3-b3b5-afda01621fe7 Video from class walking through the results] | * [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=f85d134d-fc3a-4cc3-b3b5-afda01621fe7 Video from class walking through the results] | ||
* [https://github.com/kayleachampion/spr23_CDSW/blob/main/curriculum/week1/week1-programming_challenge_examples.ipynb Jupyter notebook with Kaylea's assembled list of examples from you along with notes] | * [https://github.com/kayleachampion/spr23_CDSW/blob/main/curriculum/week1/week1-programming_challenge_examples.ipynb Jupyter notebook with Kaylea's assembled list of examples from you along with notes] | ||
* [https://github.com/kayleachampion/spr23_CDSW/blob/main/week1-homework/hill-week_01_coding_challenges-20230403.ipynb Jupyter notebook with Mako's answers to the | * [https://github.com/kayleachampion/spr23_CDSW/blob/main/week1-homework/hill-week_01_coding_challenges-20230403.ipynb Jupyter notebook with Mako's answers to the problem set] | ||
=== Week 2: April 3 === | === Week 2: April 3 === | ||
Line 240: | Line 219: | ||
'''Class session:''' | '''Class session:''' | ||
* [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=f85d134d-fc3a-4cc3-b3b5-afda01621fe7 Video of announcements and week 1 | * [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=f85d134d-fc3a-4cc3-b3b5-afda01621fe7 Video of announcements and week 1 problem sets] [Requires Canvas Access] | ||
* [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=47a9e5d4-2998-4943-a047-afda01621fe4 Video of week 2 lecture] [Requires Canvas Access] | * [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=47a9e5d4-2998-4943-a047-afda01621fe4 Video of week 2 lecture] [Requires Canvas Access] | ||
* [[/Week 2 lecture notes]] | * [[/Week 2 lecture notes]] | ||
'''Resources:''' | '''Resources:''' | ||
Line 251: | Line 229: | ||
** [https://www.py4e.com/html3/03-conditional Chapter 3, Conditionals] | ** [https://www.py4e.com/html3/03-conditional Chapter 3, Conditionals] | ||
* [[Python data types cheat sheet]] | * [[Python data types cheat sheet]] | ||
* [[Python loops cheat sheet]] | |||
'''Assignment Due:''' | '''Assignment Due:''' | ||
Line 274: | Line 253: | ||
=== Week 3: April 10 === | === Week 3: April 10 === | ||
Today we'll be doing introducing some additional programming concepts in Python including aggregating and counting with dictionaries | Today we'll be doing introducing some additional programming concepts in Python including aggregating and counting with dictionaries. | ||
'''Resources:''' | '''Resources:''' | ||
Line 289: | Line 261: | ||
** [https://www.py4e.com/html3/06-strings Chapter 6, Strings] | ** [https://www.py4e.com/html3/06-strings Chapter 6, Strings] | ||
** [https://www.py4e.com/html3/08-lists Chapter 8, Lists]: You can skip the part on "Parsing lines" | ** [https://www.py4e.com/html3/08-lists Chapter 8, Lists]: You can skip the part on "Parsing lines" | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
Line 298: | Line 269: | ||
'''Class schedule:''' | '''Class schedule:''' | ||
* Go over last week's assignments | * Go over last week's assignments | ||
* [[/Week 3 lecture notes]] introducing new material on dictionaries and aggregations plus average, | * [[/Week 3 lecture notes]] <!-- [[Community_Data_Science_Course_%28Spring_2019%29/Day_2_Lecture|Lecture notes]] --> introducing new material on dictionaries and aggregations plus average, median (using the Wordplay data) | ||
* | * A break | ||
* Introduce the [[/Week 3 coding challenges]] based on the [ | * Introduce the [[/Week 3 coding challenges]] based on the [http://mako.cc/teaching/2015/cdsw-autumn/babynames.zip Baby names] project. <!-- [[Community Data Science Course (Spring 2019)/Day 3 Coding Challenges|Day 3 Coding Challenges]] --> | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
Line 315: | Line 285: | ||
=== Week 4: April 17 === | === Week 4: April 17 === | ||
Today we'll be learning | Today we'll be using Python to read and write files from disk and be learning to do some basic tricks with a very useful Python module called Pandas. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Read chapters 8 and 10 of '''Python for Informatics''': | |||
** Dictionaries | |||
** Files | |||
* Finish and turn in your [[/Week 3 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 3 coding challenges]] (Due Sunday before class) | ||
'''Class schedule:''' | '''Class schedule:''' | ||
* Go over the [[/Week 3 coding challenges]] | * Go over the [[/Week 3 coding challenges]] | ||
* [[/Week 4 lecture | * [[/Week 4 lecture]] introducing new material, plus building some basic histograms in Python. | ||
* Introduce the [[/Week 4 coding challenges]] | * Introduce the [[/Week 4 coding challenges]], which will also be based on the [http://mako.cc/teaching/2015/cdsw-autumn/babynames.zip Baby names] project. | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
<!-- * [[/Day 4 coding challenges]] [[Community Data Science Course (Spring 2019)/Day 4 Coding Challenges|Day 4 Coding Challenges]] --> | |||
=== Week 5: April 24 === | === Week 5: April 24 === | ||
Today we'll be | Today we'll be learning about the basic of web APIs and JSON. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Readings {{tbd}} | |||
* Finish and turn in your [[/Week 4 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 4 coding challenges]] (Due Sunday before class) | ||
'''Class schedule:''' | '''Class schedule:''' | ||
* Go over the [[/Week 4 coding challenges]] | * Go over the [[/Week 4 coding challenges]] and review some histograms | ||
* [[/Week 5 lecture | * [[/Week 5 lecture]] introducing new material on APIs and JSON <!-- [[Community Data Science Course (Spring 2019)/Day 5 Notes|Day 5 Notes]] --> | ||
* Introduce the [[/Week 5 coding challenges]] | * Introduce the [[/Week 5 coding challenges]] <!-- [[Community Data Science Course (Spring 2019)/Day 5 Coding Challenges|Day 5 Coding Challenges]] --> | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
=== Week 6: May 1 === | === Week 6: May 1 === | ||
Today we'll be putting everything together and walking through a project that builds a dataset from the Wikipedia API from start-to-finish, with a focus on cleaning and getting data into tools like Excel or Google Sheets, and introducing a set of tricks along the way. | Today we'll be putting everything together and walking through a project that builds a dataset from the Wikipedia API from start-to-finish, with a focus on cleaning and getting data into tools like Excel or Google Sheets, and introducing a set of tricks along the way. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Readings {{tbd}} | |||
* Finish and turn in your [[/Week 5 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 5 coding challenges]] (Due Sunday before class) | ||
Line 396: | Line 333: | ||
* Let's discuss remaining schedule | * Let's discuss remaining schedule | ||
* Go over the [[/Week 5 coding challenges]] | * Go over the [[/Week 5 coding challenges]] | ||
* [[/Week 6 lecture | * [[/Week 6 lecture]] introducing new material on APIs and JSON <!-- [[Community Data Science Course (Spring 2019)/Day 5 Notes|Day 5 Notes]] --><!-- [[Community Data Science Course (Sprint 2019)/Day 7 Notes|Day 7 Notes]] --> | ||
* Introduce the [[/Week 6 coding challenges]] | * Introduce the [[/Week 6 coding challenges]] <!-- [[Community Data Science Course (Spring 2019)/Day 7 Coding Challenges|Day 7 Coding Challenges]] --> | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
<!-- | <!-- | ||
Line 410: | Line 347: | ||
=== Week 7: May 8 === | === Week 7: May 8 === | ||
Today we'll | Today we'll be introducing two additional web APIs in parallel. We'll run a poll of the class to get a sense of what folks want to learn but we're considering Yelp, Reddit, Twitter, and data.seattle.gov. | ||
' | |||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Readings {{tbd}} | |||
* Finish and turn in your [[/Week 6 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 6 coding challenges]] (Due Sunday before class) | ||
Line 429: | Line 357: | ||
* Go over the [[/Week 6 coding challenges]] | * Go over the [[/Week 6 coding challenges]] | ||
* [[/Week 7 lecture | * [[/Week 7 lecture]] introducing new material on new APIs | ||
* Introduce the [[/Week 7 coding challenges]] | * Introduce the [[/Week 7 coding challenges]] | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
Line 435: | Line 363: | ||
=== Week 8: May 15 === | === Week 8: May 15 === | ||
We'll | Today we'll be introducing two additional web APIs in parallel. We'll run a poll of the class to get a sense of what folks want to learn but we're considering Yelp, Reddit, Twitter, and data.seattle.gov. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* [[#Final Project Proposal]] | * [[#Final Project Proposal]] | ||
* Readings {{tbd}} | |||
* Finish and turn in your [[/Week 7 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 7 coding challenges]] (Due Sunday before class) | ||
Line 457: | Line 375: | ||
* Let's discuss remaining schedule | * Let's discuss remaining schedule | ||
* Go over the [[/Week 7 coding challenges]] | * Go over the [[/Week 7 coding challenges]] | ||
* | * [[/Week 8 lecture]] | ||
* Introduce the [[/Week 8 coding challenges]] | |||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
Line 466: | Line 385: | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* | * Readings {{tbd}} | ||
* Finish and turn in your [[/Week 8 coding challenges]] (Due Sunday before class) | |||
'''Class schedule:''' | '''Class schedule:''' | ||
* Project time—We'll | * Let's discuss remaining schedule | ||
* | * Go over the [[/Week 8 coding challenges]] | ||
* [[/Week 9 lecture]] | |||
* Project time—We'll work through projects | |||
'''Resources:''' | |||
* Visualization dos and don'ts. We'll discuss the European Environmental Agency's [https://www.eea.europa.eu/data-and-maps/daviz/learn-more/chart-dos-and-donts list of advice for making charts]. **I will refer to this guide as a grade your final projects.** | |||
* Two options for remainder of class. You can work through this introductory guide to visualization in python or you can work on your final project. I'll be here to answer any questions. | |||
* Self-guided visualization tutorial in python. [https://raw.githubusercontent.com/guyrt/teaching/master/2019/Com520B/VisualizationNotebook.ipynb Download here]. Save the file in a new directory in your desktop and open it with Jupyter Notebook. (If you are on Windows, you may run into an issue with missing path variables. [https://stackoverflow.com/questions/52821162/jupyter-notebook-failed-to-load-dll This SO post helped me solve it.]) | |||
=== Week 10: May 29 (NO MEETING) === | === Week 10: May 29 (NO MEETING) === |