Editing Community Data Science Course (Spring 2023)
From CommunityData
Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.
The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.
Latest revision | Your text | ||
Line 4: | Line 4: | ||
:'''Instructors:''' [https://mako.cc/academic/ Benjamin Mako Hill] / [mailto:makohill@uw.edu makohill@uw.edu] and [https://kayleachampion.com/ Kaylea Champion] / she / [mailto:kaylea@uw.edu kaylea@uw.edu] (TA: you can call me Kaylea) | :'''Instructors:''' [https://mako.cc/academic/ Benjamin Mako Hill] / [mailto:makohill@uw.edu makohill@uw.edu] and [https://kayleachampion.com/ Kaylea Champion] / she / [mailto:kaylea@uw.edu kaylea@uw.edu] (TA: you can call me Kaylea) | ||
:'''Course Website''': We will use Canvas for [https://canvas.uw.edu/courses/1633288/assignments assignments] and [https://canvas.uw.edu/courses/1633288/announcements announcements]. Everything else will be linked on this page. | :'''Course Website''': We will use Canvas for [https://canvas.uw.edu/courses/1633288/assignments assignments] and [https://canvas.uw.edu/courses/1633288/announcements announcements]. Everything else will be linked on this page. | ||
:'''Course Playlist''': [https://open.spotify.com/playlist/4iM6wz4GYRX7O2S7Z2oyXj?si= | :'''Course Playlist''': [https://open.spotify.com/playlist/4iM6wz4GYRX7O2S7Z2oyXj?si=61e66a02b303488b&pt=ba5cc2809c1035b38d22753143c0fb67 Feel free to add what's moving you lately] | ||
:Course Catalog Description (from '''Communication Leadership):''' | :Course Catalog Description (from '''Communication Leadership):''' | ||
Line 55: | Line 55: | ||
The best way to get in touch with the teaching team will either be to ask questions in the Teams server via of the public text channels. This is preferable because any questions you have can be answered in a way that is visible to others in the class. Also, it's quite possible that others in the class can you help you out! | The best way to get in touch with the teaching team will either be to ask questions in the Teams server via of the public text channels. This is preferable because any questions you have can be answered in a way that is visible to others in the class. Also, it's quite possible that others in the class can you help you out! | ||
Th | |||
=== Microsoft Teams === | === Microsoft Teams === | ||
We'll be using Microsoft Teams for class chat. | We'll be using Microsoft Teams for class chat. More details are {{forthcoming}} soon! | ||
=== Open Labs === | === Open Labs === | ||
Line 64: | Line 66: | ||
We will be running three sets of open lab spaces. This is a great time to ask a question, or just show up and work on your programming challenges with others in the group. Those times will be: | We will be running three sets of open lab spaces. This is a great time to ask a question, or just show up and work on your programming challenges with others in the group. Those times will be: | ||
* '''Tuesday | * '''Tuesday 6-7pm'' virtually in Microsoft Teams with Kaylea | ||
* '''Wednesdays 5- | * '''Wednesdays 5:30-730pm''' in Communications (CMU) 242 with Mako | ||
* '''Thursday | * '''Thursday 6-7pm'' virtually in Microsoft Teams with Kaylea | ||
* '''Friday 12:30-2pm | * '''Friday 12:30-2pm in the CSE Building with Josh in a room that is {{tbd}} | ||
=== Office Hours === | === Office Hours === | ||
Mako will not be holding additional office hours but is generally available. You can set up a 30 minute minute meeting by adding something to [https://mako.cc/calendar/ his online calendar] which has a meeting signup link attached to it. | Mako will not be holding additional office hours but is generally available. You can set up a 30 minute minute meeting by adding something to [https://mako.cc/calendar/ his online calendar] which has a meeting signup link attached to it. | ||
== Assignments == | == Assignments == | ||
Line 115: | Line 115: | ||
Your proposal should frame your final analysis, but it's also a chance to "sanity check" your plan. I will give you feedback on these proposals and suggest changes or modifications that are more likely to make them successful or compelling. I will also work with you to make sure that you have the resources and support necessary to carry out your project successfully. | Your proposal should frame your final analysis, but it's also a chance to "sanity check" your plan. I will give you feedback on these proposals and suggest changes or modifications that are more likely to make them successful or compelling. I will also work with you to make sure that you have the resources and support necessary to carry out your project successfully. | ||
Be as specific as possible about the data available on the sources you've chosen. I expect that you will have written at least some of the final code that you will use in this course. | Be as specific as possible about the data available on the sources you've chosen. I expect that you will have written at least some of the final code that you will use in this course. Identify the documentation and the API endpoints where required. If there are libraries that you think may help with access, note them. | ||
=== Final Project === | === Final Project === | ||
Line 123: | Line 121: | ||
For your final project, I expect you to build on the first two assignments to describe what they have done and what you have found. I'll expect every student to give both: | For your final project, I expect you to build on the first two assignments to describe what they have done and what you have found. I'll expect every student to give both: | ||
# A short presentation ( | # A short presentation (7 minutes) | ||
# A final report that is not more than 4500 words (~18 pages) | # A final report that is not more than 4500 words (~18 pages) | ||
Line 132: | Line 130: | ||
==== Final Presentation ==== | ==== Final Presentation ==== | ||
:'''Presentation Date:''' At some point during the beginning of the last week of the quarter (the week of May 29). Details will be {{forthcoming}}. | |||
:'''Turning in:''' Details are {{tentative}} but we will very likely ask you upload a video to Canvas and will provide information and links on how to do this. | |||
Your presentation should provide the teaching team and your classmates with a very clear idea of what to expect in your final paper. However, don't treat it as a comprehensive overview of your paper: I would rather you tell a subset of the story well than the whole story in a rushed fashion. For instance, you can give a completely successful presentation by describing the motivation and walking through one plot in your paper. I'm going to give you all at least a paragraph of feedback after your talk. This will be an opportunity for me to see a preview of your paper and give you a sense for what I think you can improve. It's to your advantage to both give a compelling talk and to give me a sense for your project. | |||
The current plan is: | |||
;Length: All presentations will need to be '''a maximum of | ;Length: All presentations will need to be '''a maximum of 7 minutes long'''. Do not exceed 7 minutes and 0 seconds; don't edit your video to speed it up if you go too long. Concisely communicating an idea in the time allotted is an important skill in its own right. | ||
;Slides: You are encouraged to use slides for your talk. Please keep in mind that your slides are meant to be additive, not a teleprompter. | ;Slides: You are encouraged to use slides for your talk. Please keep in mind that your slides are meant to be additive, not a teleprompter. | ||
; | ;Modality: We will not meet during the final week due to the Memorial Day holiday. As a result, we plan to conduct these virtually. | ||
==== Final Paper (and Code!) ==== | ==== Final Paper (and Code!) ==== | ||
:'''Due Date:''' Thursday June 8 at 11:59 p.m. | :'''Due Date:''' Thursday June 8 at 11:59 p.m. | ||
:'''Turning in:''' Turn in a PDF of your paper to [https://canvas.uw.edu/courses/1633288/assignments/8231894 the Canvas dropbox]; | :'''Turning in:''' Turn in a PDF of your paper to [https://canvas.uw.edu/courses/1633288/assignments/8231894 the Canvas dropbox]; Details on how to turn in your code will be {{forthcoming}}. | ||
Your final project should include detailed information on: | Your final project should include detailed information on: | ||
Line 177: | Line 160: | ||
Keep in mind that most stories on Five Thirty Eight are under 1000 words and I'm giving up to 4,500 words to show me what you've learned. As a result, you should do ''more'' than FiveThirtyEight does in a single story. You can ask and answer more questions, you can provide more background, context, and justification, you can provide more details on your methods and data sources, you can show us more graphs, you can discuss the implications of your findings more. Use the space I've given you to show off what you've done and what you've learned! | Keep in mind that most stories on Five Thirty Eight are under 1000 words and I'm giving up to 4,500 words to show me what you've learned. As a result, you should do ''more'' than FiveThirtyEight does in a single story. You can ask and answer more questions, you can provide more background, context, and justification, you can provide more details on your methods and data sources, you can show us more graphs, you can discuss the implications of your findings more. Use the space I've given you to show off what you've done and what you've learned! | ||
Finally, you should also share with me the full Python source code you used to collect the data as well as the data set itself. Your code | Finally, you should also share with me the full Python source code you used to collect the data as well as the data set itself. Your code along will not form a large portion of your final grade. Rather, I will focus on the degree to which you have been successful at answering the ''substantive'' questions you have identified. | ||
Visualization is critical to storytelling, so 25% of your grade for this project will be determined by the visualizations and tables in your report. Good visualizations should "stand alone" and motivate the core results in your paper all by themselves. A good question to keep in mind is "could I tell this story with the visualizations and a tweet?" | Visualization is critical to storytelling, so 25% of your grade for this project will be determined by the visualizations and tables in your report. Good visualizations should "stand alone" and motivate the core results in your paper all by themselves. A good question to keep in mind is "could I tell this story with the visualizations and a tweet?" | ||
Line 188: | Line 171: | ||
== Grades == | == Grades == | ||
Assignments will accrue to your final grade in the following way: | Assignments will accrue to your final grade in the following way: | ||
Line 227: | Line 208: | ||
* Have learned a few basics about programming in Python including variables and lists. | * Have learned a few basics about programming in Python including variables and lists. | ||
* Be ready to start on Week 1 coding challenges. If all goes well, you'll start on them! | * Be ready to start on Week 1 coding challenges. If all goes well, you'll start on them! | ||
=== Week 2: April 3 === | === Week 2: April 3 === | ||
Today we'll be doing a crash course | Today we'll be doing a crash course is basic programming in Python. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Read chapters 2 and 3 of ''Python for Everyone'': | |||
** Chapter 2, Variables | |||
** Chapter 3, Conditionals | |||
* Complete the Code Academy assignment from class last week | |||
* Finish and turn in your [[/Week 1 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 1 coding challenges]] (Due Sunday before class) | ||
'''Class schedule:''' | '''Class schedule:''' | ||
* Answer any questions that folks have about the final projects. | |||
* Answer any questions that folks have about the final projects | * Review material from last week: variables, assignment, if statements | ||
* Review answers to last week's coding challenges | * Review answers to last week's coding challenges | ||
* [[/Week 2 lecture]] <!-- [[Community_Data_Science_Course_%28Spring_2019%29/Day_2_Lecture|Lecture notes]] --> introducing new material including loops and lists | |||
* [[/Week 2 lecture notes]] | * Introduce the [[/Week 2 coding challenges]] <!-- [[Community Data Science Course (Spring 2019)/Day 3 Coding Challenges|Day 3 Coding Challenges]] --> | ||
* Introduce the [[/Week 2 coding challenges]] and the | * Project time — We'll begin working on [[Wordplay]] and the [[/Week 2 coding challenges]] independently or in small groups. | ||
'''By the end of class you will:''' | '''By the end of class you will:''' | ||
* Have a better understanding of the expectations for your final project, | * Have written your first program with loops and lists. | ||
* Have a better understanding of the expectations for your final project, and be ready to hand in your initial assignment. | |||
=== Week 3: April 10 === | === Week 3: April 10 === | ||
Today we'll be doing introducing some additional programming concepts in Python including aggregating and counting with dictionaries | Today we'll be doing introducing some additional programming concepts in Python including aggregating and counting with dictionaries. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* [[#Final Project Idea|Final project idea]] (Due at 5pm on Monday, | * [[#Final Project Idea|Final project idea]] (Due at 5pm on Monday) | ||
* Read chapter 4, 5 of ''Python for Everyone'': {{tentative}} | |||
** Functions (this is mostly new) | |||
** Iteration (this is mostly review) | |||
** Dictionaries {{tbd}} | |||
* Finish and turn in your [[/Week 2 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 2 coding challenges]] (Due Sunday before class) | ||
'''Class schedule:''' | '''Class schedule:''' | ||
* Go over last week's assignments | * Go over last week's assignments | ||
* [[/Week 3 lecture notes]] introducing new material on dictionaries and aggregations plus average, | * [[/Week 3 lecture]] <!-- [[Community_Data_Science_Course_%28Spring_2019%29/Day_2_Lecture|Lecture notes]] --> introducing new material on dictionaries and aggregations plus average, median (using the Wordplay data) | ||
* | * A break | ||
* Introduce the [[/Week 3 coding challenges]] based on the [ | * Introduce the [[/Week 3 coding challenges]] based on the [http://mako.cc/teaching/2015/cdsw-autumn/babynames.zip Baby names] project. <!-- [[Community Data Science Course (Spring 2019)/Day 3 Coding Challenges|Day 3 Coding Challenges]] --> | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
'''Resources:''' | |||
* [[Python data types cheat sheet]] A cheat sheet with everything we've covered in class so far including today. | * [[Python data types cheat sheet]] A cheat sheet with everything we've covered in class so far including today. | ||
<!-- '''By the end of class you will:''' | |||
* Have written your first program with loops and lists. | |||
* Have a better understanding of the expectations for your final project, and be ready to hand in your initial assignment. | |||
--> | --> | ||
Line 315: | Line 268: | ||
=== Week 4: April 17 === | === Week 4: April 17 === | ||
Today we'll be learning | Today we'll be using Python to read and write files from disk and be learning to do some basic tricks with a very useful Python module called Pandas. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Read chapters 8 and 10 of '''Python for Informatics''': | |||
** Dictionaries | |||
** Files | |||
* Finish and turn in your [[/Week 3 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 3 coding challenges]] (Due Sunday before class) | ||
'''Class schedule:''' | '''Class schedule:''' | ||
* Go over the [[/Week 3 coding challenges]] | * Go over the [[/Week 3 coding challenges]] | ||
* [[/Week 4 lecture | * [[/Week 4 lecture]] introducing new material, plus building some basic histograms in Python. | ||
* Introduce the [[/Week 4 coding challenges]] | * Introduce the [[/Week 4 coding challenges]], which will also be based on the [http://mako.cc/teaching/2015/cdsw-autumn/babynames.zip Baby names] project. | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
<!-- * [[/Day 4 coding challenges]] [[Community Data Science Course (Spring 2019)/Day 4 Coding Challenges|Day 4 Coding Challenges]] --> | |||
=== Week 5: April 24 === | === Week 5: April 24 === | ||
Today we'll be | Today we'll be learning about the basic of web APIs and JSON. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Readings {{tbd}} | |||
* Finish and turn in your [[/Week 4 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 4 coding challenges]] (Due Sunday before class) | ||
'''Class schedule:''' | '''Class schedule:''' | ||
* Go over the [[/Week 4 coding challenges]] | * Go over the [[/Week 4 coding challenges]] and review some histograms | ||
* [[/Week 5 lecture | * [[/Week 5 lecture]] introducing new material on APIs and JSON <!-- [[Community Data Science Course (Spring 2019)/Day 5 Notes|Day 5 Notes]] --> | ||
* Introduce the [[/Week 5 coding challenges]] | * Introduce the [[/Week 5 coding challenges]] <!-- [[Community Data Science Course (Spring 2019)/Day 5 Coding Challenges|Day 5 Coding Challenges]] --> | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
=== Week 6: May 1 === | === Week 6: May 1 === | ||
Today we'll be putting everything together and walking through a project that builds a dataset from the Wikipedia API from start-to-finish, with a focus on cleaning and getting data into tools like Excel or Google Sheets, and introducing a set of tricks along the way. | Today we'll be putting everything together and walking through a project that builds a dataset from the Wikipedia API from start-to-finish, with a focus on cleaning and getting data into tools like Excel or Google Sheets, and introducing a set of tricks along the way. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Readings {{tbd}} | |||
* Finish and turn in your [[/Week 5 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 5 coding challenges]] (Due Sunday before class) | ||
Line 396: | Line 316: | ||
* Let's discuss remaining schedule | * Let's discuss remaining schedule | ||
* Go over the [[/Week 5 coding challenges]] | * Go over the [[/Week 5 coding challenges]] | ||
* [[/Week 6 lecture | * [[/Week 6 lecture]] introducing new material on APIs and JSON <!-- [[Community Data Science Course (Spring 2019)/Day 5 Notes|Day 5 Notes]] --><!-- [[Community Data Science Course (Sprint 2019)/Day 7 Notes|Day 7 Notes]] --> | ||
* Introduce the [[/Week 6 coding challenges]] | * Introduce the [[/Week 6 coding challenges]] <!-- [[Community Data Science Course (Spring 2019)/Day 7 Coding Challenges|Day 7 Coding Challenges]] --> | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
<!-- | <!-- | ||
Line 410: | Line 330: | ||
=== Week 7: May 8 === | === Week 7: May 8 === | ||
Today we'll | Today we'll be introducing two additional web APIs in parallel. We'll run a poll of the class to get a sense of what folks want to learn but we're considering Yelp, Reddit, Twitter, and data.seattle.gov. | ||
' | |||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* Readings {{tbd}} | |||
* Finish and turn in your [[/Week 6 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 6 coding challenges]] (Due Sunday before class) | ||
Line 429: | Line 340: | ||
* Go over the [[/Week 6 coding challenges]] | * Go over the [[/Week 6 coding challenges]] | ||
* [[/Week 7 lecture | * [[/Week 7 lecture]] introducing new material on new APIs | ||
* Introduce the [[/Week 7 coding challenges]] | * Introduce the [[/Week 7 coding challenges]] | ||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
Line 435: | Line 346: | ||
=== Week 8: May 15 === | === Week 8: May 15 === | ||
We'll | Today we'll be introducing two additional web APIs in parallel. We'll run a poll of the class to get a sense of what folks want to learn but we're considering Yelp, Reddit, Twitter, and data.seattle.gov. | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* [[#Final Project Proposal]] | * [[#Final Project Proposal]] | ||
* Readings {{tbd}} | |||
* Finish and turn in your [[/Week 7 coding challenges]] (Due Sunday before class) | * Finish and turn in your [[/Week 7 coding challenges]] (Due Sunday before class) | ||
Line 457: | Line 358: | ||
* Let's discuss remaining schedule | * Let's discuss remaining schedule | ||
* Go over the [[/Week 7 coding challenges]] | * Go over the [[/Week 7 coding challenges]] | ||
* | * [[/Week 8 lecture]] | ||
* Introduce the [[/Week 8 coding challenges]] | |||
* Project time—We'll work through projects | * Project time—We'll work through projects | ||
Line 466: | Line 368: | ||
'''Assignment Due:''' | '''Assignment Due:''' | ||
* | * Readings {{tbd}} | ||
* Finish and turn in your [[/Week 8 coding challenges]] (Due Sunday before class) | |||
'''Class schedule:''' | '''Class schedule:''' | ||
* Project time—We'll | * Let's discuss remaining schedule | ||
* | * Go over the [[/Week 8 coding challenges]] | ||
* [[/Week 9 lecture]] | |||
* Project time—We'll work through projects | |||
'''Resources:''' | |||
* Visualization dos and don'ts. We'll discuss the European Environmental Agency's [https://www.eea.europa.eu/data-and-maps/daviz/learn-more/chart-dos-and-donts list of advice for making charts]. **I will refer to this guide as a grade your final projects.** | |||
* Two options for remainder of class. You can work through this introductory guide to visualization in python or you can work on your final project. I'll be here to answer any questions. | |||
* Self-guided visualization tutorial in python. [https://raw.githubusercontent.com/guyrt/teaching/master/2019/Com520B/VisualizationNotebook.ipynb Download here]. Save the file in a new directory in your desktop and open it with Jupyter Notebook. (If you are on Windows, you may run into an issue with missing path variables. [https://stackoverflow.com/questions/52821162/jupyter-notebook-failed-to-load-dll This SO post helped me solve it.]) | |||
=== Week 10: May 29 (NO MEETING) === | === Week 10: May 29 (NO MEETING) === |