Editing Intro to Programming and Data Science (Fall 2021)

From CommunityData

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 102: Line 102:


* An abstract of the proposed study including the topic, research question, theoretical motivation, object(s) of study, and anticipated research contribution.
* An abstract of the proposed study including the topic, research question, theoretical motivation, object(s) of study, and anticipated research contribution.
* An identification of the dataset you will use and a description of the columns or type of data it will include. If you do not currently have access to these data, explain why not and when you will have access (If you need ideas, [[/Datasets|this page]] lists some open datasets).
* An identification of the dataset you will use and a description of the columns or type of data it will include. If you do not currently have access to these data, explain why not and when you will have access (If you need ideas, [[Data_Into_Insights_(Spring_2021)/Final_project#Datasets|this page]] from one of my undergrad classes lists some open datasets).
* A short (several sentences) description of how the project will fit into your career trajectory.
* A short (several sentences) description of how the project will fit into your career trajectory.


Line 287: Line 287:
* Python for Everybody, chapters 9 and 10
* Python for Everybody, chapters 9 and 10
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/day_5.ipynb Today's Jupyter Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/day_5.ipynb Today's Jupyter Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819501/View Video walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819502/View Video walkthrough]
* Margolin, D. B., Hannak, A., & Weber, I. (2018). [https://doi.org/10.1080/10584609.2017.1334018 Political Fact-Checking on Twitter: When Do Corrections Have an Effect?] Political Communication, 35(2), 196–219.
* Margolin, D. B., Hannak, A., & Weber, I. (2018). [https://doi.org/10.1080/10584609.2017.1334018 Political Fact-Checking on Twitter: When Do Corrections Have an Effect?] Political Communication, 35(2), 196–219.
** Discussant:  
** Discussant:  
Line 296: Line 296:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/lecture/day_5.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/lecture/day_5.html Today's slides]


== CATCH UP Week (September 30) ==
== Day 6: Dataframes and Visualization (September 30) ==
 
'''Readings:'''
* Shen, C., Monge, P., & Williams, D. (2014). [https://libkey.io/libraries/228/articles/5013123/full-text-file Virtual brokerage and closure: Network structure and soci8al capital in a massively multiplayer online game]. Communication Research. 41(4): 459–480.
 
 
== Day 6: Dataframes and Visualization (October 7) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 310: Line 304:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/day_6.ipynb Day 6 notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/day_6.ipynb Day 6 notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819502/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819502/View Notebook walkthrough]
* Shaw, A., & Hill, B. M. (2014). Laboratories of oligarchy? How the iron law extends to peer production. Journal of Communication, 64(2), 215–238. https://doi.org/10.1111/jcom.12082
* Benefield, G. A., Shen, C., & Leavitt, A. (2016). [https://doi.org/10.1145/2818048.2819935 Virtual Team Networks: How Group Social Capital Affects Team Success in a Massively Multiplayer Online Game]. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, 679–690.
** Discussant: Sandra




Line 317: Line 312:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/lecture/day_6.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/lecture/day_6.html Today's slides]


== Day 7: Dataframes and visualization (continued) (October 14) ==
== Day 7: Dataframes and visualization (continued) (October 7) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 333: Line 328:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_7/lecture/day_7.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_7/lecture/day_7.html Today's slides]


== Day 8: Collecting Data with APIs (October 21) ==
== Day 8: Collecting Data with APIs (October 14) ==


'''Assignment Due:'''
'''Assignment Due:'''
* [[/Day 8 Coding Challenges|Day 8 Coding Challenges]].
* [[/Day 8 Coding Challenges|Day 8 Coding Challenges]].
** [https://youtu.be/TASX3evcgG4 Video instructions to install tweepy]
** [https://youtu.be/TASX3evcgG4 Video instructions to install tweepy]
* First [[Self_Assessment_Reflection | self-assessment reflection]] is due (on Brightspace).
* Project Planning Document Due


'''Readings:'''
'''Readings:'''
Line 345: Line 338:
** (Long) [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819504/View walkthrough of notebook]
** (Long) [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819504/View walkthrough of notebook]
* Kieran Healy and James Moody (2014). “[https://doi.org/10.1146/annurev-soc-071312-145551 Data Visualization in Sociology].” American Review of Sociology. 40: 105-28.
* Kieran Healy and James Moody (2014). “[https://doi.org/10.1146/annurev-soc-071312-145551 Data Visualization in Sociology].” American Review of Sociology. 40: 105-28.
** Discussant:
** Discussant: Diane


'''Agenda:'''
'''Agenda:'''
Line 352: Line 345:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_8/lecture/day_8.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_8/lecture/day_8.html Today's slides]


== Day 9: Collecting Data with APIs (continued) (October 28) ==
== Day 9: Collecting Data with APIs (continued) (October 21) ==


'''Assignment Due:'''
'''Assignment Due:'''
* [[/Day 9 Coding Challenges|Day 9 Coding Challenges]]
* Start on [[Intro to Programming and Data Science (Fall 2021)/Day 9 Coding Challenges|Day 9 Coding Challenges]]
 
* First [[Self_Assessment_Reflection | self-assessment reflection]] is due (on Brightspace).
* Project Planning Document Due




Line 364: Line 358:
* Python for Everybody, Chapter 13
* Python for Everybody, Chapter 13
* Vitak, J., Shilton, K., & Ashktorab, Z. (2016). [https://doi.org/10.1145/2818048.2820078 Beyond the Belmont Principles: Ethical Challenges, Practices, and Beliefs in the Online Data Research Community]. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, 941–953.
* Vitak, J., Shilton, K., & Ashktorab, Z. (2016). [https://doi.org/10.1145/2818048.2820078 Beyond the Belmont Principles: Ethical Challenges, Practices, and Beliefs in the Online Data Research Community]. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, 941–953.
** Discussant:  Diane
** Discussant:   


* (Optional) Williams, M. L., Burnap, P., & Sloan, L. (2017). [https://doi.org/10.1177/0038038517708140 Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation]: Sociology.
* (Optional) Williams, M. L., Burnap, P., & Sloan, L. (2017). [https://doi.org/10.1177/0038038517708140 Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation]: Sociology.
* (Optional) Salganik, M. [https://www.bitbybitbook.com/en/1st-ed/ethics/ Ethics] chapter from Bit By Bit.  
* (Optional) Salganik, M. [https://www.bitbybitbook.com/en/1st-ed/ethics/ Ethics] chapter from Bit By Bit.  
* (Optional) Crawford, K., & Finn, M. (2015). [https://doi.org/10.1007/s10708-014-9597-z The limits of crisis data: Analytical and ethical challenges of using social and mobile data to understand disasters]. GeoJournal, 80(4), 491–502.
* (Optional) Crawford, K., & Finn, M. (2015). [https://doi.org/10.1007/s10708-014-9597-z The limits of crisis data: Analytical and ethical challenges of using social and mobile data to understand disasters]. GeoJournal, 80(4), 491–502.
* If you are interested in doing web scraping, then look at this [https://github.com/CU-ITSS/Web-Data-Scraping-S2019 incredible mini-course on the topic]. It is all done with Jupyter Notebooks and you have all of the prerequisite knowledge to understand it.
* [https://youtu.be/daUuC-PMZc4 Very brief lecture on web scraping from Spring 2020].




Line 380: Line 370:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_9/lecture/day_9.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_9/lecture/day_9.html Today's slides]


== Day 10: Introduction to Computational Text Analysis (November 4) ==
== Day 10: Introduction to Computational Text Analysis (October 28) ==


'''Assignment Due:'''
'''Assignment Due:'''
* [[/Day 9 Coding Challenges|Day 9 Coding Challenges]]
* [[/Day 10 Coding Challenges|Day 10 Coding Challenges]]
* [[/Day 10 Coding Challenges|Day 10 Coding Challenges]]


Line 398: Line 389:
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/blob/master/resources/solutions/Twitter_answers.ipynb My answers to the Day 8 problems]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/blob/master/resources/solutions/Twitter_answers.ipynb My answers to the Day 8 problems]


== Day 11: Data cleaning and operationalization (November 11) ==
== Day 11: Data cleaning and operationalization (November 4) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 414: Line 405:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_11/lecture/day_11.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_11/lecture/day_11.html Today's slides]


== Day 12: Organizing and storing computational projects (November 18) ==
== Day 12: Organizing and storing computational projects (November 11) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 433: Line 424:
'''Resources:'''
'''Resources:'''


== Day 13: Statistical summaries and tests (December 2) ==
== Day 13: Statistical summaries and tests (November 18) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 448: Line 439:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/lecture/day_13.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/lecture/day_13.html Today's slides]


== Day 14: Screen scraping (December 2) ==
'''Assignment Due:'''
'''Readings:'''
* Shaw, A., & Hill, B. M. (2014). [https://doi.org/10.1111/jcom.12082 Laboratories of oligarchy? How the iron law extends to peer production]. Journal of Communication, 64(2), 215–238.
** Discussant:
* [https://towardsdatascience.com/ethics-in-web-scraping-b96b18136f01 Ethics in Web Scraping] by James Densmore
'''Agenda:'''
* If you are interested in doing web scraping, then look at this [https://github.com/CU-ITSS/Web-Data-Scraping-S2019 incredible mini-course on the topic]. It is all done with Jupyter Notebooks and you have all of the prerequisite knowledge to understand it.
* [https://youtu.be/daUuC-PMZc4 Very brief lecture on web scraping from Spring 2020].


== Day 15: Final Project Presentation (December 9) ==
== Day 15: Final Project Presentation (December 9) ==
Line 474: Line 477:
These are some topics we touched on in class covered in more depth
These are some topics we touched on in class covered in more depth


* [https://youtu.be/rQEsIs9LERM Using Tweepy to do full historical search on Twitter]
* [https://www.youtube.com/watch?v=K8L6KVGG-7o Regular Expressions]
* [https://www.youtube.com/watch?v=K8L6KVGG-7o Regular Expressions]
* [https://www.youtube.com/watch?v=3dt4OGnU5sM List Comprehensions]
* [https://www.youtube.com/watch?v=3dt4OGnU5sM List Comprehensions]
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see CommunityData:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)

Template used on this page: