Editing Intro to Programming and Data Science (Fall 2022)

From CommunityData

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 57: Line 57:


# Although details on this syllabus will change, I will not change readings or assignments less than three days before they are due. If you plan to read more than three days ahead, contact me first.
# Although details on this syllabus will change, I will not change readings or assignments less than three days before they are due. If you plan to read more than three days ahead, contact me first.
# Check your email. Because this a wiki, you will be able to track every change by clicking the ''history'' button on this page. I will also summarize these changes in a announcements that will be sent via Brightspace email.
# Closely monitor Discord. Because this a wiki, you will be able to track every change by clicking the ''history'' button on this page. I will also summarize these changes in a announcements that will be posted in the #announcements channel on Discord.
# I will ask the class for voluntary anonymous feedback frequently. Please let me know what is working and what can be improved.
# I will ask the class for voluntary anonymous feedback frequently. Please let me know what is working and what can be improved.


Line 64: Line 64:
Our class time will follow the "flipped" classroom model. I will provide asynchronous materials (readings, recorded lectures, assignments, etc.) which you will work on before class and we will use our class time to review concepts, identify confusion, and synthesize.
Our class time will follow the "flipped" classroom model. I will provide asynchronous materials (readings, recorded lectures, assignments, etc.) which you will work on before class and we will use our class time to review concepts, identify confusion, and synthesize.


The class sessions will typically follow the same format.  
The class sessions will typically follow the same format. First, we will discuss the reading for the day, with discussions generally led by a student. We will then have a discussion about the concepts that are still confusing. Next, we will go over the homework questions followed by optional co-working time to start on the next assignment.
 
On Tuesdays, we will review the Coding Challenges that were due. Those who were assigned will explain how they solved (or didn't solve!) each problem.
 
On Thursdays, we will begin by discussing the reading for the day, with discussions generally led by a student. We will then have a discussion about the concepts that are still confusing. Next, we will have by optional co-working time to start on the next assignment.


I try to make the classes useful and to tailor them to the needs of the students, and I highly encourage you to attend as many of our sessions as possible. My teaching style is very conversational and relies on students being willing to speak up to express confusion, seek clarification, or make a well-informed point, so please participate!
I try to make the classes useful and to tailor them to the needs of the students, and I highly encourage you to attend as many of our sessions as possible. My teaching style is very conversational and relies on students being willing to speak up to express confusion, seek clarification, or make a well-informed point, so please participate!
Line 87: Line 83:
== Research project ==
== Research project ==


As a demonstration of your learning in this course, you will design and carry out a quantitative research project, start to finish. This means you will:
As a demonstration of your learning in this course, you will design and carry out a quantitative research project, start to finish. This means you will all:


* '''Design and describe a plan for a study''' — The study you design should involve quantitative analysis and should be something you can complete at least a first pass on during this module.
* '''Design and describe a plan for a study''' — The study you design should involve quantitative analysis and should be something you can complete at least a first pass on during this module.
Line 100: Line 96:
=== Project idea and dataset identification ===
=== Project idea and dataset identification ===


;Due date: September 6
;Due date: September 9
;Maximum length: 500 words (~1-2 pages)
;Maximum length: 500 words (~1-2 pages)


Line 111: Line 107:
=== Project planning document ===
=== Project planning document ===


;Due date: October 18
;Due date: October 21
;Maximum length: ~4-5 pages
;Maximum length: ~4-5 pages


Line 123: Line 119:
=== Project presentation and report ===
=== Project presentation and report ===


;Report due date: December 14
;Report due date: December 15
;Maximum length: 4000 words (~15 pages)
;Maximum length: 4000 words (~15 pages)


;Presentation due date: December 6 and December 8
;Presentation due date: December 9
;Maximum length: 8 minutes
;Maximum length: 8 minutes


Line 135: Line 131:
The great thing about a Jupyter Notebook is that it allows you to provide data, code, and any documentation sufficient to enable the replication of all analysis and visualizations. If you choose to write the report as a Word document, then you will need to include the code in a separate file.
The great thing about a Jupyter Notebook is that it allows you to provide data, code, and any documentation sufficient to enable the replication of all analysis and visualizations. If you choose to write the report as a Word document, then you will need to include the code in a separate file.


Because the emphasis in this class is on methods and because I'm not an expert in each of your fields, I'm happy to assume that your paper, proposal, or thesis chapter has already established the relevance and significance of your study and has a comprehensive literature review, well-grounded conceptual approach, and compelling reason why this research is important. As a result, you do not need to focus on these elements of the work in your written submission. Instead, feel free to start with a brief summary of the purpose and importance of this research followed by an introduction of your research questions or hypotheses. If you provide more detail, that's fine, but I won't give you detailed feedback on these parts and they will not figure prominently in my assessment of the work.
Because the emphasis in this class is on methods and because I'm not an expert in each of your fields, I'm happy to assume that your paper, proposal, or thesis chapter has already established the relevance and significance of your study and has a comprehensive literature review, well-grounded conceptual approach, and compelling reason why this research is important. As a result, you need not focus on these elements of the work in your written submission. Instead, feel free to start with a brief summary of the purpose and importance of this research followed by an introduction of your research questions or hypotheses. If you provide more detail, that's fine, but I won't give you detailed feedback on these parts and they will not figure prominently in my assessment of the work.


Jupyter Notebooks do not have all of the tools for citations that Word or LaTeX or even Google Docs have, so while I expect you to cite related work your references section does not need to be as polished as citation management software would make it.
Jupyter Notebooks do not have all of the tools for citations that Word or LaTeX or even Google Docs have, so while I expect you to cite related work it is OK if it isn't as polished as citation management software would make it.


==== The presentation ====
==== The presentation ====


The presentation will provide an opportunity to share a brief summary of your project and findings with the other members of the class. However, don't treat it as a comprehensive overview of your paper: I would rather you tell a subset of the story well than the whole story in a rushed fashion. For instance, you can give a completely successful presentation by describing the motivation and walking through one plot in your paper. Since many of you will give many research presentations throughout your career, I strongly encourage you to take the opportunity to refine your academic presentation skills.
The presentation will provide an opportunity to share a brief summary of your project and findings with the other members of the class. However, don't treat it as a comprehensive overview of your paper: I would rather you tell a subset of the story well than the whole story in a rushed fashion. For instance, you can give a completely successful presentation by describing the motivation and walking through one plot in your paper. Since you will all give other research presentations throughout your career, I strongly encourage you to take the opportunity to refine your academic presentation skills.


I anticipate that most people will either create a PowerPoint presentation or will walk us through a simple Jupyter Notebook. All presentations will need to be ''a maximum of 8 minutes long''. Concisely communicating an idea in the time allotted is an important skill in its own right. Presentations should be uploaded to the Discussion forum on Brightspace created for this purpose.
I anticipate that most people will either create a PowerPoint presentation or will walk us through a simple Jupyter Notebook. All presentations will need to be ''a maximum of 8 minutes long''. Concisely communicating an idea in the time allotted is an important skill in its own right. Presentations should be uploaded to the Discussion forum on Brightspace created for this purpose.


== Coding Challenges ==
== Daily Coding Challenges ==


Nearly every week you will have set of coding challenges to complete. These coding challenges will be turned in on Brightspace but will not be graded. I encourage you to work together on these challenges but to make sure that you understand the concepts yourself.
Nearly every day I will give you a set of coding challenges before the end of class that will involve writing code or adding to code that I've given you. These coding challenges will be turned in on Brightspace but will not be graded. I encourage you to work together on these challenges but to make sure that you understand the concepts yourself.


Each day I will randomly select a set of students to share their solutions to a selected exercise. This will involve putting your solution on Piazza at least one hour before the next day's lecture starts, and being prepared to walk us through the solution. If you can't figure out the problem that's been assigned to you, then explain where you got stuck and what you tried. I encourage you to also use Piazza to ask and answer each other's questions as you work on the assignments. We will use some of our lecture time to review the problems and I will make sure that a correct solution is posted by the end of that day. As you will see over the course of the module, there are many possible solutions to many programming problems and my own approaches will often be different than yours. That's completely fine! Coding is a creative act!
Each day I will randomly select a set of students to share their solutions to a selected exercise. This will involve putting your solution on Discord at least one hour before the next day's lecture starts, and being prepared to walk us through the solution. If you can't figure out the problem that's been assigned to you, then explain where you got stuck and what you tried. I encourage you to also use Discord to ask and answer each other's questions as you work on the assignments. We will use some of our lecture time to review the problems and I will make sure that a correct solution is posted by the end of that day. As you will see over the course of the module, there are many possible solutions to many programming problems and my own approaches will often be different than yours. That's completely fine! Coding is a creative act!


While I won't be grading these assignments, I will review a sample of them to look for common difficulties or problems. If you want me to provide specific feedback for an assignment, please let me know and I will be happy to do so.


== Paper Discussions ==
== Paper Discussions ==


Every Thursday we will review a paper that uses computational methods. On the first day, I will ask you to sign up to lead the discussion for one or more of these papers. When leading the discussion, you will prepare a presentation as though you were presenting the paper at a conference and then lead a discussion about it.
Every day we will review a paper that uses computational methods. On the first day, I will ask you to sign up to lead the discussion for one or more of these papers. When leading the discussion, you will prepare a presentation as though you were presenting the paper at a conference and then lead a discussion about it.
 
I am eager to update the papers that we read in the class. If you would like to propose a different paper that you would prefer to present on, I will almost certainly say yes.
 
When you are not presenting, I do not expect you to read the week's paper thoroughly. However, you should read it closely enough that you are prepared to discuss it.


== Reflection papers ==
== Reflection papers ==
Line 192: Line 183:
* Exceed requirements, but in fairly straightforward ways
* Exceed requirements, but in fairly straightforward ways
* Compose complete and sufficiently detailed reflections.
* Compose complete and sufficiently detailed reflections.
* Complete nearly all of the programming assignments at a high level
* Complete many of the programming assignments at a high level


C: This reflects meeting the minimum expectations of the course. Students reaching this level of achievement
C: This reflects meeting the minimum expectations of the course. Students reaching this level of achievement
Line 217: Line 208:


'''Required Readings:'''  
'''Required Readings:'''  
* [https://www.youtube.com/watch?v=5pf0_bpNbkw Intro to Jupyter Notebooks video]
* [https://www.youtube.com/watch?v=HW29067qVWk Intro to Jupyter Notebooks video]


'''Agenda:'''
'''Agenda:'''
Line 234: Line 225:
* Fill out this [https://docs.google.com/forms/d/e/1FAIpQLSfUiGogs2jDXIHaXz1ooVBZFkRF2NdMaf00IgZvk7f69rby9w/viewform?usp=sf_link short survey]
* Fill out this [https://docs.google.com/forms/d/e/1FAIpQLSfUiGogs2jDXIHaXz1ooVBZFkRF2NdMaf00IgZvk7f69rby9w/viewform?usp=sf_link short survey]
* Sign up to be a discussant [https://docs.google.com/spreadsheets/d/1uSo-Ya5DghaLu1BYk94EVU2kBVmExRWwOa1586GbFUU/edit?usp=sharing here]
* Sign up to be a discussant [https://docs.google.com/spreadsheets/d/1uSo-Ya5DghaLu1BYk94EVU2kBVmExRWwOa1586GbFUU/edit?usp=sharing here]
* [[/Discord Signup|Sign up for Discord]] and introduce yourself
* [[/Week_2_Coding_Challenges|Week 2 Coding Challenge]] (turn in on Brightspace)
* [[/Week_2_Coding_Challenges|Week 2 Coding Challenge]] (turn in on Brightspace)


Line 240: Line 232:
* Python for Everybody, chapters 1-4
* Python for Everybody, chapters 1-4
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_2/day_2.ipynb Today's Jupyter Notebook] (Right-click, save, and open in Jupyter)
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_2/day_2.ipynb Today's Jupyter Notebook] (Right-click, save, and open in Jupyter)
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014636/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819498/View Notebook walkthrough]


'''Agenda:'''
'''Agenda:'''
Line 257: Line 249:
  chapters_to_read = [5, 6, 8]
  chapters_to_read = [5, 6, 8]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_3/day_3.ipynb Today's Jupyter Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_3/day_3.ipynb Today's Jupyter Notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014637/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819499/View Notebook walkthrough]
* Foote, J., Shaw, A., & Hill, B.M. (2017). [https://jeremydfoote.com/files/foote_computational_2017.pdf Computational analysis of social media scholarship]. In Burgess, J., Poell, T., Marwick, A. (Eds.), The Sage Handbook of Social Media. Sage.
* Foote, J., Shaw, A., & Hill, B.M. (2017). [https://jeremydfoote.com/files/foote_computational_2017.pdf Computational analysis of social media scholarship]. In Burgess, J., Poell, T., Marwick, A. (Eds.), The Sage Handbook of Social Media. Sage.
** Discussant: Christina
** Discussant:  


'''Agenda:'''
'''Agenda:'''
Line 278: Line 270:
  book.close()
  book.close()
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_4/day_4.ipynb Today's Jupyter Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_4/day_4.ipynb Today's Jupyter Notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014638/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819500/View Notebook walkthrough]
* Nelson, Laura K. 2017. "[https://doi.org/10.1177%2F0049124117729703 Computational Grounded Theory: A Methodological Framework]." Sociological Methods and Research.
* Nelson, Laura K. 2017. "[https://doi.org/10.1177%2F0049124117729703 Computational Grounded Theory: A Methodological Framework]." Sociological Methods and Research.
** Discussant: Chiara
** Discussant: Elizabeth


'''Agenda:'''
'''Agenda:'''
Line 295: Line 287:
* Python for Everybody, chapters 9 and 10
* Python for Everybody, chapters 9 and 10
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/day_5.ipynb Today's Jupyter Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/day_5.ipynb Today's Jupyter Notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014639/View Video walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819501/View Video walkthrough]
* Margolin, D. B., Hannak, A., & Weber, I. (2018). [https://doi.org/10.1080/10584609.2017.1334018 Political Fact-Checking on Twitter: When Do Corrections Have an Effect?] Political Communication, 35(2), 196–219.
* Margolin, D. B., Hannak, A., & Weber, I. (2018). [https://doi.org/10.1080/10584609.2017.1334018 Political Fact-Checking on Twitter: When Do Corrections Have an Effect?] Political Communication, 35(2), 196–219.
** Discussant: Alisa
** Discussant:  


'''Agenda:'''
'''Agenda:'''
Line 307: Line 299:


'''Readings:'''
'''Readings:'''
* Shen, C., Monge, P., & Williams, D. (2014). [https://libkey.io/libraries/228/articles/5013123/full-text-file Virtual brokerage and closure: Network structure and social capital in a massively multiplayer online game]. Communication Research. 41(4): 459–480.
* Shen, C., Monge, P., & Williams, D. (2014). [https://libkey.io/libraries/228/articles/5013123/full-text-file Virtual brokerage and closure: Network structure and soci8al capital in a massively multiplayer online game]. Communication Research. 41(4): 459–480.
Discussant: Ajay
 


== Week 6: Dataframes and Visualization (October 4) ==
== Week 6: Dataframes and Visualization (October 4) ==
Line 317: Line 309:
'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/day_6.ipynb Week 6 notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/day_6.ipynb Week 6 notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014640/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819502/View Notebook walkthrough]
* Shaw, A., & Hill, B. M. (2014). Laboratories of oligarchy? How the iron law extends to peer production. Journal of Communication, 64(2), 215–238. https://doi.org/10.1111/jcom.12082
* Shaw, A., & Hill, B. M. (2014). Laboratories of oligarchy? How the iron law extends to peer production. Journal of Communication, 64(2), 215–238. https://doi.org/10.1111/jcom.12082
Discussants: Miriam




Line 335: Line 326:
'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_7/day_7.ipynb Week 7 notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_7/day_7.ipynb Week 7 notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014641/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819503/View Notebook walkthrough]
* Lazer, D., & Radford, J. (2017). [https://doi.org/10.1146/annurev-soc-060116-053457 Data ex Machina: Introduction to Big Data]. Annual Review of Sociology, 43(1), 19–39.
* Lazer, D., & Radford, J. (2017). [https://doi.org/10.1146/annurev-soc-060116-053457 Data ex Machina: Introduction to Big Data]. Annual Review of Sociology, 43(1), 19–39.
** Discussant: Paxton and Luhao
** Discussant: Elizabeth




Line 354: Line 345:
'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_8/day_8.ipynb Intro to APIs Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_8/day_8.ipynb Intro to APIs Notebook]
** (Long) [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014642/View walkthrough of notebook]
** (Long) [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819504/View walkthrough of notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/extra_topics/twitter_v2_example.ipynb Twitter V2 Academic API Notebook]
** [https://www.youtube.com/watch?v=rQEsIs9LERM&lc=UgzEMuqQB8UpkOQr0Hd4AaABAg Video walkthrough]
* Kieran Healy and James Moody (2014). “[https://doi.org/10.1146/annurev-soc-071312-145551 Data Visualization in Sociology].” American Review of Sociology. 40: 105-28.
* Kieran Healy and James Moody (2014). “[https://doi.org/10.1146/annurev-soc-071312-145551 Data Visualization in Sociology].” American Review of Sociology. 40: 105-28.
** Discussant: Keqin
** Discussant:


'''Agenda:'''
'''Agenda:'''
Line 374: Line 363:
'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_9/day_9.ipynb Week 9 Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_9/day_9.ipynb Week 9 Notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014643/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819505/View Notebook walkthrough]
* Python for Everybody, Chapter 13
* Python for Everybody, Chapter 13
* Vitak, J., Shilton, K., & Ashktorab, Z. (2016). [https://doi.org/10.1145/2818048.2820078 Beyond the Belmont Principles: Ethical Challenges, Practices, and Beliefs in the Online Data Research Community]. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, 941–953.
** Discussant:  Diane


* Christopher A. Bail et al. 2018. [https://doi.org/10.1073/pnas.1804840115 Exposure to opposing views on social media can increase political polarization]. PNAS 115(37): 9216-9221
* (Optional) Williams, M. L., Burnap, P., & Sloan, L. (2017). [https://doi.org/10.1177/0038038517708140 Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation]: Sociology.
** Discussant: Ryan
* (Optional) Salganik, M. [https://www.bitbybitbook.com/en/1st-ed/ethics/ Ethics] chapter from Bit By Bit.
* (Optional) Crawford, K., & Finn, M. (2015). [https://doi.org/10.1007/s10708-014-9597-z The limits of crisis data: Analytical and ethical challenges of using social and mobile data to understand disasters]. GeoJournal, 80(4), 491–502.


* If you are interested in doing web scraping, then look at this [https://github.com/CU-ITSS/Web-Data-Scraping-S2019 incredible mini-course on the topic]. It is all done with Jupyter Notebooks and you have all of the prerequisite knowledge to understand it.
* If you are interested in doing web scraping, then look at this [https://github.com/CU-ITSS/Web-Data-Scraping-S2019 incredible mini-course on the topic]. It is all done with Jupyter Notebooks and you have all of the prerequisite knowledge to understand it.
Line 397: Line 389:
'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_10/day_10.ipynb Today's Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_10/day_10.ipynb Today's Notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014644/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819506/View Notebook walkthrough]
* Sara Klingenstein, Tim Hitchcock, and Simon DeDeo. 2014. [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4084475/ The civilizing process in London’s Old Baily]. Proceedings of the National Academy of Sciences 111(26): 9419-9424.
* Christopher A. Bail et al. 2018. [https://doi.org/10.1073/pnas.1804840115 Exposure to opposing views on social media can increase political polarization]. PNAS 115(37): 9216-9221
** Discussant: Diane


'''Agenda:'''
'''Agenda:'''
Line 415: Line 408:
'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_11/day_11.ipynb Today's Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_11/day_11.ipynb Today's Notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014645/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819507/View Notebook walkthrough]
* Robert K. Merton. 1948. [https://www-jstor-org.ezproxy.lib.purdue.edu/stable/2087142?sid=primo&origin=crossref&seq=1#metadata_info_tab_contents The Bearing of Empirical Research Upon the Development of Social Theory]. American Sociological Review 13(5): 505-515.
* Robert K. Merton. 1948. [https://www-jstor-org.ezproxy.lib.purdue.edu/stable/2087142?sid=primo&origin=crossref&seq=1#metadata_info_tab_contents The Bearing of Empirical Research Upon the Development of Social Theory]. American Sociological Review 13(5): 505-515.
* DellaPosta, D., Shi, Y., & Macy, M. (2015). [https://doi.org/10.1086/681254 Why Do Liberals Drink Lattes]? American Journal of Sociology, 120(5), 1473–1511.
* Sara Klingenstein, Tim Hitchcock, and Simon DeDeo. 2014. [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4084475/ The civilizing process in London’s Old Baily]. Proceedings of the National Academy of Sciences 111(26): 9419-9424.
** Discussant: Dyuti and Bakhti
** Discussant:  
 


'''Resources:'''
'''Resources:'''
Line 433: Line 425:
* [https://www.youtube.com/watch?v=SWYqp7iY_Tc Git & GitHub Crash Course For Beginners] - YouTube video (not by me) introducing Git and Github
* [https://www.youtube.com/watch?v=SWYqp7iY_Tc Git & GitHub Crash Course For Beginners] - YouTube video (not by me) introducing Git and Github
* [https://learngitbranching.js.org/ Interactive git branching tutorial]
* [https://learngitbranching.js.org/ Interactive git branching tutorial]
* Tan, C. (2018). [https://aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17811 Tracing community genealogy: How new communities emerge from the old]. Proceedings of the Twelfth International Conference on Web and Social Media (ICWSM ’18), 395–404.
* DellaPosta, D., Shi, Y., & Macy, M. (2015). [https://doi.org/10.1086/681254 Why Do Liberals Drink Lattes]? American Journal of Sociology, 120(5), 1473–1511.
** Discussant: Woojin
** Discussant:
 


'''Agenda:'''
'''Agenda:'''
Line 444: Line 435:
'''Resources:'''
'''Resources:'''


 
== Week 13: Statistical summaries and tests (November 29) ==
'''NO CLASS ON NOVEMBER 18'''
* I will show up in the classroom if people want to have a co-working session / ask questions.
 
 
== Week 13: Statistics and Statistical Programming (November 22)


'''Assignment Due:'''
'''Assignment Due:'''
Line 455: Line 441:




 
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/day_13.ipynb Week 13 Notebook]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/day_13.ipynb Week 13 Notebook]
** [https://purdue.brightspace.com/d2l/le/content/598734/viewContent/10014646/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819508/View Notebook walkthrough]
* Tan, C. (2018). [https://aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17811 Tracing community genealogy: How new communities emerge from the old]. Proceedings of the Twelfth International Conference on Web and Social Media (ICWSM ’18), 395–404.
** Discussant: Sandra


'''Agenda:'''
'''Agenda:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/lecture/day_13.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/lecture/day_13.html Today's slides]


== Week 14: Ethics of Online Research (November 29) ==
'''Readings:'''
* Vitak, J., Shilton, K., & Ashktorab, Z. (2016). [https://doi.org/10.1145/2818048.2820078 Beyond the Belmont Principles: Ethical Challenges, Practices, and Beliefs in the Online Data Research Community]. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, 941–953.
* (Optional) Williams, M. L., Burnap, P., & Sloan, L. (2017). [https://doi.org/10.1177/0038038517708140 Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation]: Sociology.
* (Optional) Salganik, M. [https://www.bitbybitbook.com/en/1st-ed/ethics/ Ethics] chapter from Bit By Bit.
* (Optional) Crawford, K., & Finn, M. (2015). [https://doi.org/10.1007/s10708-014-9597-z The limits of crisis data: Analytical and ethical challenges of using social and mobile data to understand disasters]. GeoJournal, 80(4), 491–502.
Dec 1
'''Reading:'''
* Field, A., Kliger, D., Wintner, S., Pan, J., Jurafsky, D., & Tsvetkov, Y. (2018). [https://doi.org/10.18653/v1/D18-1393 Framing and Agenda-setting in Russian News: A Computational Analysis of Intricate Political Strategies]. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 3570–3580.
Discussant: Angela


== Week 15: Final Project Presentation (December 6) ==
== Week 15: Final Project Presentation (December 6) ==
Line 506: Line 477:


* [https://youtu.be/rQEsIs9LERM Using Tweepy to do full historical search on Twitter]
* [https://youtu.be/rQEsIs9LERM Using Tweepy to do full historical search on Twitter]
* [https://github.com/CU-ITSS/Web-Data-Scraping-S2019 Mini course on screen scraping]
* [https://www.youtube.com/watch?v=K8L6KVGG-7o Regular Expressions]
* [https://www.youtube.com/watch?v=K8L6KVGG-7o Regular Expressions]
* [https://www.youtube.com/watch?v=3dt4OGnU5sM List Comprehensions]
* [https://www.youtube.com/watch?v=3dt4OGnU5sM List Comprehensions]
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see CommunityData:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)

Template used on this page: