Editing Intro to Programming and Data Science (Fall 2021)

From CommunityData


Latest revision Your text
Line 102: Line 102:


* An abstract of the proposed study including the topic, research question, theoretical motivation, object(s) of study, and anticipated research contribution.
* An abstract of the proposed study including the topic, research question, theoretical motivation, object(s) of study, and anticipated research contribution.
* An identification of the dataset you will use and a description of the columns or type of data it will include. If you do not currently have access to these data, explain why not and when you will have access. (If you need ideas, [[/Datasets|this page]] lists some open datasets.)
* An identification of the dataset you will use and a description of the columns or type of data it will include. If you do not currently have access to these data, explain why not and when you will have access. (If you need ideas, [[Data_Into_Insights_(Spring_2021)/Final_project#Datasets|this page]] from one of my undergrad classes lists some open datasets.)
* A short (several sentences) description of how the project will fit into your career trajectory.
* A short (several sentences) description of how the project will fit into your career trajectory.


Line 127: Line 127:
==== The project report ====
==== The project report ====


You will write a document or a Jupyter Notebook that will ideally provide the foundation for a high-quality short research paper that you might revise and submit for publication. I do not expect the report to be ready for publication, but it should contain polished drafts of all the necessary components of a scholarly quantitative empirical research study. In terms of the structure, please see the page on the [[structure of a quantitative empirical research paper]].
You will craft a Jupyter Notebook that will ideally provide the foundation for a high-quality short research paper that you might revise and submit for publication. I do not expect the report to be ready for publication, but it should contain polished drafts of all the necessary components of a scholarly quantitative empirical research study. In terms of the structure, please see the page on the [[structure of a quantitative empirical research paper]].


The great thing about a Jupyter Notebook is that it allows you to provide data, code, and any documentation sufficient to enable the replication of all analysis and visualizations. If you choose to write the report as a Word document, then you will need to include the code in a separate file.
The great thing about a Jupyter Notebook is that it allows you to provide data, code, and any documentation sufficient to enable the replication of all analysis and visualizations. If that is not possible/appropriate for some reason, please talk to me so that we can find another solution.


Because the emphasis in this class is on methods and because I'm not an expert in each of your fields, I'm happy to assume that your paper, proposal, or thesis chapter has already established the relevance and significance of your study and has a comprehensive literature review, well-grounded conceptual approach, and compelling reason why this research is important. As a result, you need not focus on these elements of the work in your written submission. Instead, feel free to start with a brief summary of the purpose and importance of this research followed by an introduction of your research questions or hypotheses. If you provide more detail, that's fine, but I won't give you detailed feedback on these parts and they will not figure prominently in my assessment of the work.
Because the emphasis in this class is on methods and because I'm not an expert in each of your fields, I'm happy to assume that your paper, proposal, or thesis chapter has already established the relevance and significance of your study and has a comprehensive literature review, well-grounded conceptual approach, and compelling reason why this research is important. As a result, you need not focus on these elements of the work in your written submission. Instead, feel free to start with a brief summary of the purpose and importance of this research followed by an introduction of your research questions or hypotheses. If you provide more detail, that's fine, but I won't give you detailed feedback on these parts and they will not figure prominently in my assessment of the work.
Line 225: Line 225:
* Fill out this [https://docs.google.com/forms/d/e/1FAIpQLSfUiGogs2jDXIHaXz1ooVBZFkRF2NdMaf00IgZvk7f69rby9w/viewform?usp=sf_link short survey]
* Fill out this [https://docs.google.com/forms/d/e/1FAIpQLSfUiGogs2jDXIHaXz1ooVBZFkRF2NdMaf00IgZvk7f69rby9w/viewform?usp=sf_link short survey]
* Sign up to be a discussant [https://docs.google.com/spreadsheets/d/1uSo-Ya5DghaLu1BYk94EVU2kBVmExRWwOa1586GbFUU/edit?usp=sharing here]
* Sign up to be a discussant [https://docs.google.com/spreadsheets/d/1uSo-Ya5DghaLu1BYk94EVU2kBVmExRWwOa1586GbFUU/edit?usp=sharing here]
* [[/Discord Signup|Sign up for Discord]] and introduce yourself
* [[/Day_2_Coding_Challenges|Day 2 Coding Challenge]] (turn in on Brightspace)
* [[/Day_2_Coding_Challenges|Day 2 Coding Challenge]] (turn in on Brightspace)


Line 231: Line 230:
* Bit By Bit, [https://www.bitbybitbook.com/en/1st-ed/introduction/ Introduction]
* Bit By Bit, [https://www.bitbybitbook.com/en/1st-ed/introduction/ Introduction]
* Python for Everybody, chapters 1-4
* Python for Everybody, chapters 1-4
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_2/day_2.ipynb Today's Jupyter Notebook] (Right-click, save, and open in Jupyter)
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_2/day_2.ipynb Today's Jupyter Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819498/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819498/View Notebook walkthrough]


Line 248: Line 247:
* Python for Everybody
* Python for Everybody
  chapters_to_read = [5, 6, 8]
  chapters_to_read = [5, 6, 8]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_3/day_3.ipynb Today's Jupyter Notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_3/day_3.ipynb Today's Jupyter Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819499/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819499/View Notebook walkthrough]
* Foote, J., Shaw, A., & Hill, B.M. (2017). [https://jeremydfoote.com/files/foote_computational_2017.pdf Computational analysis of social media scholarship]. In Burgess, J., Poell, T., Marwick, A. (Eds.), The Sage Handbook of Social Media. Sage.
* Foote, J., Shaw, A., & Hill, B.M. (2017). [https://jeremydfoote.com/files/foote_computational_2017.pdf Computational analysis of social media scholarship]. In Burgess, J., Poell, T., Marwick, A. (Eds.), The Sage Handbook of Social Media. Sage.
** Discussant:  
** Discussant: Juan Pablo


'''Agenda:'''
'''Agenda:'''
Line 269: Line 268:
         read(chapter)
         read(chapter)
  book.close()
  book.close()
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_4/day_4.ipynb Today's Jupyter Notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_4/day_4.ipynb Today's Jupyter Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819500/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819500/View Notebook walkthrough]
* Nelson, Laura K. 2017. "[https://doi.org/10.1177%2F0049124117729703 Computational Grounded Theory: A Methodological Framework]." Sociological Methods and Research.
* Nelson, Laura K. 2017. "[https://doi.org/10.1177%2F0049124117729703 Computational Grounded Theory: A Methodological Framework]." Sociological Methods and Research.
** Discussant: Elizabeth
** Discussant: Beth Ann


'''Agenda:'''
'''Agenda:'''
Line 286: Line 285:
'''Readings:'''
'''Readings:'''
* Python for Everybody, chapters 9 and 10
* Python for Everybody, chapters 9 and 10
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/day_5.ipynb Today's Jupyter Notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_5/day_5.ipynb Today's Jupyter Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819501/View Video walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819502/View Video walkthrough]
* Margolin, D. B., Hannak, A., & Weber, I. (2018). [https://doi.org/10.1080/10584609.2017.1334018 Political Fact-Checking on Twitter: When Do Corrections Have an Effect?] Political Communication, 35(2), 196–219.
* Margolin, D. B., Hannak, A., & Weber, I. (2018). [https://doi.org/10.1080/10584609.2017.1334018 Political Fact-Checking on Twitter: When Do Corrections Have an Effect?] Political Communication, 35(2), 196–219.
** Discussant:  
** Discussant: Katelyn


'''Agenda:'''
'''Agenda:'''
Line 296: Line 295:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/lecture/day_5.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_5/lecture/day_5.html Today's slides]


== CATCH UP Week (September 30) ==
== Day 6: Dataframes and Visualization (September 30) ==
 
'''Readings:'''
* Shen, C., Monge, P., & Williams, D. (2014). [https://libkey.io/libraries/228/articles/5013123/full-text-file Virtual brokerage and closure: Network structure and social capital in a massively multiplayer online game]. Communication Research. 41(4): 459–480.
 
 
== Day 6: Dataframes and Visualization (October 7) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 308: Line 301:


'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/day_6.ipynb Day 6 notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_6/day_6.ipynb Day 6 notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819502/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819502/View Notebook walkthrough]
* Shaw, A., & Hill, B. M. (2014). Laboratories of oligarchy? How the iron law extends to peer production. Journal of Communication, 64(2), 215–238. https://doi.org/10.1111/jcom.12082
* Benefield, G. A., Shen, C., & Leavitt, A. (2016). [https://doi.org/10.1145/2818048.2819935 Virtual Team Networks: How Group Social Capital Affects Team Success in a Massively Multiplayer Online Game]. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, 679–690.
** Discussant: Anna




Line 317: Line 311:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/lecture/day_6.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_6/lecture/day_6.html Today's slides]


== Day 7: Dataframes and visualization (continued) (October 14) ==
== Day 7: Dataframes and visualization (continued) (October 7) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 323: Line 317:


'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_7/day_7.ipynb Day 7 notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_7/day_7.ipynb Day 7 notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819503/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819503/View Notebook walkthrough]
* Lazer, D., & Radford, J. (2017). [https://doi.org/10.1146/annurev-soc-060116-053457 Data ex Machina: Introduction to Big Data]. Annual Review of Sociology, 43(1), 19–39.
* Lazer, D., & Radford, J. (2017). [https://doi.org/10.1146/annurev-soc-060116-053457 Data ex Machina: Introduction to Big Data]. Annual Review of Sociology, 43(1), 19–39.
** Discussant: Elizabeth
** Discussant: Yong




Line 333: Line 327:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_7/lecture/day_7.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_7/lecture/day_7.html Today's slides]


== Day 8: Collecting Data with APIs (October 21) ==
== Day 8: Collecting Data with APIs (October 14) ==


'''Assignment Due:'''
'''Assignment Due:'''
* [[/Day 8 Coding Challenges|Day 8 Coding Challenges]].
* [[/Day 8 Coding Challenges|Day 8 Coding Challenges]].
** [https://youtu.be/TASX3evcgG4 Video instructions to install tweepy]
** [https://youtu.be/TASX3evcgG4 Video instructions to install tweepy]
* First [[Self_Assessment_Reflection | self-assessment reflection]] is due (on Brightspace).
* Project Planning Document Due


'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_8/day_8.ipynb Intro to APIs Notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_8/day_8.ipynb Intro to APIs Notebook]
** (Long) [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819504/View walkthrough of notebook]
** (Long) [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819504/View walkthrough of notebook]
* Kieran Healy and James Moody (2014). “[https://doi.org/10.1146/annurev-soc-071312-145551 Data Visualization in Sociology].” Annual Review of Sociology. 40: 105-28.
* Kieran Healy and James Moody (2014). “[https://doi.org/10.1146/annurev-soc-071312-145551 Data Visualization in Sociology].” Annual Review of Sociology. 40: 105-28.
** Discussant:
** Discussant: Pearlynne


'''Agenda:'''
'''Agenda:'''
Line 352: Line 344:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_8/lecture/day_8.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_8/lecture/day_8.html Today's slides]


== Day 9: Collecting Data with APIs (continued) (October 28) ==
== Day 9: Collecting Data with APIs (continued) (October 21) ==


'''Assignment Due:'''
'''Assignment Due:'''
* [[/Day 9 Coding Challenges|Day 9 Coding Challenges]]
* Start on [[Intro to Programming and Data Science (Fall 2021)/Day 9 Coding Challenges|Day 9 Coding Challenges]]
 
* First [[Self_Assessment_Reflection | self-assessment reflection]] is due (on Brightspace).
* Project Planning Document Due




'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_9/day_9.ipynb Day 9 Notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_9/day_9.ipynb Day 9 Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819505/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819505/View Notebook walkthrough]
* Python for Everybody, Chapter 13
* Python for Everybody, Chapter 13
* Vitak, J., Shilton, K., & Ashktorab, Z. (2016). [https://doi.org/10.1145/2818048.2820078 Beyond the Belmont Principles: Ethical Challenges, Practices, and Beliefs in the Online Data Research Community]. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, 941–953.
* Vitak, J., Shilton, K., & Ashktorab, Z. (2016). [https://doi.org/10.1145/2818048.2820078 Beyond the Belmont Principles: Ethical Challenges, Practices, and Beliefs in the Online Data Research Community]. Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing, 941–953.
** Discussant: Diane
** Discussant: Casey Lynn


* (Optional) Williams, M. L., Burnap, P., & Sloan, L. (2017). [https://doi.org/10.1177/0038038517708140 Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation]: Sociology.
* (Optional) Williams, M. L., Burnap, P., & Sloan, L. (2017). [https://doi.org/10.1177/0038038517708140 Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users’ Views, Online Context and Algorithmic Estimation]: Sociology.
* (Optional) Salganik, M. [https://www.bitbybitbook.com/en/1st-ed/ethics/ Ethics] chapter from Bit By Bit.  
* (Optional) Salganik, M. [https://www.bitbybitbook.com/en/1st-ed/ethics/ Ethics] chapter from Bit By Bit.  
* (Optional) Crawford, K., & Finn, M. (2015). [https://doi.org/10.1007/s10708-014-9597-z The limits of crisis data: Analytical and ethical challenges of using social and mobile data to understand disasters]. GeoJournal, 80(4), 491–502.
* (Optional) Crawford, K., & Finn, M. (2015). [https://doi.org/10.1007/s10708-014-9597-z The limits of crisis data: Analytical and ethical challenges of using social and mobile data to understand disasters]. GeoJournal, 80(4), 491–502.
* If you are interested in doing web scraping, then look at this [https://github.com/CU-ITSS/Web-Data-Scraping-S2019 incredible mini-course on the topic]. It is all done with Jupyter Notebooks and you have all of the prerequisite knowledge to understand it.
* [https://youtu.be/daUuC-PMZc4 Very brief lecture on web scraping from Spring 2020].




Line 380: Line 369:
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_9/lecture/day_9.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_9/lecture/day_9.html Today's slides]


== Day 10: Introduction to Computational Text Analysis (November 4) ==
== Day 10: Introduction to Computational Text Analysis (October 28) ==


'''Assignment Due:'''
'''Assignment Due:'''
* [[/Day 9 Coding Challenges|Day 9 Coding Challenges]]
* [[/Day 10 Coding Challenges|Day 10 Coding Challenges]]
* [[/Day 10 Coding Challenges|Day 10 Coding Challenges]]


'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_10/day_10.ipynb Today's Notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_10/day_10.ipynb Today's Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819506/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819506/View Notebook walkthrough]
* Christopher A. Bail et al. 2018. [https://doi.org/10.1073/pnas.1804840115 Exposure to opposing views on social media can increase political polarization]. PNAS 115(37): 9216-9221
* Christopher A. Bail et al. 2018. [https://doi.org/10.1073/pnas.1804840115 Exposure to opposing views on social media can increase political polarization]. PNAS 115(37): 9216-9221
** Discussant: Diane
** Discussant: Caitlyn


'''Agenda:'''
'''Agenda:'''
Line 398: Line 388:
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/blob/master/resources/solutions/Twitter_answers.ipynb My answers to the Day 8 problems]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/blob/master/resources/solutions/Twitter_answers.ipynb My answers to the Day 8 problems]


== Day 11: Data cleaning and operationalization (November 11) ==
== Day 11: Data cleaning and operationalization (November 4) ==


'''Assignment Due:'''
'''Assignment Due:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_11/day_11.ipynb Day 11 Coding Challenges]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_11/day_11.ipynb Day 11 Coding Challenges]




'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_11/day_11.ipynb Today's Notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_11/day_11.ipynb Today's Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819507/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819507/View Notebook walkthrough]
* Robert K. Merton. 1948. [https://www-jstor-org.ezproxy.lib.purdue.edu/stable/2087142?sid=primo&origin=crossref&seq=1#metadata_info_tab_contents The Bearing of Empirical Research Upon the Development of Social Theory]. American Sociological Review 13(5): 505-515.
* Robert K. Merton. 1948. [https://www-jstor-org.ezproxy.lib.purdue.edu/stable/2087142?sid=primo&origin=crossref&seq=1#metadata_info_tab_contents The Bearing of Empirical Research Upon the Development of Social Theory]. American Sociological Review 13(5): 505-515.
* Sara Klingenstein, Tim Hitchcock, and Simon DeDeo. 2014. [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4084475/ The civilizing process in London’s Old Bailey]. Proceedings of the National Academy of Sciences 111(26): 9419-9424.
* Sara Klingenstein, Tim Hitchcock, and Simon DeDeo. 2014. [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4084475/ The civilizing process in London’s Old Bailey]. Proceedings of the National Academy of Sciences 111(26): 9419-9424.
** Discussant:  
** Discussant: Jeremy


'''Resources:'''
'''Resources:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_11/lecture/day_11.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_11/lecture/day_11.html Today's slides]


== Day 12: Organizing and storing computational projects (November 18) ==
== Day 12: Organizing and storing computational projects (November 11) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 424: Line 414:
* [https://learngitbranching.js.org/ Interactive git branching tutorial]
* [https://learngitbranching.js.org/ Interactive git branching tutorial]
* DellaPosta, D., Shi, Y., & Macy, M. (2015). [https://doi.org/10.1086/681254 Why Do Liberals Drink Lattes]? American Journal of Sociology, 120(5), 1473–1511.
* DellaPosta, D., Shi, Y., & Macy, M. (2015). [https://doi.org/10.1086/681254 Why Do Liberals Drink Lattes]? American Journal of Sociology, 120(5), 1473–1511.
** Discussant:   
** Discussant:  Lucy


'''Agenda:'''
'''Agenda:'''
Line 433: Line 423:
'''Resources:'''
'''Resources:'''


== Day 13: Statistical summaries and tests (December 2) ==
== Day 13: Statistical summaries and tests (November 18) ==


'''Assignment Due:'''
'''Assignment Due:'''
Line 440: Line 430:


'''Readings:'''
'''Readings:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/day_13.ipynb Day 13 Notebook]
* [https://github.com/jdfoote/Intro-to-Programming-and-Data-Science/raw/summer2021/day_13/day_13.ipynb Day 13 Notebook]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819508/View Notebook walkthrough]
** [https://purdue.brightspace.com/d2l/le/content/335095/viewContent/6819508/View Notebook walkthrough]
* Tan, C. (2018). [https://aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17811 Tracing community genealogy: How new communities emerge from the old]. Proceedings of the Twelfth International Conference on Web and Social Media (ICWSM ’18), 395–404.
* Tan, C. (2018). [https://aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17811 Tracing community genealogy: How new communities emerge from the old]. Proceedings of the Twelfth International Conference on Web and Social Media (ICWSM ’18), 395–404.
** Discussant: Sandra
** Discussant:  


'''Agenda:'''
'''Agenda:'''
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/lecture/day_13.html Today's slides]
* [https://jeremydfoote.com/Intro-to-Programming-and-Data-Science/day_13/lecture/day_13.html Today's slides]


== Day 14: Screen scraping (December 2) ==
'''Assignment Due:'''
'''Readings:'''
* Shaw, A., & Hill, B. M. (2014). [https://doi.org/10.1111/jcom.12082 Laboratories of oligarchy? How the iron law extends to peer production]. Journal of Communication, 64(2), 215–238.
** Discussant:
* [https://towardsdatascience.com/ethics-in-web-scraping-b96b18136f01 Ethics in Web Scraping] by James Densmore
'''Agenda:'''
* If you are interested in doing web scraping, then look at this [https://github.com/CU-ITSS/Web-Data-Scraping-S2019 incredible mini-course on the topic]. It is all done with Jupyter Notebooks and you have all of the prerequisite knowledge to understand it.
* [https://youtu.be/daUuC-PMZc4 Very brief lecture on web scraping from Spring 2020].


== Day 15: Final Project Presentation (December 9) ==
== Day 15: Final Project Presentation (December 9) ==
Line 474: Line 476:
These are some topics we touched on in class covered in more depth
These are some topics we touched on in class covered in more depth


* [https://youtu.be/rQEsIs9LERM Using Tweepy to do full historical search on Twitter]
* [https://www.youtube.com/watch?v=K8L6KVGG-7o Regular Expressions]
* [https://www.youtube.com/watch?v=K8L6KVGG-7o Regular Expressions]
* [https://www.youtube.com/watch?v=3dt4OGnU5sM List Comprehensions]
* [https://www.youtube.com/watch?v=3dt4OGnU5sM List Comprehensions]