CDSC Computational Social Science Workshop (Fall 2022): Difference between revisions

From CommunityData
 
(20 intermediate revisions by the same user not shown)
Line 31: Line 31:


:'''Material''': <font size="+1">[[CDSW/Day 0 setup and tutorial|Click here for the the setup and tutorial material.]]</font>
:'''Material''': <font size="+1">[[CDSW/Day 0 setup and tutorial|Click here for the the setup and tutorial material.]]</font>
::  Python practice through short projects (see below) on a variety of fun and practical topics:
::* [[Baby_Names | Baby Names]]
::* [[Learnpython.org exercises]]


===10/11 Introduction to Programming ===
===10/11 Introduction to Programming ===
Intro to Programming [[CDSW/Day_1_lecture|A 2.5 hour lecture-based introduction to the Python programming language]]


: '''Objectives''': Programming is an essential tool for data science and is useful for solving many other problems. The goal of this session will be to introduce programming in the [http://www.python.org/ Python programming language]. Each participant will leave having solved a real problem and will have built their first real programming project.
: '''Objectives''': Programming is an essential tool for data science and is useful for solving many other problems. The goal of this session will be to introduce programming in the [http://www.python.org/ Python programming language]. Each participant will leave having solved a real problem and will have built their first real programming project.


=== 10/18 Introduction to Programming Part 2 ===
: '''Class Material''': We will use this [https://communitydata.science/~mako/cdsw-wi2020-lecture1-20200118.ogv overview on using Python] -- there are notes on what this includes located here: [[CDSW/Day_1_lecture|a lecture-based introduction to the Python programming language]]
 
:: Python practice through short projects (see below) on a variety of fun and practical topics:
::* [[Baby_Names | Baby Names]]
::* [[Learnpython.org exercises]]
:: Wrap-up, next steps, and upcoming opportunities for learning and practicing Python


===10/25 Importing Data from web APIs Part 1 ===
===10/18 Importing Data from web APIs Part 1 ===


'''Course Material'''
[[CDSW/Day_2_Lecture|An interactive introduction to the web programming and APIs]]
[[CDSW/Day_2_Lecture|An interactive introduction to the web programming and APIs]]


'''Prep for next week''' Complete the [[Twitter authentication setup]] ''before'' we meet. If you plan to complete the Yelp session, you need to complete the [[Yelp authentication setup]] instructions.
'''Prep for next week''' Complete the [[Twitter authentication setup]] ''before'' we meet. If you plan to complete the Yelp session, you need to complete the [[Yelp authentication setup]] instructions.


===11/1 Importing Data from web APIs Part 2 ===
===10/25 Importing Data from web APIs Part 2 ===


Web API practice through short projects (see below) on a variety of fun and practical topics:
Web API practice through short projects (see below) on a variety of fun and practical topics:
Line 57: Line 57:
::* [[Wikipedia (CDSW)|Wikipedia]]
::* [[Wikipedia (CDSW)|Wikipedia]]
::* [[Yelp (CDSW)|Yelp]] - Get a [[Yelp API Key]] ahead of time so you're ready to go.
::* [[Yelp (CDSW)|Yelp]] - Get a [[Yelp API Key]] ahead of time so you're ready to go.
:: '''Wrap-up: 3:30pm-4pm''': Wrap-up, next steps, and upcoming opportunities for learning and practicing Python
::* [[TikTok (CDSW)|TikTok]]


: '''Objectives''': An important step in doing data science is collecting data. The goal of this session will be to teach participants how to get data from the [https://en.wikipedia.org/wiki/Application_programming_interface public application programming interfaces] ("APIs") common to many social media and online communities. Although we will use the APIs provided by Wikipedia, Twitter, and Yelp in the session, the principles and techniques are common to many other online communities.
: '''Objectives''': An important step in doing data science is collecting data. The goal of this session will be to teach participants how to get data from the [https://en.wikipedia.org/wiki/Application_programming_interface public application programming interfaces] ("APIs") common to many social media and online communities. Although we will use the APIs provided by Wikipedia, Twitter, and Yelp in the session, the principles and techniques are common to many other online communities.


=== 11/8: Data Analysis and Visualization  ===
=== 11/1: Data Analysis and Visualization  ===


:: [[CDSW/Day_3_Lecture|interactive lecture]]
:: [[CDSW/Day_3_Lecture|interactive lecture]]


=== 11/15 Web API practice through independent projects ===
=== 11/8 Web API practice through independent projects ===
::* [[Seattle_open_data|Visualization and analysis of civic data from data.seattle.gov]]
::* [[Seattle_open_data|Visualization and analysis of civic data from data.seattle.gov]]
::* [[CDSW/Review_Wk1_Wk2 | Review of Week 1 and Week 2]]
::* [[CDSW/Review_Wk1_Wk2 | Review of Previous Topics]]
::* Independent projects working on issues of ''your'' choosing!
::* Independent projects working on issues of ''your'' choosing!
:: '''Wrap-up, 3:30-4pm


: '''Objectives''': The goal of data science is to use data to answer questions. In our final session, we will use the Python skills we learned in the first session and the datasets we've created in the second to ask and answer common questions about online and offline communities. We will focus on learning how to generate visualizations, create summary statistics, and test hypotheses.
: '''Objectives''': The goal of data science is to use data to answer questions. In our final session, we will use the Python skills we learned in the first session and the datasets we've created in the second to ask and answer common questions about online and offline communities. We will focus on learning how to generate visualizations, create summary statistics, and test hypotheses.


=== 11/22 Leveling up: Using computation servers ===
=== 11/15 Leveling up: Using computation servers ===
* hyak
* kibo
* wikiq
* git


=== 11/29 ===
'''Objectives:'''
Let's not be limited to what our laptops can do: there are supercomputers at our disposal, and code developed by previous folks. Let's explore those group resources!


From datasets to dataframes: getting started with R
=== 11/22 From data files to dataframes: getting started with R ===


== 12/6 ==
=== 11/29 Functions and Figures in R ===


== 12/13 ==
=== 12/6 Putting the pieces together with Overleaf, Rmd, Dropbox, and Zotero ===

Latest revision as of 20:32, 15 November 2022

The CDSC Computational Social Science Workshop in Fall 2022 is a series of project-based workshops being held at the University of Washington for anyone interested in learning how to use programming and data science tools to ask and answer questions about online communities like Wikipedia, Twitter, free and open source software, and civic media. If taken with a faculty supervisor, the workshop is intended to be suitable for university-level credit.

The Fall 2022 workshop will take place:

  • Tuesdays, 12-2pm Pacific time. The first hour will be interactive lecture and discussion, and the second hour devoted to questions, co-working, and 1-on-1 discussion
  • In-Person for UW folks, on Jitsi for folks elsewhere.

This workshop is for people with absolutely no previous programming experience who want to use data to answer questions about the social world.

The goal is that, after the workshop, participants will be able to use data to produce numbers, hypothesis tests, tables, and graphical visualizations to answer questions like:

  • Are new contributors in Wikipedia this year sticking around longer or contributing more than people who joined last year?
  • Who are the most active or influential users of a particular Twitter hashtag?
  • Are people who join through a Wikipedia outreach event staying involved? How do they compare to people who decide to join the project outside of the event?

The workshop is based on the curriculum used for the Computational Data Science Workshops.

Schedule[edit]

All sessions are interactive and involve you programming on your own and on your own laptop. Everybody attending should bring a laptop and a power cord so that they don't run out of battery.


10/4 Set Up Your Environment[edit]

Objectives: During this session, you will:
work through any remaining items in the onboarding checklist
set up your development environment
learn how to write and execute Python code in a Jupyter Notebook
learn about printing and using Python as a calculator
Material: Click here for the the setup and tutorial material.


Python practice through short projects (see below) on a variety of fun and practical topics:


10/11 Introduction to Programming[edit]

Objectives: Programming is an essential tool for data science and is useful for solving many other problems. The goal of this session will be to introduce programming in the Python programming language. Each participant will leave having solved a real problem and will have built their first real programming project.
Class Material: We will use this overview on using Python -- there are notes on what this includes located here: a lecture-based introduction to the Python programming language

10/18 Importing Data from web APIs Part 1[edit]

Course Material An interactive introduction to the web programming and APIs

Prep for next week Complete the Twitter authentication setup before we meet. If you plan to complete the Yelp session, you need to complete the Yelp authentication setup instructions.

10/25 Importing Data from web APIs Part 2[edit]

Web API practice through short projects (see below) on a variety of fun and practical topics:

Objectives: An important step in doing data science is collecting data. The goal of this session will be to teach participants how to get data from the public application programming interfaces ("APIs") common to many social media and online communities. Although we will use the APIs provided by Wikipedia, Twitter, and Yelp in the session, the principles and techniques are common to many other online communities.

11/1: Data Analysis and Visualization[edit]

interactive lecture

11/8 Web API practice through independent projects[edit]

Objectives: The goal of data science is to use data to answer questions. In our final session, we will use the Python skills we learned in the first session and the datasets we've created in the second to ask and answer common questions about online and offline communities. We will focus on learning how to generate visualizations, create summary statistics, and test hypotheses.

11/15 Leveling up: Using computation servers[edit]

  • hyak
  • kibo
  • wikiq
  • git

Objectives: Let's not be limited to what our laptops can do: there are supercomputers at our disposal, and code developed by previous folks. Let's explore those group resources!

11/22 From data files to dataframes: getting started with R[edit]

11/29 Functions and Figures in R[edit]

12/6 Putting the pieces together with Overleaf, Rmd, Dropbox, and Zotero[edit]