Editing CDSC Computational Social Science Workshop (Fall 2022)

From CommunityData

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 17: Line 17:


== Schedule ==
== Schedule ==
There will be a mandatory evening setup session '''6:00-9:00pm on Friday January 17''' and three workshops held from '''9:45am-4pm on three Saturdays (January 18 and February 1 and 15)'''. Each Saturday session will involve a period for lecture and technical demonstrations in the morning. This will be followed by a lunch graciously provided by the [http://escience.washington.edu/ eScience Institute at UW]. The rest of the day will be followed by group work on programming and data science projects supported by more experienced mentors.


'''All sessions are interactive and involve you programming on your own and on your own laptop. Everybody attending should bring a laptop and a power cord so that they don't run out of battery.'''
'''All sessions are interactive and involve you programming on your own and on your own laptop. Everybody attending should bring a laptop and a power cord so that they don't run out of battery.'''


=== Session 0: Setup and Programming Tutorial (Friday January 17 evening)===


===10/4 Set Up Your Environment===
Come to the '''[http://www.washington.edu/maps/#!/cmu Communications Building (CMU) 104]''' between 6:00 and 9:00pm. It's OK if you come a little late but you'll want to have as much time as you can to finish the setup and self-directed assignments so come as close to 6:00pm as you can. Most people will finish early but some people will definitely need the full 3 hours. It's hard to know in advance where problems will crop up so please come on time even if you are confident.


: '''Objectives''': During this session, you will:
:'''Time''': 6-9pm
:'''Location''': '''[http://www.washington.edu/maps/#!/cmu Communications Building (CMU) 104]'''
: '''Objectives''': During this session, mentors will help you:


:: work through any remaining items in the onboarding checklist
:: set up your development environment
:: set up your development environment
:: learn how to write and execute Python code in a Jupyter Notebook
:: learn how to write and execute Python code in a Jupyter Notebook
Line 32: Line 36:
:'''Material''': <font size="+1">[[CDSW/Day 0 setup and tutorial|Click here for the the setup and tutorial material.]]</font>
:'''Material''': <font size="+1">[[CDSW/Day 0 setup and tutorial|Click here for the the setup and tutorial material.]]</font>


'''Note''': Because we expect to hit the ground running on our first full day, we will meet to help participants get software installed and to work through a self-guided tutorial that will help ensure that everyone has the skills and vocabulary to start programming and learning when we meet the following morning.
=== Session 1: Introduction to Programming (January 18) ===
Come to '''[http://uw.edu/maps/?sav Savery Hall (SAV) 260] by 9:45am'''.
Plan to be on UW campus by 9:45am. You will need time to get settled and setup. We will start lecturing promptly at 10am. There will be coffee!
: '''Time:''' 9:45am-4pm
: '''Location:''' [http://uw.edu/maps/?sav Savery Hall (SAV)]


:: Python practice through short projects (see below) on a variety of fun and practical topics:
: '''Schedule'''
:: '''Morning, 10am-12:20 (SAV 260)''': [[CDSW/Day_1_lecture|A 2.5 hour lecture-based introduction to the Python programming language]]
:: '''Lunch, 12:20-1pm (Savery Hall in the downstairs hallway)''': We'll provide lunch (pizza!)
:: '''Afternoon, 1pm-3:30pm (SAV 130, 137, 138, 156):''' Python practice through short projects (see below) on a variety of fun and practical topics:
::* [[Baby_Names | Baby Names]]
::* [[Baby_Names | Baby Names]]
::* [[Learnpython.org exercises]]
::* [[Learnpython.org exercises]]
:: '''Wrap-up, 3:30pm-4pm (SAV 260):''' Wrap-up, next steps, and upcoming opportunities for learning and practicing Python


: '''Objectives''': Programming is an essential tool for data science and is useful for solving many other problems. The goal of this session will be to introduce programming in the [http://www.python.org/ Python programming language]. Each participant will leave having solved a real problem and will have built their first real programming project.


===10/11 Introduction to Programming ===
=== Session 2: Importing Data from web APIs (February 1) ===
Come to '''[http://uw.edu/maps/?sav Savery Hall (SAV) 260] by 9:45am'''. You will need time to get settled and setup. We will start lecturing promptly at 10am. There will be coffee!


: '''Objectives''': Programming is an essential tool for data science and is useful for solving many other problems. The goal of this session will be to introduce programming in the [http://www.python.org/ Python programming language]. Each participant will leave having solved a real problem and will have built their first real programming project.
: '''Time''': 9:45am-4pm


: '''Class Material''': We will use this [https://communitydata.science/~mako/cdsw-wi2020-lecture1-20200118.ogv overview on using Python] -- there are notes on what this includes located here: [[CDSW/Day_1_lecture|a lecture-based introduction to the Python programming language]]
: '''Location''': [http://uw.edu/maps/?sav Savery Hall (SAV)]  


===10/18 Importing Data from web APIs Part 1 ===
: '''Schedule''':
:: '''Morning: 10am-12:20 (SAV 260)''': [[CDSW/Day_2_Lecture|A 2.5 hour lecture-based introduction to the web programming and APIs]]
:: '''Lunch: 12:20-1pm (Savery Hall downstairs hallway)''': We'll provide lunch (TBD)
:: '''Afternoon: 1pm-3:30pm (SAV 130, 137, 138, 156)''': Web API practice through short projects (see below) on a variety of fun and practical topics:
::* [[Twitter (CDSW)|Twitter]]
::* [[Wikipedia (CDSW)|Wikipedia]]
::* [[Yelp (CDSW)|Yelp]] - Get a [[Yelp API Key]] ahead of time so you're ready to go.
:: '''Wrap-up: 3:30pm-4pm''': Wrap-up, next steps, and upcoming opportunities for learning and practicing Python


'''Course Material'''
: '''Objectives''': An important step in doing data science is collecting data. The goal of this session will be to teach participants how to get data from the [https://en.wikipedia.org/wiki/Application_programming_interface public application programming interfaces] ("APIs") common to many social media and online communities. Although we will use the APIs provided by Wikipedia, Twitter, and Socrata in the session, the principles and techniques are common to many other online communities.
[[CDSW/Day_2_Lecture|An interactive introduction to the web programming and APIs]]


'''Prep for next week''' Complete the [[Twitter authentication setup]] ''before'' we meet. If you plan to complete the Yelp session, you need to complete the [[Yelp authentication setup]] instructions.
: An outline for the lecture can be found [[CDSW/Day_2_lecture|here]] and a list of potential projects for the afternoon session are listed below:


===10/25 Importing Data from web APIs Part 2 ===
'''Important Note:''' If you plan to attend the Twitter afternoon session, you need to complete the [[Twitter authentication setup]] ''before'' the afternoon setup on Saturday. If you plan to complete the Yelp session, you need to complete the [[Yelp authentication setup]] instructions. There's no promise that we will be running these sessions this weekend unless there is demand but you will need to have done these if you want to attend the session.


Web API practice through short projects (see below) on a variety of fun and practical topics:
=== Session 3: Data Analysis and Visualization (February 15) ===
::* [[Twitter (CDSW)|Twitter]]
Come to '''[http://uw.edu/maps/?sav Savery Hall (SAV) 260] by 9:45am'''. You will need time to get settled and setup. We will start lecturing promptly at 10am. There will be coffee!
::* [[Wikipedia (CDSW)|Wikipedia]]
::* [[Yelp (CDSW)|Yelp]] - Get a [[Yelp API Key]] ahead of time so you're ready to go.
::* [[TikTok (CDSW)|TikTok]]


: '''Objectives''': An important step in doing data science is collecting data. The goal of this session will be to teach participants how to get data from the [https://en.wikipedia.org/wiki/Application_programming_interface public application programming interfaces] ("APIs") common to many social media and online communities. Although we will use the APIs provided by Wikipedia, Twitter, and Yelp in the session, the principles and techniques are common to many other online communities.
: '''Time''': 9:45am-4pm


=== 11/1: Data Analysis and Visualization  ===
: '''Location:''' [http://uw.edu/maps/?sav Savery Hall (SAV) 260]


:: [[CDSW/Day_3_Lecture|interactive lecture]]
: '''Schedule''':


=== 11/8 Web API practice through independent projects ===
:: '''Morning, 10am-12:20 (SAV 260):''' [[CDSW/Day_3_Lecture|2.5 hour interactive lecture]]
:: '''Lunch, 12:20-1pm (TBD):''' We'll provide lunch ([[:wikipedia:Bahn mi|Bahm mi]] from Saigon Deli)
:: '''Afternoon, 1pm-3:30pm (SAV 130, 137, 138, 156): '''Web API practice through independent projects
::* [[Seattle_open_data|Visualization and analysis of civic data from data.seattle.gov]]
::* [[Seattle_open_data|Visualization and analysis of civic data from data.seattle.gov]]
::* [[CDSW/Review_Wk1_Wk2 | Review of Previous Topics]]
::* [[CDSW/Review_Wk1_Wk2 | Review of Week 1 and Week 2]]
::* Independent projects working on issues of ''your'' choosing!
::* Independent projects working on issues of ''your'' choosing!
:: '''Wrap-up, 3:30-4pm


: '''Objectives''': The goal of data science is to use data to answer questions. In our final session, we will use the Python skills we learned in the first session and the datasets we've created in the second to ask and answer common questions about online and offline communities. We will focus on learning how to generate visualizations, create summary statistics, and test hypotheses.
: '''Objectives''': The goal of data science is to use data to answer questions. In our final session, we will use the Python skills we learned in the first session and the datasets we've created in the second to ask and answer common questions about online and offline communities. We will focus on learning how to generate visualizations, create summary statistics, and test hypotheses.


=== 11/15 Leveling up: Using computation servers ===
== Venue and Logistics ==
* hyak
* kibo
* wikiq
* git
 
'''Objectives:'''
Let's not be limited to what our laptops can do: there are supercomputers at our disposal, and code developed by previous folks. Let's explore those group resources!
 
=== 11/22 From data files to dataframes: getting started with R ===
 
=== 11/29 Functions and Figures in R ===


=== 12/6 Putting the pieces together with Overleaf, Rmd, Dropbox, and Zotero ===
=== What to bring ===
# a '''laptop'''
#*  for '''Session 0''' make sure that you have about 1GB of space free so you can install Python and all the necessary other software
#* for '''Sessions 1-3''' bring your laptop with Python set up
# a '''power cord'''
# a '''sense of adventure!'''
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see CommunityData:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)