Editing Statistics and Statistical Programming (Winter 2021)

From CommunityData

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 9: Line 9:
:* [https://discord.com Discord] — for synchronous course meetings as well as asyncronous discussion and chat.
:* [https://discord.com Discord] — for synchronous course meetings as well as asyncronous discussion and chat.
:* [https://wiki.communitydata.science/Statistics_and_Statistical_Programming_(Winter_2021) This syllabus wiki page] — for nearly everything else.
:* [https://wiki.communitydata.science/Statistics_and_Statistical_Programming_(Winter_2021) This syllabus wiki page] — for nearly everything else.
:* [https://www.dropbox.com/home/COM520-shared_files-UW-2021-Q1 Dropbox] —Filesharing via Dropbox.


;Instructor: [[Benjamin Mako Hill]] ([mailto:makohill@uw.edu makohill@uw.edu])
;Instructor: [[Benjamin Mako Hill]] ([mailto:makohill@uw.edu makohill@uw.edu])
:Office Hours: By appointment (I'm usually available via chat during "business hours.") You can view out [https://mako.cc/calendar/ my calendar] and/or [https://harmonizely.com/mako put yourself on it]. If you schedule a meeting, we'll meet in the Jitsi link you'll get through the scheduling app.
:Office Hours: {{tbd}} and by appointment (I'm usually available via chat during "business hours.")


<br clear=all>
<br clear=all>
Line 54: Line 53:
You should expect this syllabus to be a dynamic document. Although the core expectations for this class are fixed, the details of readings and assignments ''will'' shift based on how the class goes, guest speakers that I might arrange, my own readings in this area, etc. As a result, there are three important things to keep in mind:
You should expect this syllabus to be a dynamic document. Although the core expectations for this class are fixed, the details of readings and assignments ''will'' shift based on how the class goes, guest speakers that I might arrange, my own readings in this area, etc. As a result, there are three important things to keep in mind:


* Although details on this syllabus will change, I will try to ensure that I never change readings more than six days before they are due. We will send an announcement '''no later than before we go to sleep each Tuesday evening''' that fixes the schedule for the next week. This means that if I don't fill in a reading marked "{{tbd}}" or "{{forthcoming}}" six days before it's due, it is dropped. If we don't change something marked "{{tentative}}" before the deadline, then it is assigned. This also means that if you plan to read more than six days ahead, contact the teaching team first.
* Although details on this syllabus will change, I will try to ensure that I never change readings more than six days before they are due. We will send an announcement '''no later than before we go to sleep each Tuesday evening''' that fixes the schedule for the next week. This means that if I don't fill in a reading marked "{{tbd}}" six days before it's due, it is dropped. If we don't change something marked "{{tentative}}" before the deadline, then it is assigned. This also means that if you plan to read more than six days ahead, contact the teaching team first.
* Because this syllabus a wiki, you will be able to track every change by clicking the history button on this page when I make changes. I will summarize these changes in the weekly [https://canvas.uw.edu/courses/1369415/announcements an announcement on Canvas] sent that will be emailed to everybody in the class. Closely monitor your email or the announcements section on the [https://canvas.uw.edu/courses/1369415/announcements course website on Canvas] to make sure you don't miss these announcements.
* Because this syllabus a wiki, you will be able to track every change by clicking the history button on this page when I make changes. I will summarize these changes in the weekly [https://canvas.uw.edu/courses/1369415/announcements an announcement on Canvas] sent that will be emailed to everybody in the class. Closely monitor your email or the announcements section on the [https://canvas.uw.edu/courses/1369415/announcements course website on Canvas] to make sure you don't miss these announcements.
* I will ask the class for voluntary anonymous feedback frequently — especially toward the beginning of the quarter. Please let me know what is working and what can be improved. In the past, I have made many adjustments to courses that I teach while the quarter progressed based on this feedback.
* I will ask the class for voluntary anonymous feedback frequently — especially toward the beginning of the quarter. Please let me know what is working and what can be improved. In the past, I have made many adjustments to courses that I teach while the quarter progressed based on this feedback.
Line 110: Line 109:
* Reinhart, Alex. 2015. ''Statistics Done Wrong: The Woefully Complete Guide''. SF, CA: No Starch Press. {{avail-uw|1=https://alliance-primo.hosted.exlibrisgroup.com/primo-explore/fulldisplay?docid=CP71226818410001451&context=L&vid=UW&lang=en_US&search_scope=all&adaptor=Local%20Search%20Engine&tab=default_tab&query=any,contains,statistics%20done%20wrong}}
* Reinhart, Alex. 2015. ''Statistics Done Wrong: The Woefully Complete Guide''. SF, CA: No Starch Press. {{avail-uw|1=https://alliance-primo.hosted.exlibrisgroup.com/primo-explore/fulldisplay?docid=CP71226818410001451&context=L&vid=UW&lang=en_US&search_scope=all&adaptor=Local%20Search%20Engine&tab=default_tab&query=any,contains,statistics%20done%20wrong}}


This book provides a readable conceptual introduction to some common failures in statistical analysis that you should learn to recognize and avoid. It was also written by a Ph.D. student. You have access to an electronic copy via the UW libraries (you'll need to sign-in and/or use the [[#VPN Notice|UW VPN]] to access it), but you may find it helpful to purchase as well.
This book provides a readable conceptual introduction to some common failures in statistical analysis that you should learn to recognize and avoid. It was also written by a Ph.D. student. You have access to an electronic copy via the NU library (you'll need to sign-in and/or use the NU VPN to access it), but you may find it helpful to purchase as well.


A few other books may be useful resources while you're learning to analyze, visualize, and interpret statistical data with R. I will share some advice about these during the first class meeting:
A few other books may be useful resources while you're learning to analyze, visualize, and interpret statistical data with R. I will share some advice about these during the first class meeting:
Line 129: Line 128:
* [https://depts.washington.edu/acelab/proj/Rstats/index.html Statistical Analysis and Reporting in R] — A set of resources created and distributed by Jacob Wobbrock (University of Washington, School of Information) in conjunction with a MOOC he teaches. Contains cheatsheets, code snippets, and data to help execute commonly encountered statistical procedures in R.
* [https://depts.washington.edu/acelab/proj/Rstats/index.html Statistical Analysis and Reporting in R] — A set of resources created and distributed by Jacob Wobbrock (University of Washington, School of Information) in conjunction with a MOOC he teaches. Contains cheatsheets, code snippets, and data to help execute commonly encountered statistical procedures in R.
* [https://www.datacamp.com DataCamp] offers introductory R courses. Northwestern usually has some free accounts that get passed out via Research Data Services each quarter. Apparently, if you are taking or teaching relevant coursework, instructors can [https://www.datacamp.com/groups/education request] free access to DataCamp for their courses from DataCamp. If folks are interested in this, I can reach out.
* [https://www.datacamp.com DataCamp] offers introductory R courses. Northwestern usually has some free accounts that get passed out via Research Data Services each quarter. Apparently, if you are taking or teaching relevant coursework, instructors can [https://www.datacamp.com/groups/education request] free access to DataCamp for their courses from DataCamp. If folks are interested in this, I can reach out.
* [https://brownmath.com/swt/symbol.htm Statistics symbols you need to know] which is just what it says on the tin. Thanks Kate Rich!


== Assignments ==
== Assignments ==
Line 167: Line 165:
==== Research project plan and dataset identification ====
==== Research project plan and dataset identification ====


;Due date: Friday January 15, 2021
;Due date: Friday January 8, 2021
;Maximum length: 500 words (~1-2 pages)
;Maximum length: 500 words (~1-2 pages)


Line 184: Line 182:
* Do some Google Scholar and normal internet searching for datasets in your research area. You'll probably be surprised at what's available.
* Do some Google Scholar and normal internet searching for datasets in your research area. You'll probably be surprised at what's available.
* Take a look at datasets available in the [https://dataverse.harvard.edu/ Harvard Dataverse] (a very large collection of social science research data) or one of the other members of the [http://dataverse.org/ Dataverse network].
* Take a look at datasets available in the [https://dataverse.harvard.edu/ Harvard Dataverse] (a very large collection of social science research data) or one of the other members of the [http://dataverse.org/ Dataverse network].
* Look at the collection of social scientific datasets at [https://www.icpsr.umich.edu/icpsrweb/ICPSR/ ICPSR at the University of Michigan] (UW is a member). There are an enormous number of very rich datasets.
* Look at the collection of social scientific datasets at [https://www.icpsr.umich.edu/icpsrweb/ICPSR/ ICPSR at the University of Michigan] (NU is a member). There are an enormous number of very rich datasets.
* Use the [http://scientificdata.isa-explorer.org/index.html ISA Explorer] to find datasets. Keep in mind the large majority of datasets it will search are drawn from the natural sciences.
* Use the [http://scientificdata.isa-explorer.org/index.html ISA Explorer] to find datasets. Keep in mind the large majority of datasets it will search are drawn from the natural sciences.
* The City of Seattle has one of the best [https://data.seattle.gov/ data portal sites] of any municipality in the U.S. (and better than many federal agencies). There are also numerous administrative datasets released by other public entities (try searching!) that you might find inspiring.
* The City of Seattle has one of the best [https://data.seattle.gov/ data portal sites] of any municipality in the U.S. (and better than many federal agencies). There are also numerous administrative datasets released by other public entities (try searching!) that you might find inspiring.
Line 195: Line 193:
==== Research project planning document ====
==== Research project planning document ====


;Due date: February 12, 2021
;Due date: January 31, 2021
;Suggested length: ~5 pages
;Suggested length: ~5 pages


Line 202: Line 200:
I will also provide example planning documents via our Canvas site:
I will also provide example planning documents via our Canvas site:
* [https://canvas.northwestern.edu/files/9439380/download?download_frd=1 One by public health researcher Mika Matsuzaki]. The first planning document I ever saw and still one of the best. It's missing a measures section. It's also focused on a research context that is probably very different from yours, but try not to get bogged down by that and imagine how you might map the structure of the document to your own work.
* [https://canvas.northwestern.edu/files/9439380/download?download_frd=1 One by public health researcher Mika Matsuzaki]. The first planning document I ever saw and still one of the best. It's missing a measures section. It's also focused on a research context that is probably very different from yours, but try not to get bogged down by that and imagine how you might map the structure of the document to your own work.
* [One provided as an appendix to Gerber and Green's excellent textbook, ''Field Experiments: Design, Analysis, and Interpretation'' (FEDAI)]. It's over-detailed and over-long for the purposes of this assignment, but nevertheless an exemplary approach to planning empirical quantitative research in a careful, intentional way that is worthy of imitation.
* [One by Jim Maddock] created as part of a qualifying exam early in 2019. Jim doesn't provide dummy tables or anticipated findings/contributions, but he has an especially phenomenal explanation of the conceptual relationships and processes he wants to test. {{tentative}}
* [One provided as an appendix to Gerber and Green's excellent textbook, ''Field Experiments: Design, Analysis, and Interpretation'' (FEDAI)]. It's over-detailed and over-long for the purposes of this assignment, but nevertheless an exemplary approach to planning empirical quantitative research in a careful, intentional way that is worthy of imitation. {{tentative}}


==== Research project presentation ====
==== Research project presentation ====


;Presentation due date: March 10, 2021
;Presentation due date: March 11, 2021
;Maximum length: 15 minutes
;Maximum length: 15 minutes


Line 212: Line 211:
[[Statistics_and_Statistical_Programming_(Spring_2019)/Final_project_presentations]]
[[Statistics_and_Statistical_Programming_(Spring_2019)/Final_project_presentations]]
--->
--->
You will also create and record a short presentation of your final project. The presentation will provide an opportunity to share a brief overview of your project and findings with the other members of the class. Since you will all give other research presentations throughout your career, I strongly encourage you to take the opportunity to refine your academic presentation skills. The document [https://canvas.uw.edu/files/74392679/download?download_frd=1 Creating a Successful Scholarly Presentation] (file posted to Canvas) may be useful.
You will also create and record a short presentation of your final project. The presentation will provide an opportunity to share a brief overview of your project and findings with the other members of the class. Since you will all give other research presentations throughout your career, I strongly encourage you to take the opportunity to refine your academic presentation skills. The document [Creating a Successful Scholarly Presentation] (file posted to Canvas) may be useful. {{tentative}}
 
Additional details about the presentation goals, format suggestions, resources, and more will be provided later in the quarter.


==== Research project paper ====
==== Research project paper ====
Line 243: Line 244:


* Problem set discussion: 40%
* Problem set discussion: 40%
* Project identification: 5%
* Proposal identification: 5%
* Final project planning document: 5%
* Final project planning document: 5%
* Final project presentation: 15%
* Final project presentation: 15%
* Final project paper: 35%
* Final project paper: 35%


I will jointly and holistically evaluate your participation in problem set discussions along four dimensions: participation, preparation, engagement, and contribution. These are quite similar to the dimensions described in the "Participation Rubric" section of [[User:Benjamin Mako Hill/Assessment|my assessment page]]. Exceptional participation means excelling along all four dimensions. Please note that participation ≠ talking/typing more and I encourage all of us to seek balance in our discussions.
I will jointly and holistically evaluate your participation in problem set discussions along four dimensions: attendance, preparation, engagement, and contribution. These are quite similar to the dimensions described in the "Participation Rubric" section of [[User:Benjamin Mako Hill/Assessment|my assessment page]]. Exceptional participation means excelling along all four dimensions. Please note that participation ≠ talking/typing more and I encourage all of us to seek balance in our discussions.


My assessment of your final project proposal, planning document, presentation, and paper will reflect the clarity of the work, the effective execution and presentation of quantitative empirical analysis, as well as the quality and originality of the analysis. Throughout the quarter, we will talk about the qualities of exemplary quantitative research. In general, I expect your final project to embody these exemplary qualities.
My assessment of your final project proposal, planning document, presentation, and paper will reflect the clarity of the work, the effective execution and presentation of quantitative empirical analysis, as well as the quality and originality of the analysis. Throughout the quarter, we will talk about the qualities of exemplary quantitative research. In general, I expect your final project to embody these exemplary qualities.
Line 256: Line 257:
When reading the schedule below, the following key might help resolve ambiguity: §n denotes chapter n; §n.x denotes section x of chapter n; §n.x-y denotes sections x through y (inclusive) of chapter n.
When reading the schedule below, the following key might help resolve ambiguity: §n denotes chapter n; §n.x denotes section x of chapter n; §n.x-y denotes sections x through y (inclusive) of chapter n.


The required and recommended tasks are meant to be completed '''before class''' and will typically be necessary to complete the problem sets for each day.
The required/recommended tasks should be completed '''before class.'''


=== Day 1: Monday January 4: Intro and setup ===
=== Day 1: Monday January 4: Intro and setup ===


'''Class material:'''
'''Resources:'''


* [[/Day 1 session plan]]
* [[/Day 1 session plan]]
Line 269: Line 270:
* Confirm course registration and access to [https://www.openintro.org/book/os/ the textbook] (pdf download available for $0 and b&w paperbacks for $20) as well as any software and web-services you'll need for course (Discord, Canvas, this wiki, R, RStudio). Discord invites will be sent via email.
* Confirm course registration and access to [https://www.openintro.org/book/os/ the textbook] (pdf download available for $0 and b&w paperbacks for $20) as well as any software and web-services you'll need for course (Discord, Canvas, this wiki, R, RStudio). Discord invites will be sent via email.


=== Day 2: Wednesday January 6: Data and R ===
=== Day 2: Wednesday January 6: Introduction to Data and R ===
 
'''Resources:'''


'''Class material:'''
* [[/Day 2 session plan]]
* [[/Day 2 session plan]]


'''Required readings and resources:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §1.1-1.3 (Introduction to data)
 
'''Recommended readings and resources:'''
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM6pZ76FD3NoCvvgkj_p-dE8 lecture materials for §1.1-3 (Videos 1-4 in the playlist)].


'''Homework:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §1.1-1.3 (Introduction to data).  
* Complete '''Problem set 2''': exercises from OpenIntro §1: (1.6, 1.9, 1.10, 1.16, 1.21, 1.40, 1.42, 1.43). Remember that solutions to odd-numbered problems are in the book!
* Complete [[/Problem set 2]]
* Problem set 2 worked solutions [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_02.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_02.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_02.pdf PDF]]


=== Day 3: Monday January 11: Numerical and categorical data ===
=== Day 3: Monday January 11:  ===


'''Class material:'''
'''Resources:'''


* [[/Day 3 session plan]]
* [[/Day 3 session plan]]
Line 292: Line 289:
'''Required tasks:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §2.1-2 (Numerical and categorical data).  
* Read Diez, Çetinkaya-Rundel, and Barr: §2.1-2 (Numerical and categorical data).  
* The R tutorial webcast and RMarkdown tutorial that I've put together including:
* Complete [[/Problem set 3]] (OpenIntro questions & programming challenges)
** COM520 R Tutorial #1: What is R, RStudio, etc? [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-01.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-01.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-01.pdf PDF]]
** COM520 R Tutorial #2: Intro to R [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-02.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-02.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-02.pdf PDF]]


'''Recommended tasks:'''
'''Recommended tasks:'''


* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM6pZ76FD3NoCvvgkj_p-dE8 Lecture materials for §2.1 and §2.2 (Videos 6-7 in the playlist)].
* Review [https://www.youtube.com/playlist?list=PLkIselvEzpM6pZ76FD3NoCvvgkj_p-dE8 Lecture materials for §2.1 and §2.2 (Videos 6-7 in the playlist)].
* Watch COM520 [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=62fcbfcf-0b7e-4c6c-bf1a-aca500828992 R Tutorial #1 Screencast] on Panopto
* Work through one (or more) introduction(s) to R and Rstudio so that you can complete problem set 0. Here are several suggestions:
* Watch COM520 [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=4d383a06-3df5-4607-9b16-aca5008289be R Tutorial #2 Screencast] on Panopto
** The [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w01-R_tutorial.html Week 01 R tutorial] (you should also download the [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w01-R_tutorial.rmd .rmd version of the tutorial] that you can open and read/edit in RStudio). These are accompanied by the R and Rstudio intro screencasts ([https://communitydata.cc/~ads/teaching/2019/stats/screencasts/w01-s01-intro.webm Part 1] and [https://communitydata.cc/~ads/teaching/2019/stats/screencasts/w01-s02-intro.webm Part 2]) Aaron created for the 2019 version of the course.
* If you want additional material that will provide an introductions to R, these are great resources:
** Modern Dive [https://moderndive.netlify.app/index.html Statistical inference via data science] Chapter 1: [https://moderndive.netlify.app/1-getting-started.html Getting started with R].
** Modern Dive [https://moderndive.netlify.app/index.html Statistical inference via data science] Chapter 1: [https://moderndive.netlify.app/1-getting-started.html Getting started with R].
** [https://rladiessydney.org/courses/ryouwithme/ RYouWithMe] course [https://rladiessydney.org/courses/ryouwithme/01-basicbasics-0/ "Basic basics" 1 & 2] (and maybe 3 if you're feeling ambitious)
** [https://rladiessydney.org/courses/ryouwithme/ RYouWithMe] course [https://rladiessydney.org/courses/ryouwithme/01-basicbasics-0/ "Basic basics" 1 & 2] (and maybe 3 if you're feeling ambitious).
** Verzani §1 (Getting started)
** Verzani §1 (Getting started).
** Healy §2 (Get started)
** Healy §2 (Get started).


'''Homework:'''
=== Day 4: Wednesday January 13: ===


* Complete [[/Problem set 3]] (OpenIntro questions & programming challenges)
'''Resources:'''
* Problem set 3 worked solutions [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_03.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_03.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_03.pdf PDF]]
 
=== Day 4: Wednesday January 13: Applied data manipulation ===
 
'''Class material:'''


* [[/Day 4 session plan]]
* [[/Day 4 session plan]]


'''Required tasks:'''
'''Required tasks:'''
* COM520 R Tutorial #3 Intro to R [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-03.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-03.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-03.pdf PDF], [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=c9ce270f-9192-4034-bcf7-acae0073b049 Screencast]]
* Complete [[/Problem set 4]] (programming challenges and statistical questions)


'''Recommended tasks:'''
'''Recommended tasks:'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w03-R_tutorial.html Week 3 R tutorial] (note that you can access .rmd or .pdf versions by replacing the suffix of the URL accordingly).
* Additional material from any of the recommended R learning resources suggested last week or elsewhere in the syllabus. In particular, you may find the ModernDive, RYouWithMe, Healy, and/or Wickham and Grolemund resources valuable.
* Additional material from any of the recommended R learning resources suggested last week or elsewhere in the syllabus. In particular, you may find the ModernDive, RYouWithMe, Healy, and/or Wickham and Grolemund resources valuable.
'''Homework:'''
* Complete [[/Problem set 4]] (programming challenges and statistical questions)
* Problem set 4 worked solutions [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_04.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_04.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_04.pdf PDF]]
<!---
<!---
'''Resources'''
'''Resources'''
Line 335: Line 320:


=== NO CLASS: Monday January 18: Martin Luther King Jr Day  ===
=== NO CLASS: Monday January 18: Martin Luther King Jr Day  ===
=== Day 5: Wednesday Janaury 20: Probability and R fundamentals ===
=== Day 5: Wednesday Janaury 20: ===
 
'''Resources:'''
 
* [[/Day 5 session plan]]


'''Required tasks:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §3 (Probability).  
* Read Diez, Çetinkaya-Rundel, and Barr: §3 (Probability).  
* COM520 R Tutorial #4: Additional R fundamentals [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-04.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-04.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-04.pdf PDF]]
'''Recommended tasks:'''
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5EgoOajhw83Ax_FktnlD6n&v=rG-SLQ2uF8U Probability introduction] and [https://www.youtube.com/watch?v=HxEz4ZHUY5Y&list=PLkIselvEzpM5EgoOajhw83Ax_FktnlD6n&index=2 Probability trees] OpenIntro lectures (just videos 1 and 2 in the playlist).
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5EgoOajhw83Ax_FktnlD6n&v=rG-SLQ2uF8U Probability introduction] and [https://www.youtube.com/watch?v=HxEz4ZHUY5Y&list=PLkIselvEzpM5EgoOajhw83Ax_FktnlD6n&index=2 Probability trees] OpenIntro lectures (just videos 1 and 2 in the playlist).
* Watch COM520 [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=25a9cdb2-0dcd-493e-b031-acb3004215b5 R Tutorial #4.1 Screencast] on Panopto
* Complete '''exercises from OpenIntro §3:''' 3.12, 3.15, 3.22, 3.28, 3.34, 3.38
* Watch COM520 [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=02bae034-911a-4ddf-85b3-acb300886150 R Tutorial #4.2 Screencast] on Panopto


'''Resources'''
'''Resources'''
* [https://seeing-theory.brown.edu/index.html#secondPage Seeing Theory §1-2 (Basic Probability and Compound Probability)]
* [https://seeing-theory.brown.edu/index.html#secondPage Seeing Theory §1-2 (Basic Probability and Compound Probability)]


'''Homework:'''
=== Day 6: Monday January 25: Emotional contagion and more advanced R fundamentals: import, tidy, transform, and simulate data; write functions ===
* Complete [[/Problem set 5]] (OpenIntro excercises & programming challenges)
* Problem set 5 worked solutions [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_05-pt1.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_05-pt1.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/worked_solutions/worked_solutions-pset_05-pt1.pdf PDF]]


=== Day 6: Monday January 25: Distributions ===
'''Resources:'''


<!-- '''Class material:'''
* [[/Day 6 session plan]]
* [[/Day 6 session plan]] -->


'''Required tasks:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §4.1-3 (Normal and binomial distributions).  
* Read the paper below as well as the attendant [https://www.pnas.org/content/111/29/10779.1 "Expression of editorial concern"] and [https://www.pnas.org/content/111/29/10779.2 "Correction"] that were subsequently appended to it.
:Kramer, Adam D. I., Jamie E. Guillory, and Jeffrey T. Hancock. 2014. “Experimental Evidence of Massive-Scale Emotional Contagion through Social Networks.” ''Proceedings of the National Academy of Sciences'' 111(24):8788–90. [[http://www.pnas.org/content/111/24/8788.full Open access]]
* Complete [[Statistics_and_Statistical_Programming_(Winter_2020)/pset2|problem set #2]] (due Monday, October 5 at 1pm CT)


'''Recommended tasks:'''
'''Recommended tasks:'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w04-R_tutorial.html Week 4 R tutorial] (as usual, also available as .rmd or .pdf)


* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM6V9h55s0l9Kzivih9BUWeW&v=S_p5D-YXLS4 normal and binomial distributions] OpenIntro lectures (videos 1-3 in the playlist)
=== Day 7: Wednesday January 27: Distributions ===
 
'''Resources:'''
 
* [[/Day 7 session plan]]
 
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §4.1-3 (Normal and binomial distributions).
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM6V9h55s0l9Kzivih9BUWeW&v=S_p5D-YXLS4 normal and binomial distributions] OpenIntro lectures (videos 1-3 in the playlist).
* Complete '''exercises from OpenIntro §4:''' 4.4, 4.6, 4.15, 4.22
 
'''Resources'''
* [https://seeing-theory.brown.edu/index.html#secondPage/chapter3 Seeing Theory §3 (Probability distributions)]
* [https://seeing-theory.brown.edu/index.html#secondPage/chapter3 Seeing Theory §3 (Probability distributions)]


'''Homework:'''
=== Day 8: Monday February 1: Descriptive analysis and visualization of data ===
* Go back and complete any questions from [[/Problem set 5]] that you were not able to get last time.
 
* Complete '''Problem set 6''': exercises from OpenIntro §4: 4.4, 4.6, 4.15, 4.22
'''Resources:'''


=== Day 7: Wednesday January 27: Descriptive analysis and visualization ===
* [[/Day 8 session plan]]


<!-- '''Class material:'''
* [[/Day 7 session plan]]
-->
'''Required tasks:'''
'''Required tasks:'''
* COM520 R Tutorial #5: Visualization using ''ggplot2'' [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-05.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-05.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-05.pdf PDF]]
* Complete [[Statistics_and_Statistical_Programming_(Winter_2020)/pset3|problem set #3]] (due Monday, October 12 at 1pm CT)
 
'''Recommended tasks:'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w05-R_tutorial.html Week 5 R tutorial] and [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w05a-R_tutorial.html Week 5 R tutorial supplement] (both, as usual, also available as .rmd or .pdf).
 
=== Day 9: Wednesday February 3: Foundations for inference ===


'''Homework:'''
'''Resources:'''
* Complete [[/Problem set 7]]


=== Day 8: Monday February 1: Foundations for inference ===
* [[/Day 9 session plan]]


<!--'''Class material:'''
* [[/Day 8 session plan]]
-->
'''Required tasks:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §5 (Foundations for inference).  
* Read Diez, Çetinkaya-Rundel, and Barr: §5 (Foundations for inference).  
* Watch [https://www.youtube.com/watch?v=oLW_uzkPZGA&list=PLkIselvEzpM4SHQojH116fYAQJLaN_4Xo foundations for inference] (videos 1-3 in the playlist) OpenIntro lectures.
* Complete [https://www.openintro.org/book/stat/why05/ Why .05?] OpenIntro video/exercise.
* Complete [https://www.openintro.org/book/stat/why05/ Why .05?] OpenIntro video/exercise.
* Complete '''exercises from OpenIntro §5:''' 5.4, 5.8, 5.10, 5.17, 5.30, 5.35, 5.36


'''Recommended tasks:'''
'''Resources'''
* Read Kelly M., [https://rss.onlinelibrary.wiley.com/doi/pdf/10.1111/j.1740-9713.2013.00693.x Emily Dickinson and monkeys on the stair Or: What is the significance of the 5% significance level?] ''Significance'' 10:5. 2013.
* Kelly M., [https://rss.onlinelibrary.wiley.com/doi/pdf/10.1111/j.1740-9713.2013.00693.x Emily Dickinson and monkeys on the stair Or: What is the significance of the 5% significance level?] ''Significance'' 10:5. 2013.
* [https://seeing-theory.brown.edu/index.html#secondPage/chapter4 Seeing Theory §4 (Frequentist Inference)]
* [https://seeing-theory.brown.edu/index.html#secondPage/chapter4 Seeing Theory §4 (Frequentist Inference)]
* Watch [https://www.youtube.com/watch?v=oLW_uzkPZGA&list=PLkIselvEzpM4SHQojH116fYAQJLaN_4Xo foundations for inference] (videos 1-3 in the playlist) OpenIntro lectures.


'''Homework:'''
=== Day 10: Monday February 8: Reinforced foundations for inference ===
* Complete '''Problem set 8''': exercises from OpenIntro §5: 5.4, 5.8, 5.10, 5.17, 5.30, 5.35, 5.36
 
'''Resources:'''


=== Day 9: Wednesday February 3: Reinforced foundations for inference ===
* [[/Day 10 session plan]]


'''Required tasks:'''
'''Required:'''
* COM520 R Tutorial #6: Distributions in R and more [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-06.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-06.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-06.pdf PDF], [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=4d5cdd94-0e2a-4c21-b912-acc301383f9b Screencast]]
* Complete [[Statistics_and_Statistical_Programming_(Winter_2020)/pset4|problem set #4]]
* Read Reinhart, §1. {{avail-uw|1=https://alliance-primo.hosted.exlibrisgroup.com/primo-explore/fulldisplay?docid=CP71226818410001451&context=L&vid=UW&lang=en_US&search_scope=all&adaptor=Local%20Search%20Engine&tab=default_tab&query=any,contains,statistics%20done%20wrong}}
* Read Reinhart, §1.
* Read the following paper (it will be familiar to those of you in COM501): Kramer, Adam D. I., Jamie E. Guillory, and Jeffrey T. Hancock. 2014. “Experimental Evidence of Massive-Scale Emotional Contagion through Social Networks.” ''Proceedings of the National Academy of Sciences'' 111 (24): 8788–90. https://doi.org/10.1073/pnas.1320040111. {{avail-uw|https://doi.org/10.1073/pnas.1320040111}}
* Revisit the Kramer et al. (2014) paper we read a few weeks ago:
:Kramer, Adam D. I., Jamie E. Guillory, and Jeffrey T. Hancock. 2014. “Experimental Evidence of Massive-Scale Emotional Contagion through Social Networks.” ''Proceedings of the National Academy of Sciences'' 111(24):8788–90. [[http://www.pnas.org/content/111/24/8788.full Open access]] 


'''Recommended tasks:'''
=== Day 11: February 10: Inference for categorical data ===
* Check out [https://gallery.shinyapps.io/CLT_mean/ OpenIntro Central limit theorem for means demo].


'''Homework:'''
'''Resources:'''
* Complete [[/Problem set 9]]


=== Day 10: Monday February 8: Inference for categorical data ===
* [[/Day 11 session plan]]


'''Required tasks:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §6 (Inference for categorical data).  
* Read Diez, Çetinkaya-Rundel, and Barr: §6 (Inference for categorical data).  
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5Gn-sHTw1NF0e8IvMxwHDW&v=_iFAZgpWsx0 inference for categorical data] (videos 1-3 in the playlist) OpenIntro lectures.
* Complete '''exercises from OpenIntro §6:''' 6.10, 6.16, 6.22, 6.30, 6.40 (just parts a and b; part c gets tedious)


'''Recommended tasks:'''
'''Resources'''
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5Gn-sHTw1NF0e8IvMxwHDW&v=_iFAZgpWsx0 inference for categorical data] (videos 1-3 in the playlist) OpenIntro lectures.
* [https://gallery.shinyapps.io/CLT_prop/ OpenIntro Central limit theorem for proportions demo].
* [https://gallery.shinyapps.io/CLT_prop/ OpenIntro Central limit theorem for proportions demo].


'''Homework:'''
=== NO CLASS: Monday February 15: Presidents' Day ===
=== Day 12: Wednesday February 17: Applied inference for categorical data ===


* Complete '''Problem set 10''': exercises from OpenIntro §6: 6.10, 6.16, 6.22, 6.30, 6.40 (just parts a and b; part c gets tedious)
'''Resources:'''


=== Day 11: Wednesday February 10: Applied inference for categorical data ===
* [[/Day 12 session plan]]


'''Required tasks:'''
'''Required tasks:'''
* COM520 R Tutorial #7: Categorical data [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-07.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-07.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-07.pdf PDF]]
* Read Reinhart, §4 and §5 (both are quite short).
* Read Reinhart, §4 and §5 (both are quite short).
* Skim the following (all are referenced in the problem set)
* Skim the following (all are referenced in the problem set)
**  Aronow PM, Karlan D, Pinson LE. (2018). The effect of images of Michelle Obama’s face on trick-or-treaters’ dietary choices: A randomized control trial. ''PLoS ONE'' 13(1): e0189693. https://doi.org/10.1371/journal.pone.0189693. {{avail-free|https://doi.org/10.1371/journal.pone.0189693}}
**  Aronow PM, Karlan D, Pinson LE. (2018). The effect of images of Michelle Obama’s face on trick-or-treaters’ dietary choices: A randomized control trial. PLoS ONE 13(1): e0189693. [https://doi.org/10.1371/journal.pone.0189693 https://doi.org/10.1371/journal.pone.0189693]
** Buechley, Leah and Benjamin Mako Hill. 2010. “LilyPad in the Wild: How Hardware’s Long Tail Is Supporting New Engineering and Design Communities.” Pp. 199–207 in ''Proceedings of the 8th ACM Conference on Designing Interactive Systems.'' Aarhus, Denmark: ACM. {{avail-free|https://mako.cc/academic/buechley_hill_DIS_10.pdf}}
** Buechley, Leah and Benjamin Mako Hill. 2010. “LilyPad in the Wild: How Hardware’s Long Tail Is Supporting New Engineering and Design Communities.” Pp. 199–207 in ''Proceedings of the 8th ACM Conference on Designing Interactive Systems.'' Aarhus, Denmark: ACM. [[https://mako.cc/academic/buechley_hill_DIS_10.pdf PDF available on Hill's personal website]]
** Shaw, Aaron and Yochai Benkler. 2012. A tale of two blogospheres: Discursive practices on the left and right. ''American Behavioral Scientist''. 56(4): 459-487. [[https://doi.org/10.1177%2F0002764211433793 available via NU libraries]]
* Complete [[Statistics_and_Statistical_Programming_(Winter_2020)/pset5|problem set #5]]
'''Resources'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w06-R_tutorial.html Week 06 R tutorial] (it's very short!)


'''Homework:'''
=== Day 13: Monday February 22: Inference for numerical data (part 1) ===


* Complete [[/Problem set 11]]
'''Resources:'''


=== NO CLASS: Monday February 15: Presidents' Day ===
* [[/Day 13 session plan]]
=== Day 12: Wednesday February 17: Inference for numerical data (t-tests and ANOVA) ===
<!--'''Class material:'''
This is a combo of two days, basically...
* [[/Day 12 session plan]] -->


'''Required tasks:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §7.1-5 (Inference for numerical data: differences of means; power calculations, ANOVA, and multiple comparisons).
* Read Diez, Çetinkaya-Rundel, and Barr: §7.1-3 (Inference for numerical data: differences of means).
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5G3IO1tzQ-DUThsJKQzQCD&v=uVEj2uBJfq0 inference for numerical data] (videos 1-4 in the playlist) OpenIntro lectures (and featuring one of the textbook authors!).
* Complete '''exercises from OpenIntro §7:''' 7.12, 7.24, 7.26


'''Recommended tasks:'''
'''Resources'''
* [https://www.openintro.org/go/?id=stat_better_understand_anova&referrer=/book/os/index.php OpenIntro supplement on ANOVA calculations] (particularly useful if you think you'll be doing more ANOVAs).
* [https://gallery.shinyapps.io/CLT_mean/ OpenIntro Central limit theorem for means demo].
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5G3IO1tzQ-DUThsJKQzQCD&v=uVEj2uBJfq0 inference for numerical data] (videos 1-8 in the playlist) OpenIntro lectures (and featuring one of the textbook authors!).
* COM520 R Tutorial #8: t-tests and ANOVA [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-08.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-08.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-08.pdf PDF], [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=70b906b1-88fc-475c-9541-acd100247d13 Screencast]]


'''Homework:'''
=== Day 14: Wednesday February 24: Inference for numerical data (part 2) ====
* Complete [[/Problem set 12]]


=== Day 13: Monday February 22: Linear regression ===
'''Resources:'''
<!-- '''Class material:'''
 
* [[/Day 13 session plan]] -->
* [[/Day 14 session plan]]


'''Required tasks:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §8 (Linear regression).
* Read Diez, Çetinkaya-Rundel, and Barr: §7.4-5 (Inference for numerical data: power calculations, ANOVA, and multiple comparisons).
* Read [https://www.openintro.org/go/?id=stat_more_inference_for_linear_regression&referrer=/book/os/index.php More inference for linear regression] (OpenIntro supplement).
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5G3IO1tzQ-DUThsJKQzQCD&v=uVEj2uBJfq0 inference for numerical data] (videos 4-8 in the playlist) OpenIntro lectures (and featuring one of the textbook authors!).
* Complete '''exercises from OpenIntro §7:''' 7.42, 7.44, 7.46
 
'''Resources'''
* [https://www.openintro.org/go/?id=stat_better_understand_anova&referrer=/book/os/index.php OpenIntro supplement on ANOVA calculations] (useful if you think you'll be doing more ANOVAs).
 
=== Day 15: Monday March 1: Applied inference for numerical data (t-tests, power analysis, ANOVA) ===


'''Recommended tasks:'''
'''Resources:'''
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM63ikRfN41DNIhSgzboELOM linear regression] (videos 1-4 in the playlist) OpenIntro lectures.
* Read [https://seeing-theory.brown.edu/index.html#secondPage/chapter6 Seeing Theory §6 (Regression analysis)]


'''Homework:'''
* [[/Day 15 session plan]]
* Complete [[/Problem set 13]]


=== Day 14: Wednesday February 24: Applied linear regression ===
;[[Statistics_and_Statistical_Programming_(Winter_2020)/w09_session_plan|Session plans]]
<!--
'''Class material:'''
* [[/Day 14 session plan]]
-->


'''Required tasks:'''
'''Required tasks:'''
* Complete [[Statistics_and_Statistical_Programming_(Winter_2020)/pset6|problem set #6]]


* COM520 R Tutorial #9: Linear regression [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-09.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-09.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-09.pdf PDF], [https://uw.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=3fd1ff6d-ba80-46fd-b7e1-acd100247d42 Screencast]]
'''Resources'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w09-R_tutorial.html Week 09 R tutorial]


'''Homework:'''
=== Day 16: Wednesday March 3:  inear regression ===
* Complete [[/Problem set 14]]


=== Day 15: Monday March 1: Multiple and logistic regression ===
'''Resources:'''
<!-- '''Class material:'''


* [[/Day 15 session plan]]-->
* [[/Day 16 session plan]]


'''Required tasks:'''
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §9 (Multiple and logistic regression). (Skim §9.2-9.4)
* Read Diez, Çetinkaya-Rundel, and Barr: §8 (Linear regression).
** '''Disclaimer:''' Mako doesn't like §9.2-9.3, but it should be useful to understand and discuss them, so we'll do that.  
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM63ikRfN41DNIhSgzboELOM linear regression] (videos 1-4 in the playlist) OpenIntro lectures.
* Read [https://www.openintro.org/go/?id=stat_interaction_terms&referrer=/book/os/index.php Interaction terms] (OpenIntro supplement).
* Read [https://www.openintro.org/go/?id=stat_more_inference_for_linear_regression&referrer=/book/os/index.php More inference for linear regression] (OpenIntro supplement).
* Read [https://www.openintro.org/go/?id=stat_nonlinear_relationships&referrer=/book/os/index.php Fitting models for non-linear trends] (OpenIntro supplement).
* Complete '''exercises from OpenIntro §8:''' 8.6, 8.36, 8.40, 8.44
* Complete '''exercises from OpenIntro supplement:''' 4 and 5 (answers provided in the supplement).
'''Resources'''
* [https://seeing-theory.brown.edu/index.html#secondPage/chapter6 Seeing Theory §6 (Regression analysis)]
 
=== Day 17: Monday March 8: Applied linear regression ===


'''Recommended tasks:'''
'''Resources:'''
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM5f1HYzIjFt52SD4izsJ2_I multiple and logistic regression] (videos 1-4 in the playlist) OpenIntro lectures.


'''Homework:'''
* [[/Day 17 session plan]]
* Complete '''Problem set 16''': exercises from OpenIntro §9: 9.4, 9.13, 9.16, 9.18,


'''Required tasks:'''
'''Required tasks:'''
* Complete [[Statistics_and_Statistical_Programming_(Winter_2020)/pset7|Problem set #7]]


=== Day 16: Wednesday March 3: Applied multiple and logistic regression ===
'''Resources'''
<!-- '''Class material:'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w10-R_tutorial.html Week 10 R tutorial]
* [[/Day 16 session plan]] -->


'''Required tasks:'''
=== Day 18: Wednesday March 10: Multiple and logistic regression ====
* COM520 R Tutorial #10: Tutorial on interpreting logistic regression in R [[https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-10.html HTML], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-10.rmd RMarkdown], [https://www.dropbox.com/preview/COM520-shared_files-UW-2021-Q1/r_tutorials/com520-r_tutorial-10.pdf PDF]]


'''Homework:'''
'''Resources:'''
* Complete [[/Problem set 16]]


=== Day 17: Monday March 8:  Consulting Day ===
* [[/Day 18 session plan]]


We'll forgo meeting as a group. Instead, I will meet one-on-one with each of you to work through challenges you're having with your own projects.
'''Required tasks:'''
* Read Diez, Çetinkaya-Rundel, and Barr: §9 (Multiple and logistic regression). (Skim §9.2-9.4)
** '''Disclaimer:''' Aaron doesn't like §9.2-9.3, but it should be useful to understand and discuss them, so we'll do that.  
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM5f1HYzIjFt52SD4izsJ2_I multiple and logistic regression] (videos 1-4 in the playlist) OpenIntro lectures.
* Read [https://www.openintro.org/go/?id=stat_interaction_terms&referrer=/book/os/index.php Interaction terms] (OpenIntro supplement).
* Read [https://www.openintro.org/go/?id=stat_nonlinear_relationships&referrer=/book/os/index.php Fitting models for non-linear trends] (OpenIntro supplement).
* Complete '''exercises from OpenIntro §9:''' 9.4, 9.13, 9.16, 9.18,


* COM520 R Tutorial #11: Bonus material {{forthcoming}} <!-- logistic_regression_interpretation.html -->
'''Resources'''


=== Day 18: Wednesday March 10: Final Presentations ===
==== November 24: Applied multiple and logistic regression ====
<!--
;[[Statistics_and_Statistical_Programming_(Winter_2020)/w11_session_plan|Session plans]]
'''Class material:'''
'''Required tasks:'''
* [[/Day 18 session plan]]
* Complete [[Statistics_and_Statistical_Programming_(Winter_2020)/pset8|Problem set #8]]
-->
'''Resources'''
* Mako Hill created (and Aaron updated) a brief tutorial on [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/logistic_regression_interpretation.html interpreting logistic regression coefficients with examples in R]


<strike>'''Post your video via this "Discussion" on Canvas]''' {{forthcoming}} — Please view and provide constructive feedback on other's videos!
==== December 3: [[#Research project presentation|Research project presentation]] due by 5pm CT ====
'''[https://canvas.uw.edu/courses/1434003/discussion_topics/856868 Post your video via this "Discussion" on Canvas]'''. Please view and provide constructive feedback on other's videos!


* '''Post videos directly to the "Discussion."''' The Canvas text editor has an option to upload/record a video. That's what you want.
* '''Post videos directly to the "Discussion."''' The Canvas text editor has an option to upload/record a video. That's what you want.
* '''Please remember not to over-work/think this.''' I mentioned this in class, but just to reiterate, the focus of this assignment should not be your video editing skills. Please do what you can to record and convey your ideas clearly without devoting insane hours to creating the perfect video.  
* '''Please remember not to over-work/think this.''' I mentioned this in class, but just to reiterate, the focus of this assignment should not be your video editing skills. Please do what you can to record and convey your ideas clearly without devoting insane hours to creating the perfect video.  
* '''Some resources for recording presentations:''' There are a bunch of ways you might record/share your video. Some ideas include using the embedded media recorder in Canvas (!) that can record with with your webcam (maybe attach a few visuals to accompany this?); recording a "meeting" with yourself in Zoom; and "Panopto," a piece of high-end video recording, sharing, and editing software that UW licenses for campus use. Here are some pointers:
* '''Some resources for recording presentations:''' There are a bunch of ways you might record/share your video. Some ideas include using the embedded media recorder in Canvas (!) that can record with with your webcam (maybe attach a few visuals to accompany this?); recording a "meeting" with yourself in Zoom; and "Panopto," a piece of high-end video recording, sharing, and editing software that NU licenses for campus use. Here are some pointers:
** You should be able to use your UW zoom account to create a zoom meeting, record your meeting (in which you deliver your presentation and share your screen with any visuals), and then share a link to the recording via the "Recordings" item in the left-hand menu of your Zoom account page.
** NU has a "digital learning resource hub" which provides some [https://digitallearning.northwestern.edu/resource-hub#for-students resources for students]. The first item in that list has pointers for recording yourself and posting to Canvas and includes info about the Canvas media recorder and Panopto.
** You should be able to use your NU zoom account to create a zoom meeting, record your meeting (in which you deliver your presentation and share your screen with any visuals), and then share a link to the recording via the "Recordings" item in the left-hand menu of your [https://northwestern.zoom.us/ https://northwestern.zoom.us/] account page.
** If nothing works, please get in touch.
** If nothing works, please get in touch.
</strike>


Since the class is small, we'll meet and give presentations during class. Everyone should plan to give a ~15 minutes conference style presentation.


== Special Notes ==
=== Special Notes ===
== Teaching and learning in a pandemic ==
=== Teaching and learning in a pandemic ===


The COVID-19 pandemic will impact this course in various ways, some of them obvious and tangible and others harder to pin down. On the obvious and tangible front, we have things like a mix of remote, synchronous, and asynchronous instruction and the fact that many of us will not be anywhere near campus or each other this year. These will reshape our collective "classroom" experience in major ways.  
The COVID-19 pandemic will impact this course in various ways, some of them obvious and tangible and others harder to pin down. On the obvious and tangible front, we have things like a mix of remote, synchronous, and asynchronous instruction and the fact that many of us will not be anywhere near campus or each other this year. These will reshape our collective "classroom" experience in major ways.  
Line 546: Line 548:
:<div style="font-size: 80%; font-style: italic">This text is borrowed and adapted from [[Statistics and Statistical Programming (Fall 2020)|Aaron Shaw's statistics course]].</div>
:<div style="font-size: 80%; font-style: italic">This text is borrowed and adapted from [[Statistics and Statistical Programming (Fall 2020)|Aaron Shaw's statistics course]].</div>


== Expectations for synchronous remote sessions ==
=== Expectations for synchronous remote sessions ===


The following are some baseline expectations for our synchronous remote class sessions. I expect that these can and will evolve. Please feel free to ask questions, suggest changes, or raise concerns during the quarter. I welcome all input:
The following are some baseline expectations for our synchronous remote class sessions. I expect that these can and will evolve. Please feel free to ask questions, suggest changes, or raise concerns during the quarter. I welcome all input:
Line 560: Line 562:
* Children, family, pets, roommates, and others with whom you may share your workspace are welcome to join our class as needed.
* Children, family, pets, roommates, and others with whom you may share your workspace are welcome to join our class as needed.


== Statistics and power ==
=== Statistics and power ===


The subject matter of this course—statistics and statistical programming—has historical and present-day affinities with a variety of oppressive ideologies and projects, including white supremacy, discrimination on the basis of gender and sexuality, state violence, genocide, and colonialism. It has also been used to challenge and undermine these projects in various ways. I will work throughout the quarter to acknowledge and represent these legacies accurately, at the same time as I also strive to advance equity, inclusion, and justice through my teaching practice, the selection of curricular materials, and the cultivation of an inclusive classroom environment.
The subject matter of this course—statistics and statistical programming—has historical and present-day affinities with a variety of oppressive ideologies and projects, including white supremacy, discrimination on the basis of gender and sexuality, state violence, genocide, and colonialism. It has also been used to challenge and undermine these projects in various ways. I will work throughout the quarter to acknowledge and represent these legacies accurately, at the same time as I also strive to advance equity, inclusion, and justice through my teaching practice, the selection of curricular materials, and the cultivation of an inclusive classroom environment.
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see CommunityData:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)