Editing Statistics and Statistical Programming (Fall 2020)

From CommunityData
Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 18: Line 18:
:Also usually available via chat during "business hours."
:Also usually available via chat during "business hours."


;'''Teaching Assistant:''' [http://nickmvincent.com Nick Vincent] ([mailto:nickvincent@u.northwestern.edu nickvincent@u.northwestern.edu])
:'''Teaching Assistant:''' [http://nickmvincent.com Nick Vincent] ([mailto:nickvincent@u.northwestern.edu nickvincent@u.northwestern.edu])
:Office Hours: Monday 10am-12pm and by appointment. I'll try to respond to any asynchronous questions in a timely fashion during "business hours" (9a-5p Central Time), and will also have OH by appointment. I'll respond best to email (above), but am also happy to use Discord for quicker back-and-forth.
::Office Hours: I'll try to respond to any asynchronous questions in a timely fashion during "business hours" (9a-5p Central Time), and will also have OH by appointment. I'll also try to schedule some fixed time during which I'll hang out on a video call, hours TBA.
:I am happy to try out alternative communication software for OH!
::I'll likely use whatever conference we use for class sessions, but am happy to try out alternative communication software for OH!


<br>
<br>
Line 58: Line 58:
--->
--->


This course will proceed in a '''remote''' format that includes ''asynchronous'' and ''synchronous'' elements (more on those below). In general, the organization of the course adopts a "flipped" approach where participants consume, discuss, and process instructional materials outside of "class" and we use synchronous meetings to answer questions, address challenges or concerns, work through solutions, and hold semi-structured discussions.  
This course will proceed in a '''remote''' format that includes ''asynchronous'' and ''synchronous'' elements (more on those below). In general, the organization of the course adopts a "flipped" approach where you consume instructional materials on your own and in working groups and we use synchronous meetings to answer questions, address challenges or concerns, work through solutions, and hold semi-structured discussions.  


The course introduces ''both'' basic statistical concepts as well as applications of those concepts through statistical programming. As a result, we will usually dedicate part of each week to a particular set of concepts and part of each week to applied data analysis and/or interpretation. A brief description of how I expect it all to work follows below. We'll talk about it more during the first class session.
The course introduces ''both'' basic statistical concepts as well as basic applications of those concepts through statistical programming. As a result, we will dedicate part of each week to a particular set of concepts and part of each week to applied data analysis and/or interpretation. A brief description of how I expect it all to work follows below. We'll talk about it more during the first class session.


====Asynchronous elements of the course====
====Asynchronous elements of the course====


These include all readings, recorded lectures/slides, tutorials, textbook exercises, problem sets, and other assignments. I expect you to complete (or at least attempt to complete!) these outside of our class meeting times. I also strongly encourage you to identify, submit, and discuss questions about the material '''before each class meeting''' whenever possible.
These include all readings, recorded lectures/slides, tutorials, problem sets, and other assignments. I expect you to complete (or at least attempt to complete!) these outside of our class meeting times. For nearly all of the "instructional" material introducing particular statistical concepts and techniques, you are expected to use the OpenIntro textbook and lecture materials created by the textbook authors. Please note that this means I will not deliver lectures during our class meetings. Please also note that this means you are responsible for coordinating your problem set groups and any collaborative work with other members of the class outside of our class meeting times.
 
We will use Discord for everyday discussions and chat related to the course. In general, the teaching team will try to keep an eye on the various server channels during "business hours." To the extent that we can respond to questions and concerns there, we'll do so. We'll also use the discussion channels to identify topics that might benefit from synchronous conversation during the course meetings. Hopefully, writing and talking about questions and concerns outside of the synchronous course meetings will help support accountability, learning, and more effective use of our meeting time.
 
For nearly all of the "instructional" material introducing particular statistical concepts and techniques, you are assigned materials from the OpenIntro textbook and lecture materials created by the textbook authors. Please note that this means I will not deliver lectures during our class meetings. Please also note that this means you are responsible for coordinating your working groups and any collaborative work with other members of the class outside of our class meeting times.


====Synchronous elements of the course====
====Synchronous elements of the course====
Line 74: Line 70:
The synchronous elements of the course will be the two weekly class meetings that will happen via video conference (Zoom). These are scheduled to run for a maximum of 110 minutes. Each session will include multiple short breaks.  
The synchronous elements of the course will be the two weekly class meetings that will happen via video conference (Zoom). These are scheduled to run for a maximum of 110 minutes. Each session will include multiple short breaks.  


We will use the class meetings to discuss and work through any questions or challenges you encounter in the materials assigned for that day. This means that I encourage you to identify, submit, and discuss questions about the material '''before each class meeting''' whenever possible. Doing so will give the teaching team time to sift, sort, and organize the questions into a hopefully-cohesive plan for each class session that is tailored to the specific concerns you encounter in the material. Obviously, we anticipate that questions will arise during the class sessions too as well and we'll do our best to adapt as we go.
We will use the class meetings to discuss and work through any questions or challenges you encounter in the materials assigned for that day. This means that I will encourage you to identify and submit questions about the material '''before each class meeting''' whenever possible. Doing so will give the teaching team time to sift, sort, and organize the questions into a hopefully-cohesive plan for each class session that is tailored to the specific concerns you encounter in the material. Obviously, we anticipate that questions will arise during the class sessions as well and we'll do our best to adapt as we go.
 
A couple of other notes about the synchronous course meetings:
* Aaron plans to record the course meetings and have them available to class participants only via Zoom/Canvas. Please get in touch if you have concerns or requests about this.
* The teaching team will do our best to notice and respond to any questions or comments that come up via Discord or Zoom during the class. Please do what you can to support these efforts.
* You might want to create/acquire something like [https://www.mccormick.northwestern.edu/news/articles/2020/08/back-to-school-hack-shares-students-handwritten-work-and-teacher-response-in-real-time.html NU Mechanical Engineering Professor Michael Peshkin's homebrew document camera] to facilitate sharing hand-written notes/drawings during class.


In addition, because randomness is extremely important in statistics, I may occasionally '''randomly assign''' different working groups to share and discuss their solutions to selected textbook exercises or problem set questions during class. These random assignments will be announced ahead of time so that the group has an opportunity to prepare. The idea here is to structure some participation in the synchronous sessions to ensure an equitable distribution of the responsibility to discuss questions, answers, points of confusion, and alternatives.
In addition, because randomness is extremely important in statistics, I may occasionally '''randomly assign''' different working groups to share and discuss their solutions to selected textbook exercises or problem set questions during class. These random assignments will be announced ahead of time so that the group has an opportunity to prepare. The idea here is to structure some participation in the synchronous sessions to ensure an equitable distribution of the responsibility to discuss questions, answers, points of confusion, and alternatives.
Line 85: Line 76:
==== Working groups ====  
==== Working groups ====  


At the start of the course you will be assigned to a small working group. This will be a group of 2-3 students (exact numbers will depend on the final enrollment) with whom you may meet outside of class time to discuss, complete, and/or review your weekly assignments (as well as some of the research project assignments). The groups will rotate at least once during the quarter to ensure that you get to work with different members of the class. The main idea is to support collaborative learning, peer support, and accountability. While the specifics of exactly when and how you work with your working group will largely be up to you, the teaching team will provide [[Statistics_and_Statistical_Programming_(Fall_2020)/Working_groups_template|suggestions in the form of a template]] that you can use as a starting point.
At the start of the course you will be assigned to a small working group. This will be a group of 2-3 students (exact numbers will depend on the final enrollment) with whom you will meet outside of class time to discuss, complete, and/or review your weekly assignments (as well as some of the research project assignments). The groups will rotate at least once during the quarter to ensure that you get to work with different members of the class. The main idea is to support collaborative learning, peer support, and accountability. While the specifics of exactly when and how you work with your working group will largely be up to you, the teaching team will provide suggestions and a template that you can use as a starting point.


As a general rule, we strongly encourage you to collaborate with members of your working group on any/all weekly (minor) assignments. You may, if you choose, also collaborate with others in your group or the class on your research project (major) assignments; however, collaborative research projects should be discussed with a member of the teaching team and all research project assignment submissions should include the names of all collaborators.  
As a general rule, we strongly encourage you to collaborate with members of your working group on any/all weekly (minor) assignments. You may, if you choose, also collaborate with others in your group or the class on your research project (major) assignments; however, collaborative research projects should be discussed with a member of the teaching team and all research project assignment submissions should include the names of all collaborators.  
Line 117: Line 108:
* Verzani, John. 2014. ''Using R for Introductory Statistics, Second Edition''. 2 edition. Boca Raton: Chapman and Hall/CRC. ([https://en.wikipedia.org/wiki/Special:BookSources/978-1-4665-9073-1 Various Sources]; [https://www.amazon.com/Using-Introductory-Statistics-Second-Chapman/dp/1466590734/ref=mt_hardcover?_encoding=UTF8&me= Amazon])
* Verzani, John. 2014. ''Using R for Introductory Statistics, Second Edition''. 2 edition. Boca Raton: Chapman and Hall/CRC. ([https://en.wikipedia.org/wiki/Special:BookSources/978-1-4665-9073-1 Various Sources]; [https://www.amazon.com/Using-Introductory-Statistics-Second-Chapman/dp/1466590734/ref=mt_hardcover?_encoding=UTF8&me= Amazon])
* Wickham, Hadley. 2010. ''ggplot2: Elegant Graphics for Data Analysis''. 1st ed. 2009. Corr. 3rd printing 2010 edition. New York: Springer. ([https://link.springer.com/book/10.1007%2F978-3-319-24277-4 Springer/NU Libraries]; [https://en.wikipedia.org/wiki/Special:BookSources/978-0-596-80915-7 Various Sources])
* Wickham, Hadley. 2010. ''ggplot2: Elegant Graphics for Data Analysis''. 1st ed. 2009. Corr. 3rd printing 2010 edition. New York: Springer. ([https://link.springer.com/book/10.1007%2F978-3-319-24277-4 Springer/NU Libraries]; [https://en.wikipedia.org/wiki/Special:BookSources/978-0-596-80915-7 Various Sources])
* Wickham, Hadly and Grolemund, Garret. 2017. ''R for Data Science''. Sebastopol, CA: O'Reilly. ([https://r4ds.had.co.nz/ Online version]).


There are also some invaluable non-textbook resources:
There are also some invaluable non-textbook resources:
Line 141: Line 131:


==== Problem sets ====
==== Problem sets ====
The course will include problem sets and these may incorporate several kinds of questions:
These may incorporate several kinds of questions:


* '''Statistics questions''' about statistical concepts and principles.
* '''Statistics questions''' about statistical concepts and principles.
Line 147: Line 137:
* '''Empirical paper questions''' about other assigned readings.  
* '''Empirical paper questions''' about other assigned readings.  


For the problem sets, I ask that you submit your work [https://canvas.northwestern.edu/courses/122522/assignments via Canvas 24 hours before class] (i.e., Monday afternoon for our Tuesday class sessions). Details of exactly how this will work will be elaborated during the first class. For the programming challenges, you should submit code and text for your solutions (again, more on how later). If you get completely stuck on a problem, that's okay, but please provide whatever you have.
For the problem sets, I ask that you submit your work via Canvas 24 hours before class (i.e., Monday afternoon for our Tuesday class sessions). Details of exactly how this will work will be provided in the course schedule and we'll go over them during the first class. For the programming challenges, you should submit code and documentation for your solutions (again, more on how later). If you get completely stuck on a problem, that's okay, but please provide whatever you have.


Problem sets will be evaluated on a complete/incomplete basis. Although the problem sets will not be assigned a letter grade, they are a central focus of the course and completing them will support your mastery of the material in multiple ways. Working through them on schedule will also make it possible for you to participate in the synchronous course meetings and online discussions of course material effectively. Your ability to do so will figure prominently in your participation grade for the course (see the section on grading and assessment below).
Attendance in the synchronous portion of the class will be important to supporting your mastery of the material. Although the problem sets will not be assigned a letter grade, it is critical that you be present and able to discuss your answers to each of the questions. Your ability to do so will figure prominently in your participation grade for the course (see the section on grading and assessment below).


=== Research project (major) assignments ===
=== Research project (major) assignments ===
Line 165: Line 155:
''I strongly urge you'' to produce a project that will further your academic career outside of the class. There are many ways that this can happen. Some obvious options are to prepare a project that you can submit for publication, use as pilot analysis that you can report in a grant or thesis proposal, and/or use to fulfill a degree requirement.
''I strongly urge you'' to produce a project that will further your academic career outside of the class. There are many ways that this can happen. Some obvious options are to prepare a project that you can submit for publication, use as pilot analysis that you can report in a grant or thesis proposal, and/or use to fulfill a degree requirement.


There are several intermediate milestones, deliverables, and deadlines to help you accomplish a successful research project. Unless otherwise noted, all deliverables should be submitted via Canvas by 5pm CT on the day they are due.
There are several intermediate milestones and deadlines to help you accomplish a successful research project. Unless otherwise noted, all deliverables should be submitted via Canvas.




==== Research project plan and dataset identification ====
==== Project plan and dataset identification ====


;Due date: October 9, 2020, 5pm CT
;Due date: TBA
;Maximum length: 500 words (~1-2 pages)
;Maximum length: 500 words (~1-2 pages)


Line 180: Line 170:




===== Notes on finding a dataset =====
''' Notes on finding a dataset '''


In order to complete your final project, you will each need a dataset. If you already have a dataset for the project you plan to conduct, great! If not, fear not! There are many datasets to draw from. Some ideas are below (please suggest others, provide updated links, or report problems). The teaching team will also be available to help you brainstorm/find resources if needed:
In order to complete your final project, you will each need a dataset. If you already have a dataset for the project you plan to conduct, great! If not, fear not! There are many datasets to draw from. Some ideas are below (please suggest others, provide updated links, or report problems). The teaching team will also be available to help you brainstorm/find resources if needed:
Line 191: Line 181:
* Use the [http://scientificdata.isa-explorer.org/index.html ISA Explorer] to find datasets. Keep in mind the large majority of datasets it will search are drawn from the natural sciences.
* Use the [http://scientificdata.isa-explorer.org/index.html ISA Explorer] to find datasets. Keep in mind the large majority of datasets it will search are drawn from the natural sciences.
* The City of Chicago has one of the best [https://data.cityofchicago.org/ data portal sites] of any municipality in the U.S. (and better than many federal agencies). There are also numerous administrative datasets released by other public entities (try searching!) that you might find inspiring.
* The City of Chicago has one of the best [https://data.cityofchicago.org/ data portal sites] of any municipality in the U.S. (and better than many federal agencies). There are also numerous administrative datasets released by other public entities (try searching!) that you might find inspiring.
<!---
* <TODO fix/update accordingly> Set up a meeting with Jennifer Muilenburg — Data Curriculum and Communications Librarian who runs [https://www.lib.washington.edu/digitalscholarship/services/data research data services at the UW libraries]. Her email is: libdata@uw.edu I've have talked to her about this course and she is excited about meeting with you to help.
-->
* [http://fivethirtyeight.com FiveThirtyEight.com] has published a [https://cran.r-project.org/web/packages/fivethirtyeight/vignettes/fivethirtyeight.html GitHub repository and an R package] with pre-processed and cleaned versions of many of the datasets they use for articles published on their website.
* [http://fivethirtyeight.com FiveThirtyEight.com] has published a [https://cran.r-project.org/web/packages/fivethirtyeight/vignettes/fivethirtyeight.html GitHub repository and an R package] with pre-processed and cleaned versions of many of the datasets they use for articles published on their website.
* If you interested in studying online communities, there are some great resources for accessing data from Reddit, Wikipedia, and StackExchange. See [https://files.pushshift.io/reddit/ pushshift] for dumps of Reddit data, [https://meta.wikimedia.org/wiki/Research:Data here] for an overview of Wikipedia's data resources, and [https://data.stackexchange.com/ Stack Exchange's data portal].
* If you interested in studying online communities, there are some great resources for accessing data from Reddit, Wikipedia, and StackExchange. See [https://files.pushshift.io/reddit/ pushshift] for dumps of Reddit data, [https://meta.wikimedia.org/wiki/Research:Data here] for an overview of Wikipedia's data resources, and [https://data.stackexchange.com/ Stack Exchange's data portal].
* The NY Times is publishing a [https://github.com/nytimes/covid-19-data COVID-19 data repository] that includes county-level metrics for deaths, mask usage, and other pandemic-related data. The release a lot of it as frequently updated .csv files and the repository includes documentation of the measurements, data collection details, and more.
* The Community Data Science Collective and colleagues have created a [[COVID-19_Digital_Observatory| COVID-19 digital observatory]] (hosted in part right here on this wiki!) that publishes a bunch of pandemic-related data as csv and json files.
* The [https://openpolicing.stanford.edu Stanford Open Policing project] has published a huge archive of policing data related mostly to traffic stops in states and many cities of the U.S. We'll use at least one of these files for a problem set.


==== Research project planning document ====
==== Project planning document ====


;Due date: October 30, 2020, 5pm CT
;Due date: TBA
;Suggested length: ~5 pages
;Maximum length: ~5 pages


The project planning document is a shell/outline of an empirical quantitative research paper. Your planning document should should have the following sections: (a) Rationale, (b) Objectives; (b.1) General objectives; (b.2) Specific objectives; (c) (Null) hypotheses; (d) Conceptual diagram and explanation of the relationship(s) you plan to test; (e) Measures; (f) Dummy tables/figures; (g) anticipated finding(s) and research contribution(s). Longer descriptions of each of these planning document sections (as well as a few others) can be found [[CommunityData:Planning document|on this wiki page]].
The project planning document is a basic shell/outline of an empirical quantitative research paper. Your planning document should should have the following sections: (a) Rationale, (b) Objectives; (b.1) General objectives; (b.2) Specific objectives; (c) (Null) hypotheses; (d) Conceptual diagram and explanation of the relationship(s) you plan to test; (e) Measures; (f) Dummy tables/figures; (g) anticipated finding(s) and research contribution(s). Longer descriptions of each of these planning document sections (as well as a few others) can be found [[CommunityData:Planning document|on this wiki page]].


I will also provide three example planning documents via our Canvas site (links to-be-updated for 2020 edition of the course):
I will also provide three example planning documents via our Canvas site (links to-be-updated for 2020 edition of the course):
* [https://canvas.northwestern.edu/files/9439380/download?download_frd=1 One by public health researcher Mika Matsuzaki]. The first planning document I ever saw and still one of the best. It's missing a measures section. It's also focused on a research context that is probably very different from yours, but try not to get bogged down by that and imagine how you might map the structure of the document to your own work.
* [https://canvas.northwestern.edu/files/6908602/download?download_frd=1 One by public health researcher Mika Matsuzaki]. The first planning document I ever saw and still one of the best. It's missing a measures section. It's also focused on a research context that is probably very different from yours, but try not to get bogged down by that and imagine how you might map the structure of the document to your own work.
* [https://canvas.northwestern.edu/files/9421229/download?download_frd=1 One by Jim Maddock] created as part of a qualifying exam early in 2019. Jim doesn't provide dummy tables or anticipated findings/contributions, but he has an especially phenomenal explanation of the conceptual relationships and processes he wants to test.  
* [https://canvas.northwestern.edu/files/6919735/download?download_frd=1 One by Jim Maddock] created as part of a qualifying exam early in 2019. Jim doesn't provide dummy tables or anticipated findings/contributions, but he has an especially phenomenal explanation of the conceptual relationships and processes he wants to test.  
* [https://canvas.northwestern.edu/files/9439379/download?download_frd=1 One provided as an appendix to Gerber and Green's excellent textbook, ''Field Experiments: Design, Analysis, and Interpretation'' (FEDAI)]. It's over-detailed and over-long for the purposes of this assignment, but nevertheless an exemplary approach to planning empirical quantitative research in a careful, intentional way that is worthy of imitation.
* [https://canvas.northwestern.edu/files/6908606/download?download_frd=1 One provided as an appendix to Gerber and Green's excellent textbook, ''Field Experiments: Design, Analysis, and Interpretation'' (FEDAI)]. It's over-detailed and incredibly long for our purposes, but nevertheless an exemplary approach to planning empirical quantitative research in a careful, intentional way that is worthy of imitation.
 
==== Research project presentation ====
 
;Presentation due date: December 3, 2020, 5pm CT
;Maximum length: 10 minutes
 
<!-- TODO revisit old presentations page to update/adapt
[[Statistics_and_Statistical_Programming_(Spring_2019)/Final_project_presentations]]
--->
You will also create and record a short presentation of your final project. The presentation will provide an opportunity to share a brief overview of your project and findings with the other members of the class. Since you will all give other research presentations throughout your career, I strongly encourage you to take the opportunity to refine your academic presentation skills. The document [https://canvas.northwestern.edu/files/9439377/download?download_frd=1 Creating a Successful Scholarly Presentation] (file posted to Canvas) may be useful.
 
Additional details about the presentation goals, format suggestions, resources, and more will be provided later in the quarter.


==== Research project paper ====
==== Research paper ====


;Paper due date: December 10, 2020, 5pm CT
;Paper due date: TBA
;Maximum length: 6000 words (~20 pages)
;Maximum length: 6000 words (~20 pages)


Line 235: Line 213:


I do not have strong preferences about the style or formatting guidelines you follow for the paper and its bibliography. However, ''your paper must follow a standard format'' (e.g., [https://cscw.acm.org/2019/submit-papers.html ACM SIGCHI CSCW format] or [https://www.apastyle.org/index APA 6th edition] ([https://templates.office.com/en-us/APA-style-report-6th-edition-TM03982351 Word] and [https://www.overleaf.com/latex/templates/sample-apa-paper/fswjbwygndyq LaTeX] templates)) that is applicable for a peer-reviewed journal or conference proceedings in which you might aim to publish the work (they all have formatting or submission guidelines published online and you should follow them). This includes the references. I also strongly recommend that you use reference management software like Zotero to handle your bibliographic sources.
I do not have strong preferences about the style or formatting guidelines you follow for the paper and its bibliography. However, ''your paper must follow a standard format'' (e.g., [https://cscw.acm.org/2019/submit-papers.html ACM SIGCHI CSCW format] or [https://www.apastyle.org/index APA 6th edition] ([https://templates.office.com/en-us/APA-style-report-6th-edition-TM03982351 Word] and [https://www.overleaf.com/latex/templates/sample-apa-paper/fswjbwygndyq LaTeX] templates)) that is applicable for a peer-reviewed journal or conference proceedings in which you might aim to publish the work (they all have formatting or submission guidelines published online and you should follow them). This includes the references. I also strongly recommend that you use reference management software like Zotero to handle your bibliographic sources.
==== Project presentation ====
;Presentation due date: TBA
;Maximum length: 10 minutes
<!-- TODO revisit old presentations page to update/adapt
[[Statistics_and_Statistical_Programming_(Spring_2019)/Final_project_presentations]]
--->
You will also create and record a short (7-8 minute) presentation of your final project. The presentation will provide an opportunity to share a brief summary of your project and at least preliminary findings with the other members of the class. Since you will all give other research presentations throughout your career, I strongly encourage you to take the opportunity to refine your academic presentation skills. The document [https://canvas.northwestern.edu Creating a Successful Scholarly Presentation] (file will be posted to Canvas) may be useful.
More details about the presentation goals, format suggestions, and more will be provided later in the quarter.


==== Human subjects research, IRB, and ethics ====
==== Human subjects research, IRB, and ethics ====
Line 304: Line 294:
==== September 17: Intro and setup ====
==== September 17: Intro and setup ====


;[[Statistics_and_Statistical_Programming_(Fall_2020)/w01_session_plan|Session plan]]
;Note: Aaron doesn't actually expect you to complete these before class on September 17
 
<blockquote>''Note: Aaron doesn't actually expect you to complete these before class on September 17''</blockquote>


'''Required'''
'''Required'''
* Read this syllabus, discuss any questions/concerns with the teaching team.
* Complete [https://apps3.cehd.umn.edu/artist/user/scale_select.html pre-course assessment of statistical concepts] (access code TBA via email). '''Submission deadline: September 18, 11:00pm Chicago time'''
* Complete [https://apps3.cehd.umn.edu/artist/user/scale_select.html pre-course assessment of statistical concepts] (access code TBA via email). Estimated time to do this is 30-40 minutes. '''Submission deadline: September 18, 11:00pm Chicago time'''
* Confirm access to software and web-services for course (Zoom, Discord, Canvas, this wiki, R, RStudio).
* Confirm course registration and access to [https://www.openintro.org/book/os/ the textbook] (pdf download available for $0 and b&w paperbacks for $20) as well as any software and web-services you'll need for course (Zoom, Discord, Canvas, this wiki, R, RStudio). Discord invites will be sent via email.
* Complete [https://wiki.communitydata.science/Statistics_and_Statistical_Programming_(Fall_2020)/pset0 problem set 0]  
* Complete [https://wiki.communitydata.science/Statistics_and_Statistical_Programming_(Fall_2020)/pset0 problem set #0]  


'''Recommended'''
'''Recommended'''
* Work through one (or more) introduction(s) to R and Rstudio so that you can complete problem set 0. Here are several suggestions:
* Work through one (or more) introduction(s) to R and Rstudio so that you can complete problem set 0. Here are several suggestions:
** '''From Aaron:''' The [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w01-R_tutorial.html Week 01 R tutorial] (you should also download the [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w01-R_tutorial.rmd .rmd version of the tutorial] that you can open and read/edit in RStudio). These are accompanied by the R and Rstudio intro screencasts ([https://communitydata.cc/~ads/teaching/2019/stats/screencasts/w01-s01-intro.webm Part 1] and [https://communitydata.cc/~ads/teaching/2019/stats/screencasts/w01-s02-intro.webm Part 2]) Aaron created for the 2019 version of the course.  
** '''From Aaron:''' The [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w01-R_tutorial.html Week 01 R tutorial] (you should also download the [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w01-R_tutorial.rmd .Rmd version of the tutorial] that you can open and read/edit in RStudio). These are accompanied by the R and Rstudio intro screencasts ([https://communitydata.cc/~ads/teaching/2019/stats/screencasts/w01-s01-intro.webm Part 1] and [https://communitydata.cc/~ads/teaching/2019/stats/screencasts/w01-s02-intro.webm Part 2]) Aaron created for the 2019 version of the course.  
** Modern Dive [https://moderndive.netlify.app/index.html Statistical inference via data science] Chapter 1: [https://moderndive.netlify.app/1-getting-started.html Getting started with R].
 
** [https://rladiessydney.org/courses/ryouwithme/ RYouWithMe] course [https://rladiessydney.org/courses/ryouwithme/01-basicbasics-0/ "Basic basics" 1 & 2] (and maybe 3 if you're feeling ambitious).
'''Resources'''
** Verzani §1 (Getting started).
* Verzani §1 (Getting started) and Healy §2 (Get started) provide helpful background for working with R and RStudio.
** Healy §2 (Get started).
* Modern Dive [https://moderndive.netlify.app/index.html Statistical inference via data science] Chapter 1: [https://moderndive.netlify.app/1-getting-started.html Getting started with R].
* [https://rladiessydney.org/courses/ryouwithme/ RYouWithMe] course [https://rladiessydney.org/courses/ryouwithme/01-basicbasics-0/ "Basic basics" 1 & 2] (and maybe 3 if you're feeling ambitious).


=== Week 2 (9/22, 9/24) ===
=== Week 2 (9/22, 9/24) ===
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w02_session_plan|Session plans]]
==== September 22: Data and variables ====
==== September 22: Data and variables ====
'''Required'''
'''Required'''
* Read Diez, Çetinkaya-Rundel, and Barr: §1.1-1.3 (Introduction to data).  
* Read Diez, Çetinkaya-Rundel, and Barr: §1.1-1.3 (Introduction to data).  
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM6pZ76FD3NoCvvgkj_p-dE8 Lecture materials for §1.1-3 (Videos 1-4 in the playlist)].
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM6pZ76FD3NoCvvgkj_p-dE8 Lecture materials for §1.1-3 (Videos 1-4 in the playlist)].
* Submit, review, and respond to questions or requests for discussion via Discord or some other means.
* Complete '''exercises from OpenIntro §1:''' 1.6, 1.9, 1.10, 1.16, 1.21, 1.40, 1.42, 1.43 (and remember that solutions to odd-numbered problems are in the book!)


==== September 24: Numerical and categorical data ====
==== September 24: Numerical and categorical data ====
Line 334: Line 321:
* Read Diez, Çetinkaya-Rundel, and Barr: §2.1-2 (Numerical and categorical data).  
* Read Diez, Çetinkaya-Rundel, and Barr: §2.1-2 (Numerical and categorical data).  
* Review [https://www.youtube.com/playlist?list=PLkIselvEzpM6pZ76FD3NoCvvgkj_p-dE8 Lecture materials for §2.1 and §2.2 (Videos 6-7 in the playlist)].
* Review [https://www.youtube.com/playlist?list=PLkIselvEzpM6pZ76FD3NoCvvgkj_p-dE8 Lecture materials for §2.1 and §2.2 (Videos 6-7 in the playlist)].
* Complete '''exercises from OpenIntro §2:''' 2.12, 2.13, 2.16, 2.20, 2.23, 2.30 (and remember that solutions to odd-numbered problems are in the book!)
* Complete '''exercises from OpenIntro §2:''''
* Submit, review, and respond to questions or requests for discussion via Discord or some other means.
 
'''Resources'''


=== Week 3 (9/29, 10/1) ===
=== Week 3 (9/29, 10/1) ===
 
==== September 29: Working with data and variables in R ====
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w03_session_plan|Session plans]]
 
==== September 29: R fundamentals: Import, transform, tidy, and describe data ====
'''Required'''
'''Required'''
* Complete [[Statistics_and_Statistical_Programming_(Fall_2020)/pset1|problem set #1]] (due Monday, September 28 at 1pm Central)
* R lecture materials from 2019 W02
* Complete problem set #1
** TODO Empirical paper/data (UCB admissions. Police stops in IL.)
** TODO update PS2 Programming challenges from 2019


'''Recommended'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w03-R_tutorial.html Week 3 R tutorial] (note that you can access .rmd or .pdf versions by replacing the suffix of the URL accordingly).
* Additional material from any of the recommended R learning resources suggested last week or elsewhere in the syllabus. In particular, you may find the ModernDive, RYouWithMe, Healy, and/or Wickham and Grolemund resources valuable.
<!---
'''Resources'''
'''Resources'''
* [https://science.sciencemag.org/content/187/4175/398 UCB admissions paper]
* [https://science.sciencemag.org/content/187/4175/398 UCB admissions paper]
* [https://openpolicing.stanford.edu Stanford OpenPolicing Project]
* [https://openpolicing.stanford.edu Stanford OpenPolicing Project]
--->


==== October 1: Probability ====
==== October 1: Probability ====
'''Required'''
'''Required'''
* Read Diez, Çetinkaya-Rundel, and Barr: §3 (Probability).  
* Read Diez, Çetinkaya-Rundel, and Barr: §3.1-3; §3.4-5 (Probability).  
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5EgoOajhw83Ax_FktnlD6n&v=rG-SLQ2uF8U Probability introduction] and [https://www.youtube.com/watch?v=HxEz4ZHUY5Y&list=PLkIselvEzpM5EgoOajhw83Ax_FktnlD6n&index=2 Probability trees] OpenIntro lectures (just videos 1 and 2 in the playlist).
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5EgoOajhw83Ax_FktnlD6n&v=rG-SLQ2uF8U Probability introduction] and [https://www.youtube.com/watch?v=HxEz4ZHUY5Y&list=PLkIselvEzpM5EgoOajhw83Ax_FktnlD6n&index=2 Probability trees] OpenIntro lectures (just videos 1 and 2 in the playlist).
* Complete '''exercises from OpenIntro §3:''' 3.12, 3.15, 3.22, 3.28, 3.34, 3.38
* Complete '''exercises from OpenIntro §3:''''


'''Resources'''
'''Resources'''
Line 364: Line 347:


=== Week 4 (10/6, 10/8) ===
=== Week 4 (10/6, 10/8) ===
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w04_session_plan|Session plans]]
==== October 6: <Topic> ====
 
==== October 6: Emotional contagion and more advanced R fundamentals: import, tidy, transform, and simulate data; write functions ====
'''Required'''
'''Required'''
* Read the paper below as well as the attendant [https://www.pnas.org/content/111/29/10779.1 "Expression of editorial concern"] and [https://www.pnas.org/content/111/29/10779.2 "Correction"] that were subsequently appended to it.
* Complete problem set #2
:Kramer, Adam D. I., Jamie E. Guillory, and Jeffrey T. Hancock. 2014. “Experimental Evidence of Massive-Scale Emotional Contagion through Social Networks.” ''Proceedings of the National Academy of Sciences'' 111(24):8788–90. [[http://www.pnas.org/content/111/24/8788.full Open access]]
* Complete [[Statistics_and_Statistical_Programming_(Fall_2020)/pset2|problem set #2]] (due Monday, October 5 at 1pm CT)
 
'''Recommended'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w04-R_tutorial.html Week 4 R tutorial] (as usual, also available as .rmd or .pdf)


'''Resources'''
==== October 8: Distributions ====
==== October 8: Distributions ====
'''Required'''
'''Required'''
* Read Diez, Çetinkaya-Rundel, and Barr: §4.1-3 (Normal and binomial distributions).  
* Read Diez, Çetinkaya-Rundel, and Barr: §4.1-3 (Normal and binomial distributions).  
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM6V9h55s0l9Kzivih9BUWeW&v=S_p5D-YXLS4 normal and binomial distributions] OpenIntro lectures (videos 1-3 in the playlist).
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM6V9h55s0l9Kzivih9BUWeW&v=S_p5D-YXLS4 normal and binomial distributions] OpenIntro lectures (videos 1-3 in the playlist).
* Complete '''exercises from OpenIntro §4:''' 4.4, 4.6, 4.15, 4.22
* Complete '''exercises from OpenIntro §4:''''


'''Resources'''
'''Resources'''
* [https://seeing-theory.brown.edu/index.html#secondPage/chapter3 Seeing Theory §3 (Probability distributions)]
* [https://seeing-theory.brown.edu/index.html#secondPage/chapter3 Seeing Theory §3 (Probability distributions)]


==== October 9: [[#Research project plan and dataset identification|Research project plan and dataset identification]] due by 5pm CT ====
*'''Submit via [https://canvas.northwestern.edu/courses/122522/assignments Canvas]''' (due by 5pm CT)


=== Week 5 (10/13, 10/15) ===
=== Week 5 (10/13, 10/15) ===
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w05_session_plan|Session plans]]
==== October 13: <Topic> ====
==== October 13: Descriptive analysis and visualization of data ====
'''Required'''
'''Required'''
* Complete [[Statistics_and_Statistical_Programming_(Fall_2020)/pset3|problem set #3]] (due Monday, October 12 at 1pm CT)
* Complete problem set #3


'''Recommended'''
'''Resources'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w05-R_tutorial.html Week 5 R tutorial] and [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w05a-R_tutorial.html Week 5 R tutorial supplement] (both, as usual, also available as .rmd or .pdf).


==== October 15: Foundations for (frequentist) inference ====
==== October 15: Foundations for (frequentist) inference ====
Line 401: Line 374:
* Watch [https://www.youtube.com/watch?v=oLW_uzkPZGA&list=PLkIselvEzpM4SHQojH116fYAQJLaN_4Xo foundations for inference] (videos 1-3 in the playlist) OpenIntro lectures.
* Watch [https://www.youtube.com/watch?v=oLW_uzkPZGA&list=PLkIselvEzpM4SHQojH116fYAQJLaN_4Xo foundations for inference] (videos 1-3 in the playlist) OpenIntro lectures.
* Complete [https://www.openintro.org/book/stat/why05/ Why .05?] OpenIntro video/exercise.
* Complete [https://www.openintro.org/book/stat/why05/ Why .05?] OpenIntro video/exercise.
* Complete '''exercises from OpenIntro §5:''' 5.4, 5.8, 5.10, 5.17, 5.30, 5.35, 5.36
* Complete '''exercises from OpenIntro §5:''''


'''Resources'''
'''Resources'''
Line 408: Line 381:


=== Week 6 (10/20, 10/22) ===
=== Week 6 (10/20, 10/22) ===
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w06_session_plan|Session plans]]
==== October 20: <Topic> ====
==== October 20: Reinforced foundations for inference ====
'''Required'''
'''Required'''
* Complete [[Statistics_and_Statistical_Programming_(Fall_2020)/pset4|problem set #4]] 
* Complete problem set #4
* Read Reinhart, §1.
 
* Revisit the Kramer et al. (2014) paper we read a few weeks ago:
'''Resources'''
:Kramer, Adam D. I., Jamie E. Guillory, and Jeffrey T. Hancock. 2014. “Experimental Evidence of Massive-Scale Emotional Contagion through Social Networks.” ''Proceedings of the National Academy of Sciences'' 111(24):8788–90. [[http://www.pnas.org/content/111/24/8788.full Open access]] 


==== October 22: Inference for categorical data ====
==== October 22: Inference for categorical data ====
Line 420: Line 391:
* Read Diez, Çetinkaya-Rundel, and Barr: §6 (Inference for categorical data).  
* Read Diez, Çetinkaya-Rundel, and Barr: §6 (Inference for categorical data).  
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5Gn-sHTw1NF0e8IvMxwHDW&v=_iFAZgpWsx0 inference for categorical data] (videos 1-3 in the playlist) OpenIntro lectures.
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5Gn-sHTw1NF0e8IvMxwHDW&v=_iFAZgpWsx0 inference for categorical data] (videos 1-3 in the playlist) OpenIntro lectures.
* Complete '''exercises from OpenIntro §6:''' 6.10, 6.16, 6.22, 6.30, 6.40 (just parts a and b; part c gets tedious)
* Complete '''exercises from OpenIntro §6:''''


'''Resources'''
'''Resources'''
Line 426: Line 397:


=== Week 7 (10/27, 10/29) ===
=== Week 7 (10/27, 10/29) ===
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w07_session_plan|Session plans]]
==== October 27: <Topics> ====
==== October 27: Applied inference for categorical data ====
'''Required'''
'''Required'''
* Read Reinhart, §4 and §5 (both are quite short).
* Complete problem set #5
* Skim the following (all are referenced in the problem set)
**  Aronow PM, Karlan D, Pinson LE. (2018). The effect of images of Michelle Obama’s face on trick-or-treaters’ dietary choices: A randomized control trial. PLoS ONE 13(1): e0189693. [https://doi.org/10.1371/journal.pone.0189693 https://doi.org/10.1371/journal.pone.0189693]
** Buechley, Leah and Benjamin Mako Hill. 2010. “LilyPad in the Wild: How Hardware’s Long Tail Is Supporting New Engineering and Design Communities.” Pp. 199–207 in ''Proceedings of the 8th ACM Conference on Designing Interactive Systems.'' Aarhus, Denmark: ACM. [[https://mako.cc/academic/buechley_hill_DIS_10.pdf PDF available on Hill's personal website]]
** Shaw, Aaron and Yochai Benkler. 2012. A tale of two blogospheres: Discursive practices on the left and right. ''American Behavioral Scientist''. 56(4): 459-487. [[https://doi.org/10.1177%2F0002764211433793 available via NU libraries]]
* Complete [[Statistics_and_Statistical_Programming_(Fall_2020)/pset5|problem set #5]]
'''Resources'''
'''Resources'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w06-R_tutorial.html Week 06 R tutorial] (it's very short!)


==== October 29: Inference for numerical data (part 1) ====
==== October 29: Inference for numerical data (part 1) ====
Line 442: Line 406:
* Read Diez, Çetinkaya-Rundel, and Barr: §7.1-3 (Inference for numerical data: differences of means).  
* Read Diez, Çetinkaya-Rundel, and Barr: §7.1-3 (Inference for numerical data: differences of means).  
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5G3IO1tzQ-DUThsJKQzQCD&v=uVEj2uBJfq0 inference for numerical data] (videos 1-4 in the playlist) OpenIntro lectures (and featuring one of the textbook authors!).
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5G3IO1tzQ-DUThsJKQzQCD&v=uVEj2uBJfq0 inference for numerical data] (videos 1-4 in the playlist) OpenIntro lectures (and featuring one of the textbook authors!).
* Complete '''exercises from OpenIntro §7:''' 7.12, 7.24, 7.26
* Complete '''exercises from OpenIntro §7:''''


'''Resources'''
'''Resources'''
* [https://gallery.shinyapps.io/CLT_mean/ OpenIntro Central limit theorem for means demo].
* [https://gallery.shinyapps.io/CLT_mean/ OpenIntro Central liumit theorem for means demo].
 
==== October 30: [[#Research project planning document|Research project planning document]] due 5pm CT====
* Submit via [https://canvas.northwestern.edu/courses/122522/assignments/787297 Canvas] (due by 5pm CT)


=== Week 8 (11/3, 11/5) ===
=== Week 8 (11/3, 11/5) ===
==== November 3: U.S. election day (no class meeting) ====
==== November 3: Self-assessment exercise (no class meeting) ====
 
'''Election Day (U.S.): No class meeting today'''
==== November 4: Interactive self-assessment due ====
* Please submit results [https://canvas.northwestern.edu/courses/122522/assignments/799630 (via Canvas)] from the [https://communitydata.science/~ads/teaching/2020/stats/assessment/interactive_assessment.rmd interactive self-assessment] by 5pm CT.


==== November 5: Inference for numerical data (part 2) ====
==== November 5: Inference for numerical data (part 2) ====
Line 460: Line 419:
* Read Diez, Çetinkaya-Rundel, and Barr: §7.4-5 (Inference for numerical data: power calculations, ANOVA, and multiple comparisons).  
* Read Diez, Çetinkaya-Rundel, and Barr: §7.4-5 (Inference for numerical data: power calculations, ANOVA, and multiple comparisons).  
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5G3IO1tzQ-DUThsJKQzQCD&v=uVEj2uBJfq0 inference for numerical data] (videos 4-8 in the playlist) OpenIntro lectures (and featuring one of the textbook authors!).
* Watch [https://www.youtube.com/watch?list=PLkIselvEzpM5G3IO1tzQ-DUThsJKQzQCD&v=uVEj2uBJfq0 inference for numerical data] (videos 4-8 in the playlist) OpenIntro lectures (and featuring one of the textbook authors!).
* Complete '''exercises from OpenIntro §7:''' 7.42, 7.44, 7.46
* Complete '''exercises from OpenIntro §7:''''


'''Resources'''
'''Resources'''
Line 466: Line 425:


=== Week 9 (11/10, 11/12) ===
=== Week 9 (11/10, 11/12) ===
==== November 10: Applied inference for numerical data (t-tests, power analysis, ANOVA) ====
==== November 10: <Topic> ====
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w09_session_plan|Session plans]]
 
'''Required'''
'''Required'''
* Complete [[Statistics_and_Statistical_Programming_(Fall_2020)/pset6|problem set #6]]
* Complete problem set #6


'''Resources'''
'''Resources'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w09-R_tutorial.html Week 09 R tutorial]


==== November 12: Linear regression ====
==== November 12: Linear regression ====
Line 480: Line 436:
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM63ikRfN41DNIhSgzboELOM linear regression] (videos 1-4 in the playlist) OpenIntro lectures.
* Watch [https://www.youtube.com/playlist?list=PLkIselvEzpM63ikRfN41DNIhSgzboELOM linear regression] (videos 1-4 in the playlist) OpenIntro lectures.
* Read [https://www.openintro.org/go/?id=stat_more_inference_for_linear_regression&referrer=/book/os/index.php More inference for linear regression] (OpenIntro supplement).
* Read [https://www.openintro.org/go/?id=stat_more_inference_for_linear_regression&referrer=/book/os/index.php More inference for linear regression] (OpenIntro supplement).
* Complete '''exercises from OpenIntro §8:''' 8.6, 8.36, 8.40, 8.44
* Complete '''exercises from OpenIntro §8:''''
* Complete '''exercises from OpenIntro supplement:''' 4 and 5 (answers provided in the supplement).
* Complete '''exercises from OpenIntro supplement:''''
   
   
'''Resources'''
'''Resources'''
Line 487: Line 443:


=== Week 10 (11/17, 11/19) ===
=== Week 10 (11/17, 11/19) ===
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w10_session_plan|Session plans]]
==== November 17: <Topic> ====
==== November 17: Applied linear regression ====
'''Required'''
'''Required'''
* Complete [[Statistics_and_Statistical_Programming_(Fall_2020)/pset7|Problem set #7]]
* Complete Problem set #7


'''Resources'''
'''Resources'''
* [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/w10-R_tutorial.html Week 10 R tutorial]
 
==== November 19: Multiple and logistic regression ====
==== November 19: Multiple and logistic regression ====
'''Required'''
'''Required'''
Line 501: Line 456:
* Read [https://www.openintro.org/go/?id=stat_interaction_terms&referrer=/book/os/index.php Interaction terms] (OpenIntro supplement).
* Read [https://www.openintro.org/go/?id=stat_interaction_terms&referrer=/book/os/index.php Interaction terms] (OpenIntro supplement).
* Read [https://www.openintro.org/go/?id=stat_nonlinear_relationships&referrer=/book/os/index.php Fitting models for non-linear trends] (OpenIntro supplement).
* Read [https://www.openintro.org/go/?id=stat_nonlinear_relationships&referrer=/book/os/index.php Fitting models for non-linear trends] (OpenIntro supplement).
* Complete '''exercises from OpenIntro §9:''' 9.4, 9.13, 9.16, 9.18,
* Complete '''exercises from OpenIntro §9:''''
* Complete '''exercises from OpenIntro supplements:''''


'''Resources'''
'''Resources'''


=== Week 11 (11/24) ===
=== Week 11 (11/24) ===
==== November 24: Applied multiple and logistic regression ====
==== November 24: <Topic> and assessment ====
;[[Statistics_and_Statistical_Programming_(Fall_2020)/w11_session_plan|Session plans]]
'''Required'''
'''Required'''
* Complete [[Statistics_and_Statistical_Programming_(Fall_2020)/pset8|Problem set #8]]
* Complete Problem set #8
* Complete [https://apps3.cehd.umn.edu/artist/user/scale_select.html post-course assessment of statistical concepts] (access code TBA VIA email). '''Submission deadline: December 1, 11:00pm Chicago time'''
'''Resources'''
'''Resources'''
* Mako Hill created (and Aaron updated) a brief tutorial on [https://communitydata.science/~ads/teaching/2020/stats/r_tutorials/logistic_regression_interpretation.html interpreting logistic regression coefficients with examples in R]
* Mako Hill created an example of [https://communitydata.science/~mako/2017-COM521/logistic_regression_interpretation.html interpreting logistic regression coefficients with examples in R]
 
=== Week 12+ ===
 
==== December 3: [[#Research project presentation|Research project presentation]] due by 5pm CT ====
'''[https://canvas.northwestern.edu/courses/122522/discussion_topics/856868 Post your video via this "Discussion" on Canvas]'''. Please view and provide constructive feedback on other's videos!
 
* '''Post videos directly to the "Discussion."''' The Canvas text editor has an option to upload/record a video. That's what you want.
* '''Please remember not to over-work/think this.''' I mentioned this in class, but just to reiterate, the focus of this assignment should not be your video editing skills. Please do what you can to record and convey your ideas clearly without devoting insane hours to creating the perfect video.
* '''Some resources for recording presentations:''' There are a bunch of ways you might record/share your video. Some ideas include using the embedded media recorder in Canvas (!) that can record with with your webcam (maybe attach a few visuals to accompany this?); recording a "meeting" with yourself in Zoom; and "Panopto," a piece of high-end video recording, sharing, and editing software that NU licenses for campus use. Here are some pointers:
** NU has a "digital learning resource hub" which provides some [https://digitallearning.northwestern.edu/resource-hub#for-students resources for students]. The first item in that list has pointers for recording yourself and posting to Canvas and includes info about the Canvas media recorder and Panopto.
** You should be able to use your NU zoom account to create a zoom meeting, record your meeting (in which you deliver your presentation and share your screen with any visuals), and then share a link to the recording via the "Recordings" item in the left-hand menu of your [https://northwestern.zoom.us/ https://northwestern.zoom.us/] account page.
** If nothing works, please get in touch.
 
==== December 4: Post-course assessment of statistical concepts due by 11pm CT ====
Complete [https://apps3.cehd.umn.edu/artist/user/scale_select.html post-course assessment] (access code TBA VIA email). Submission deadline: December 4, 11:00pm Chicago time.
 
==== December 10: [[#Research project paper|Research project paper]] due by 5pm CT ====
'''[https://canvas.northwestern.edu/courses/122522/assignments/812317 Submit your paper, data, and code via Canvas].'''


== Credit and Notes ==
== Credit and Notes ==


This syllabus has, in ways that should be obvious, borrowed and built on the [https://www.openintro.org/stat/index.php OpenInto Statistics curriculum]. Most aspects of this course design extend Benjamin Mako Hill's [[Statistics_and_Statistical_Programming_(Winter_2017)|COM 521 class]] from the University of Washington as well as a [[Statistics_and_Statistical_Programming_(Spring_2019)|prior iteration of the same course]] offered at Northwestern in Spring 2019.
This syllabus has, in ways that should be obvious, borrowed and built on the [https://www.openintro.org/stat/index.php OpenInto Statistics curriculum]. Most aspects of this course design extend Benjamin Mako Hill's [[Statistics_and_Statistical_Programming_(Winter_2017)|COM 521 class]] from the University of Washington as well as a [[Statistics_and_Statistical_Programming_(Spring_2019)|prior iteration of the same course]] offered at Northwestern in Spring 2019.
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see CommunityData:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)