Editing User:Groceryheist/drafts/Data Science Syllabus

From CommunityData

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.

Latest revision Your text
Line 1: Line 1:
;Course Title: Data Science and Organizational Communication:
;Data Science and Organizational Communication:
;Instructor: [[User:Groceryheist|Nate TeBlunthuis]]
;Principal instructor: [[User:Groceryheist|Nate TeBlunthuis]]
;Course Catalog Description: Fundamental principles of data science and its implications, including research ethics; data privacy; legal frameworks; algorithmic bias, transparency, fairness and accountability; data provenance, curation, preservation, and reproducibility; human computation; data communication and visualization; the role of data science in organizational context and the societal impacts of data science.  
;Course Catalog Description: Fundamental principles of data science and its implications, including research ethics; data privacy; legal frameworks; algorithmic bias, transparency, fairness and accountability; data provenance, curation, preservation, and reproducibility; human computation; data communication and visualization; the role of data science in organizational context and the societal impacts of data science.  


== Course Description ==
== Course Description ==
The rise of "data science" reflects a broad and ongoing shift in how many teams, organizational leaders,  communities of practice, and entire industries create and use knowledge. This class teaches "data science" as practiced by data-intensive knowledge workers but also as it is positioned in historical, organizational, institutional, and societal contexts.  Students will gain an appreciation for the technical and intellectual aspects of data science, consider critical questions about how data science is often practiced, and envision ethical and effective science practice in their current and future organizational roles.  The format of the class will be a mix of lecture, discussion, in-class activities, and qualitative and quantitative research assignments.  We assume no prior expertise in programming or statistics, only strong academic skills and a willingness to learn. However, students without any background in either programming or in qualitative research (e.g. interviewing) may find this course a challenge.
The rise of "data science" reflects a broad and ongoing shift in how many teams, organizational leaders,  communities of practice, and entire industries create and use knowledge. This class teaches "data science" as practiced by data-intensive knowledge workers but also as it is positioned in historical, organizational, institutional, and societal contexts.  Students will gain an appriciation for the technical and intellectual aspects of data science, consider critical questions about how data science is often practiced, and envision ethical and effective science practice in their current and future organiational roles.  The format of the class will be a mix of lecture, discussion, in-class activities, and qualitative and quantitative research assignments.  We assume no prior expertise in programming or statistics, only strong academic skills and a willingness to learn.


The course is designed around two high-stakes projects. In the first stage of the students will attend the Community Data Science Workshop (CDSC). I am one of the organizers and instructors of this three week intensive workshop on basic programming and data analysis skills. The first course project is to apply these skills together with the conceptual material from this course we have covered so far to conduct an original data analysis on a topic of the student's interest. The second high-stakes project is a critical analysis of an organization or work team. For this project students will serve as consultants to an organizational unit involved in data science. Through interviews and workplace observations they will gain an understanding of the socio-technical and organizational context of their team. They will then synthesize this understanding with the knowledge they gained from the course material to compose a report offering actionable insights to their team.
The course is designed around two high-stakes projects. In the first stage of the students will attend the Community Data Science Workshop (CDSC). I am one of the organizers and instructors of this three week intensive workshop on basic programming and data analysis skills. The first course project is to apply these skills together with the conceptual material from this course we have covered so far to conduct an original data analysis on a topic of the student's interest. The second high-stakes project is a critical analysis of an organization or work team. For this project students will serve as consultants to an organizational unit involved in data science. Through interviews and workplace observations they will gain an understanding of the socio-technical and organizational context of their team. They will then synthesize this understanding with the knowledge they gained from the course material to compose a report offering actionable insights to their team.
Line 67: Line 67:


;Homework assigned
;Homework assigned
* Week 2 reading reflection
* Reading reflection
* Attend week 1 of CDSW
* Attend week 1 of CDSW


Line 88: Line 88:


;Assignments due
;Assignments due
* Week 2 reading reflection
* Week 1 reading reflection
* Attend week 1 of CDSW
* Attend week 1 of CDSW


Line 98: Line 98:


;Homework assigned
;Homework assigned
* Week 3 reading reflection
* Reading reflection
* Attend week 2 of CDSW
* Attend week 2 of CDSW


Line 129: Line 129:


;Assignments due
;Assignments due
* Week 3 reading reflection
* Week 2 reading reflection
* Attend week 2 of CDSW
* Attend week 2 of CDSW
<!-- ;Agenda -->
<!-- ;Agenda -->
Line 138: Line 138:


;Homework assigned
;Homework assigned
* Week 4 reading reflection
* Reading reflection
* Attend week 3 of CDSW
* Attend week 3 of CDSW
* A1: Project proposal and data aquisition
* Project assignment 1: Project proposal and data aquisition


<!-- ;Resources -->
<!-- ;Resources -->
Line 169: Line 169:


;Assignments due
;Assignments due
* Week 4 reading reflection
* Reading reflection
* [[Human_Centered_Data_Science_(Fall_2018)/Assignments#A1:_Data_curation|A1: Data curation]]
* Attend week 3 of CDSW
* Attend week 3 of CDSW
* A1: Project proposal and data aquisition
* Project assignment 1: Project proposal and data aquisition


<!-- ;Agenda -->
<!-- ;Agenda -->
Line 181: Line 182:


;Homework assigned
;Homework assigned
* Week 5 reading reflection
* Reading reflection
* A2: Data analysis (due week 6)
* Attend week 4 of CDSW
 
<!-- ;Resources -->
<!-- ;Resources -->
<!-- * Olteanu, A., Castillo, C., Diaz, F., & Kiciman, E. (2016). ''[http://kiciman.org/wp-content/uploads/2017/08/SSRN-id2886526.pdf Social data: Biases, methodological pitfalls, and ethical boundaries]. -->
<!-- * Olteanu, A., Castillo, C., Diaz, F., & Kiciman, E. (2016). ''[http://kiciman.org/wp-content/uploads/2017/08/SSRN-id2886526.pdf Social data: Biases, methodological pitfalls, and ethical boundaries]. -->
Line 200: Line 202:


; Assignments due
; Assignments due
* Week 5 reading reflection
* Week 4 reading reflection
* Assignment 1: Project proposal and data aquisition
* Project assignment 1: Project proposal and data aquisition


; Readings assigned
; Readings assigned
Line 207: Line 209:


; Homework Assigned
; Homework Assigned
* Week 6 reading reflection
* Reading reflection
* A2: Data analysis (due week 6)
* Project assignment 2: Data analysis
<br/>
<br/>
<hr/>
<hr/>
Line 215: Line 217:
=== Week 6 ===
=== Week 6 ===
; Data science in Organizational Contexts
; Data science in Organizational Contexts
''And a crash course on qualitative research''


; Assignments due
; Assignments due
* Week 6 reading reflection
* Week 5 reading reflection
* A2: Data analysis
 
;Readings assigned (Read both, reflect on one)
;Readings assigned (Read both, reflect on one)
* Wang, Tricia. ''[https://medium.com/ethnography-matters/why-big-data-needs-thick-data-b4b3e75e3d7 Why Big Data Needs Thick Data]''. Ethnography Matters, 2016.
* Wang, Tricia. ''[https://medium.com/ethnography-matters/why-big-data-needs-thick-data-b4b3e75e3d7 Why Big Data Needs Thick Data]''. Ethnography Matters, 2016.
* Shilad Sen, Margaret E. Giesel, Rebecca Gold, Benjamin Hillmann, Matt Lesicko, Samuel Naden, Jesse Russell, Zixiao (Ken) Wang, and Brent Hecht. 2015. ''[http://www-users.cs.umn.edu/~bhecht/publications/goldstandards_CSCW2015.pdf Turkers, Scholars, "Arafat" and "Peace": Cultural Communities and Algorithmic Gold Standards]''. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15)
* Shilad Sen, Margaret E. Giesel, Rebecca Gold, Benjamin Hillmann, Matt Lesicko, Samuel Naden, Jesse Russell, Zixiao (Ken) Wang, and Brent Hecht. 2015. ''[http://www-users.cs.umn.edu/~bhecht/publications/goldstandards_CSCW2015.pdf Turkers, Scholars, "Arafat" and "Peace": Cultural Communities and Algorithmic Gold Standards]''. In Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15)


; Homework assigned
* Week 7 reading reflection
* A3: Final project proposal
<br/>
<br/>
<hr/>
<hr/>
Line 236: Line 234:
<!-- [[:File:HCDS 2018 week 5 slides.pdf|Day 5 slides]] -->
<!-- [[:File:HCDS 2018 week 5 slides.pdf|Day 5 slides]] -->


;Introduction to mixed-methods research: ''Big data vs thick data; integrating qualitative research methods into data science practice; crowd-sourcing''
;Introduction to mixed-methods research: ''Big data vs thick data; integrating qualitative research methods into data science practice; crowdsourcing''




;Assignments due
;Assignments due
* Week 7 reading reflection
* Reading reflection
* A3: Final project proposal
 


<!-- ;Agenda -->
<!-- ;Agenda -->
Line 252: Line 250:


;Homework assigned
;Homework assigned
* Week 8 reading reflection
* Reading reflection
* A4: Final presentation (due week 11)
 
<!-- ;Qualitative research methods resources -->
<!-- ;Qualitative research methods resources -->
<!-- * Ladner, S. (2016). ''[http://www.practicalethnography.com/ Practical ethnography: A guide to doing ethnography in the private sector]''. Routledge. -->
<!-- * Ladner, S. (2016). ''[http://www.practicalethnography.com/ Practical ethnography: A guide to doing ethnography in the private sector]''. Routledge. -->
Line 260: Line 258:
<!-- * Usability.gov, ''[https://www.usability.gov/how-to-and-tools/methods/system-usability-scale.html System usability scale]''.  -->
<!-- * Usability.gov, ''[https://www.usability.gov/how-to-and-tools/methods/system-usability-scale.html System usability scale]''.  -->
<!-- * Nielsen, Jakob (2000). ''[https://www.nngroup.com/articles/why-you-only-need-to-test-with-5-users/ Why you only need to test with five users]''. nngroup.com. -->
<!-- * Nielsen, Jakob (2000). ''[https://www.nngroup.com/articles/why-you-only-need-to-test-with-5-users/ Why you only need to test with five users]''. nngroup.com. -->
<!-- ;Wikipedia gender gap research resources -->
<!-- ;Wikipedia gender gap research resources -->
<!-- * Hill, B. M., & Shaw, A. (2013). ''[journals.plos.org/plosone/article?id=10.1371/journal.pone.0065782 The Wikipedia gender gap revisited: Characterizing survey response bias with propensity score estimation]''. PloS one, 8(6), e65782 -->
<!-- * Hill, B. M., & Shaw, A. (2013). ''[journals.plos.org/plosone/article?id=10.1371/journal.pone.0065782 The Wikipedia gender gap revisited: Characterizing survey response bias with propensity score estimation]''. PloS one, 8(6), e65782 -->
Line 265: Line 264:
<!-- * Maximillian Klein. ''[http://whgi.wmflabs.org/gender-by-language.html Gender by Wikipedia Language]''. Wikidata Human Gender Indicators (WHGI), 2017. -->
<!-- * Maximillian Klein. ''[http://whgi.wmflabs.org/gender-by-language.html Gender by Wikipedia Language]''. Wikidata Human Gender Indicators (WHGI), 2017. -->
<!-- * Source: Wagner, C., Garcia, D., Jadidi, M., & Strohmaier, M. (2015, April). ''[https://www.aaai.org/ocs/index.php/ICWSM/ICWSM15/paper/viewFile/10585/10528 It's a Man's Wikipedia? Assessing Gender Inequality in an Online Encyclopedia]''. In ICWSM (pp. 454-463). -->
<!-- * Source: Wagner, C., Garcia, D., Jadidi, M., & Strohmaier, M. (2015, April). ''[https://www.aaai.org/ocs/index.php/ICWSM/ICWSM15/paper/viewFile/10585/10528 It's a Man's Wikipedia? Assessing Gender Inequality in an Online Encyclopedia]''. In ICWSM (pp. 454-463). -->
<!-- * Benjamin Collier and Julia Bear. ''[https://static1.squarespace.com/static/521c8817e4b0dca2590b4591/t/523745abe4b05150ff027a6e/1379354027662/2012+-+Collier%2C+Bear+-+Conflict%2C+confidence%2C+or+criticism+an+empirical+examination+of+the+gender+gap+in+Wikipedia.pdf Conflict, criticism, or confidence: an empirical examination of the gender gap in wikipedia contributions]''. In Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work (CSCW '12). DOI: https://doi.org/10.1145/2145204.2145265 -->
<!-- * Benjamin Collier and Julia Bear. ''[https://static1.squarespace.com/static/521c8817e4b0dca2590b4591/t/523745abe4b05150ff027a6e/1379354027662/2012+-+Collier%2C+Bear+-+Conflict%2C+confidence%2C+or+criticism+an+empirical+examination+of+the+gender+gap+in+Wikipedia.pdf Conflict, criticism, or confidence: an empirical examination of the gender gap in wikipedia contributions]''. In Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work (CSCW '12). DOI: https://doi.org/10.1145/2145204.2145265 -->
<!-- * Christina Shane-Simpson, Kristen Gillespie-Lynch, Examining potential mechanisms underlying the Wikipedia gender gap through a collaborative editing task, In Computers in Human Behavior, Volume 66, 2017, https://doi.org/10.1016/j.chb.2016.09.043. (PDF on Canvas) -->
<!-- * Christina Shane-Simpson, Kristen Gillespie-Lynch, Examining potential mechanisms underlying the Wikipedia gender gap through a collaborative editing task, In Computers in Human Behavior, Volume 66, 2017, https://doi.org/10.1016/j.chb.2016.09.043. (PDF on Canvas) -->
<!-- * Amanda Menking and Ingrid Erickson. 2015. ''[https://upload.wikimedia.org/wikipedia/commons/7/77/The_Heart_Work_of_Wikipedia_Gendered,_Emotional_Labor_in_the_World%27s_Largest_Online_Encyclopedia.pdf The Heart Work of Wikipedia: Gendered, Emotional Labor in the World's Largest Online Encyclopedia]''. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15). https://doi.org/10.1145/2702123.2702514  -->
<!-- * Amanda Menking and Ingrid Erickson. 2015. ''[https://upload.wikimedia.org/wikipedia/commons/7/77/The_Heart_Work_of_Wikipedia_Gendered,_Emotional_Labor_in_the_World%27s_Largest_Online_Encyclopedia.pdf The Heart Work of Wikipedia: Gendered, Emotional Labor in the World's Largest Online Encyclopedia]''. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15). https://doi.org/10.1145/2702123.2702514  -->
<!-- ;Crowdwork research resources -->
<!-- ;Crowdwork research resources -->
<!-- * WeArDynamo contributors. ''[http://wiki.wearedynamo.org/index.php?title=Basics_of_how_to_be_a_good_requester How to be a good requester]'' and ''[http://wiki.wearedynamo.org/index.php?title=Guidelines_for_Academic_Requesters Guidelines for Academic Requesters]''. Wearedynamo.org -->
<!-- * WeArDynamo contributors. ''[http://wiki.wearedynamo.org/index.php?title=Basics_of_how_to_be_a_good_requester How to be a good requester]'' and ''[http://wiki.wearedynamo.org/index.php?title=Guidelines_for_Academic_Requesters Guidelines for Academic Requesters]''. Wearedynamo.org -->
<br/>
<br/>
<hr/>
<hr/>
Line 282: Line 285:


;Assignments due
;Assignments due
* Week 8 Reading reflection
* Reading reflection
* A4: Final Project Plan
 
<!-- ;Agenda -->
<!-- ;Agenda -->
<!-- {{:HCDS (Fall 2018)/Day 6 plan}} -->
<!-- {{:HCDS (Fall 2018)/Day 6 plan}} -->
;Readings assigned
;Readings assigned
* Hill, B. M., Dailey, D., Guy, R. T., Lewis, B., Matsuzaki, M., & Morgan, J. T. (2017). ''[https://mako.cc/academic/hill_etal-cdsw_chapter-DRAFT.pdf Democratizing Data Science: The Community Data Science Workshops and Classes].'' In N. Jullien, S. A. Matei, & S. P. Goggins (Eds.), Big Data Factories: Scientific Collaborative approaches for virtual community data collection, repurposing, recombining, and dissemination.
* Hill, B. M., Dailey, D., Guy, R. T., Lewis, B., Matsuzaki, M., & Morgan, J. T. (2017). ''[https://mako.cc/academic/hill_etal-cdsw_chapter-DRAFT.pdf Democratizing Data Science: The Community Data Science Workshops and Classes].'' In N. Jullien, S. A. Matei, & S. P. Goggins (Eds.), Big Data Factories: Scientific Collaborative approaches for virtual community data collection, repurposing, recombining, and dissemination.
;Homework assigned
;Homework assigned
* Week 9 Reading reflection
* Reading reflection
* A4: Final presentation (due week 11)
 
<!-- ;Resources -->
<!-- ;Resources -->
<!-- * Ethical OS ''[https://ethicalos.org/wp-content/uploads/2018/08/Ethical-OS-Toolkit-2.pdf Toolkit]'' and ''[https://ethicalos.org/wp-content/uploads/2018/08/EthicalOS_Check-List_080618.pdf Risk Mitigation Checklist]''. EthicalOS.org. -->
<!-- * Ethical OS ''[https://ethicalos.org/wp-content/uploads/2018/08/Ethical-OS-Toolkit-2.pdf Toolkit]'' and ''[https://ethicalos.org/wp-content/uploads/2018/08/EthicalOS_Check-List_080618.pdf Risk Mitigation Checklist]''. EthicalOS.org. -->
Line 314: Line 321:
<!-- * Julia Angwin, Jeff Larson, Surya Mattu and Lauren Kirchner. ''[https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing Machine Bias: Risk Assessment in Criminal Sentencing]. Propublica, May 2018. -->
<!-- * Julia Angwin, Jeff Larson, Surya Mattu and Lauren Kirchner. ''[https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing Machine Bias: Risk Assessment in Criminal Sentencing]. Propublica, May 2018. -->
<!-- * [https://www.perspectiveapi.com/#/ Google's Perspective API] -->
<!-- * [https://www.perspectiveapi.com/#/ Google's Perspective API] -->


<br/>
<br/>
<hr/>
<hr/>
<br/>
<br/>
<!-- === Week 7 === -->
<!-- === Week 7 === -->
<!-- <\!-- [[HCDS_(Fall_2018)/Day_7_plan|Day 7 plan]] -\-> -->
<!-- <\!-- [[HCDS_(Fall_2018)/Day_7_plan|Day 7 plan]] -\-> -->
Line 341: Line 350:
<!-- <hr/> -->
<!-- <hr/> -->
<!-- <br/> -->
<!-- <br/> -->
=== Week 9 ===
=== Week 9 ===
<!-- [[HCDS_(Fall_2018)/Day_8_plan|Day 9 plan]] -->
<!-- [[HCDS_(Fall_2018)/Day_8_plan|Day 9 plan]] -->
;Data science for social good: ''Community-based and participatory approaches to data science; Using data science for society's benefit''
;Data science for social good: ''Community-based and participatory approaches to data science; Using data science for society's benefit''
;Assignments due
;Assignments due
* Week 9 reading reflection
* Reading reflection
* A4: Final project plan
 
<!-- ;Agenda -->
<!-- ;Agenda -->
<!-- {{:HCDS (Fall 2018)/Day 9 plan}} -->
<!-- {{:HCDS (Fall 2018)/Day 9 plan}} -->
Line 353: Line 367:


;Homework assigned
;Homework assigned
* A4: Final presentation (due week 11)
* Reading reflection
* Reading reflection


;Resources
;Resources
*  Daniela Aiello, Lisa Bates, et al. [https://shelterforce.org/2018/08/22/eviction-lab-misses-the-mark/ Eviction Lab Misses the Mark], ShelterForce, August 2018.   
*  Daniela Aiello, Lisa Bates, et al. [https://shelterforce.org/2018/08/22/eviction-lab-misses-the-mark/ Eviction Lab Misses the Mark], ShelterForce, August 2018.   
<br/>
<br/>
<hr/>
<hr/>
Line 365: Line 380:
=== Week 10 ===
=== Week 10 ===
<!-- [[HCDS_(Fall_2018)/Day_10_plan|Day 10 plan]] -->
<!-- [[HCDS_(Fall_2018)/Day_10_plan|Day 10 plan]] -->
<!-- [[:File:HCDS 2018 week 10 slides.pdf|Day 10 slides]] -->
<!-- [[:File:HCDS 2018 week 10 slides.pdf|Day 10 slides]] -->
;User experience and big data: ''Design considerations for machine learning applications; human centered data visualization; data storytelling''
;User experience and big data: ''Design considerations for machine learning applications; human centered data visualization; data storytelling''


;Assignments due
;Assignments due
* Week 10 reading reflection
* Reading reflection


<!-- ;Agenda -->
<!-- ;Agenda -->
Line 378: Line 395:


;Homework assigned
;Homework assigned
* A4: Final presentation
* A5: Final presentation
 
<!-- ;Resources -->
<!-- ;Resources -->
<!-- *Fabien Girardin. ''[https://medium.com/@girardin/experience-design-in-the-machine-learning-era-e16c87f4f2e2 Experience design in the machine learning era].'' Medium, 2016. -->
<!-- *Fabien Girardin. ''[https://medium.com/@girardin/experience-design-in-the-machine-learning-era-e16c87f4f2e2 Experience design in the machine learning era].'' Medium, 2016. -->
Line 392: Line 410:
<!-- * Megan Risdal, ''[http://blog.kaggle.com/2016/06/13/communicating-data-science-an-interview-with-a-storytelling-expert-tyler-byers/ Communicating data science: an interview with a storytelling expert].'' Kaggle blog, 2016. -->
<!-- * Megan Risdal, ''[http://blog.kaggle.com/2016/06/13/communicating-data-science-an-interview-with-a-storytelling-expert-tyler-byers/ Communicating data science: an interview with a storytelling expert].'' Kaggle blog, 2016. -->
<!-- * Brent Dykes, ''[https://www.forbes.com/sites/brentdykes/2016/03/31/data-storytelling-the-essential-data-science-skill-everyone-needs/ Data Storytelling: The Essential Data Science Skill Everyone Needs].'' Forbes, 2016. -->
<!-- * Brent Dykes, ''[https://www.forbes.com/sites/brentdykes/2016/03/31/data-storytelling-the-essential-data-science-skill-everyone-needs/ Data Storytelling: The Essential Data Science Skill Everyone Needs].'' Forbes, 2016. -->
<br/>
<br/>
<hr/>
<hr/>
Line 403: Line 422:


;Assignments due
;Assignments due
* A4: Final presentation
* A5: Final presentation
 
 
<!-- ;Agenda -->
<!-- ;Agenda -->
<!-- {{:HCDS (Fall 2018)/Day 11 plan}} -->
<!-- {{:HCDS (Fall 2018)/Day 11 plan}} -->
Line 411: Line 432:


;Homework assigned
;Homework assigned
* A5: Final project report (by 11:59pm)
* A6: Final project report (by 11:59pm)


<!-- ;Resources -->
<!-- ;Resources -->
Line 422: Line 443:
=== Week 12: Finals Week (No Class Session) ===
=== Week 12: Finals Week (No Class Session) ===
* NO CLASS
* NO CLASS
* A5.: FINAL PROJECT REPORT DUE BY 11:59PM
* A6: FINAL PROJECT REPORT DUE BY 11:59PM
<!-- * LATE PROJECT SUBMISSIONS NOT ACCEPTED. -->
<!-- * LATE PROJECT SUBMISSIONS NOT ACCEPTED. -->


Line 469: Line 490:
For this assignment you will propose a midterm project and use the skills you have learned in the CDSW to collect or present a dataset. You will turn in a one-page project description that
For this assignment you will propose a midterm project and use the skills you have learned in the CDSW to collect or present a dataset. You will turn in a one-page project description that


:* Identifies a dataset for analysis, and what makes it interesting to you.
* Identifies a dataset for analysis, and what makes it interesting to you.
:* Explains how the source of the data, how did you get it?
* Explains how the source of the data, how did you get it?
:* Describes 2-3 questions that the data can help answer, and explain how you will answer them.
* Describes 2-3 questions that you hope the data can help answer
:* What results you expect or intend, and most importantly, why your project is interesting or important (and to whom, besides yourself).
* Includes a table of summary statistics (minimum, maximum, median, and mean values) for variables in your dataset related to these questions  
:* Includes a table of summary statistics (minimum, maximum, median, and mean values) for variables in your dataset related to these questions  


I hope that you find a dataset related to your own interests, such as data from your workplace, community, or any other organization you may be involved in.  For some ideas about where to look for datasets related to your interests, [[HCDS (Fall 2017)/Datasets | see this page]] with examples of freely available datasets that you can use for this project.  
I hope that you find a dataset related to your own interests, such as data from your workplace, community, or any other organization you may be involved in.  For some ideas about where to look for datasets related to your interests, [[see this page | HCDS (Fall 2017)/Datasets]] with examples of freely available datasets that you can use for this project.  


==== Evaluation and Rubric ====
==== Rubric ====


:''Dataset identification:'' 20%
'''Dataset identification:''' 25%
:''Explaination of data source:'' 20%
'''Explaination of data source:''' 25%
:''Example questions:'' 20%
'''Example questions:''' 25%
:''Anticipated results and their significance:'' 20%
'''Summary  statistics:''' 25%
:''Summary  statistics:'' 20%


=== A2: Data analysis ===
=== A2: Data analysis ===


For this exercise, you will design and execute the analysis that you proposed in A1. You must attempt to answer to the questions you posed using your new data science skills, but you must also practice a kind of "meta-analysis" of your analysis to understand limitations and potential consequences of your analysis. Turn in a report of about 1500 words with about equal space dedicated to:
=== A4: Final project plan ===
For this assignment, you will write up a study plan for your final class project. The plan will cover a variety of details about your final project. Identify the organization that you will work with, the data you will use, what you will do with the data (e.g. statistical analysis, train a model), what results you expect or intend, and most importantly, why your project is interesting or important (and to whom, besides yourself).


* Presenting your analysis: what did you do and what did you find out? Communicate your findings though using at least one chart or table.
=== A5: Final project presentation ===
* Explaining the significance of your analysis to the (real or hypothetical) organization or community that will make use of it. Why should we care about this analysis?
For this assignment, you will give an in-class presentation of your final project. The goal of this assignment is to demonstrate that you are able to effectively communicate your research questions, methods, conclusions, and implications to your target audience.
* Critique of the analysis in terms of both what you did and how it might be used. How might your analysis improve through better data or analysis? What assumptions underlie your interpretation of it?  How might (or might not) this analysis influence or mislead its audiences? 


==== Evaluation and Rubric ====
=== A6: Final project report ===
:''Presentation of data analysis:'' 40%
For this assignment, you will publish the complete code, data, and analysis of your final research project. The goal is to demonstrate that you can incorporate all of the human-centered design considerations you learned in this course and create research artifacts that are understandable, impactful, and reproducible.
:''Appropriate chart or table:'' 5%
:''Explanation of applicability to community or organization:'' 20%
:''Critique in terms of improving the analysis :'' 10%
:''Critique in terms of application :'' 10%
:''Writing quality (see [[User:Benjamin Mako Hill/Assessment#Writing Rubric | the writing rubric]]): 15%''


=== A3: Final project plan ===
For this assignment, you will write up a study plan for your final class project. The goal of this project is for you to apply what you have learned about data science studies to understanding and improving data science practice in an organizational context. The plan will cover a variety of details about your final project.
Specifically your plan should:
* Identify the organization that you will work with, and your contact there.
* Summarize what you already know about this organization and how they use data science.
* Identify a research question that you don't already know the answer to, but where project can realistically help you answer it.  This should be specific and tied to a particular aspect of how data science is practiced or is used in this organization. Effective research questions will often raise issues or problems with the organization. 
* Outline your plan to collect qualitative data. Will you conduct interviews? Who with? What questions will you ask? Will you conduct workplace observations?
* Explain what results you expect or intend, and most importantly, why your project is interesting or important (and to whom, besides yourself).
A reasonable amount of data collection for this project is about 4 30 minutes interviews with different team members, about 4 hours of workplace observation, or an equivalent combination of interviews and observation.
Maximum length: 1500 words.
==== Evaluation and Rubric ====
:''Organization and identification:'' 20%
:''Summary of what you already know:'' 20%
:''Research question:'' 20%
:''Data collection plan:'' 20%
:''Anticipated results and their significance:'' 20%
=== A4: Final project presentation ===
For this assignment, you will give an in-class presentation of your final project. The goal of this assignment is to demonstrate that you are able to effectively communicate your research questions, methods, conclusions, and implications to your target audience.  This is your chance to get quality feedback on your project from me and from your classmates. Key elements that you should cover in your presentation include:
:* What organization or team you studied
:* Your research question about how data science functions in this organization
:* Your findings: what you learned about your research question from your data in relation to course material.
:* Who you observed or interviewed
:* At least one quotation or anecdote from your qualitative data that support your findings
:* Tentative recommendations to the organization based on your findings
==== Evaluation and Rubric ====
:''Presentation organization and design:'' 15%
:''Explaination and justification of a research question:'' 15%
:''Presentation of evidence:'' 25%
:''Findings:'' 25%
:''Recommendations to the organization:'' 20%
=== A5: Final project report ===
In the final report, I expect you to take feedback from your presentation and and report on your project in up to 3000 words. You can organize your paper however you want, but it should do the following:
:* Introduce the organization or team you studied
:* Document how you collected data with the team (who you interviewed or observed, for how long, describe their jobs and roles).
:* Motivate your project in terms of your substantive interest, curiosity, and course concepts.
:* Introduce and articulate a specific research question about how data science functions in this organization. This will often be driven by a particular challenge or issue the team you are studying faces.
:*  The bulk of your report (about 2000 words) should argue for an understanding of the research question based on
::* Your empirical findings: what you learned about the organization and the practice of data science from your own observations or interviews. Use anecdotes and quotes as appropriate to support your argument.
::* Any course material relevant to the challenge or issue at hand.
:* Make recommendations to the organization based on your findings.
Optionally, you may make recommendations to me about the course material in relation to this project.  Was there anything that you expected to see based on the course material that you didn't observe? Was there anything interesting that you observed that the course didn't address? How might you improve the course given what you learned?
==== Evaluation and Rubric ====
:''Writing quality (see [[User:Benjamin Mako Hill/Assessment#Writing Rubric | the writing rubric]]): 15%''
:''Explaination and justification of a research question:'' 15%
:''Presentation of evidence:'' 25%
:''Findings:'' 25%
:''Recommendations to the organization:'' 20%
:''Course feedback (Extra credit):'' 3%


== Policies ==
== Policies ==
The following general policies apply to this course.
The following general policies apply to this course.
=== Grades ===
Grades will be determined as follows:
* 20% Participation
* 20% Reading reflections
* 5% Midterm proposal
* 15% Midterm report
* 5% Final project proposal
* 10% Final project presentation
* 25% Final project report
You are expected to produce work in all of the assignments that reflects the highest standards of professionalism. For written documents, this means proper spelling, grammar, and formatting.
Late assignments will not be accepted; if your assignment is late, you will receive a zero score. Again, if you run into an issue that necessitates an extension, please reach out.


=== Attendance ===
=== Attendance ===


As detailed in [[User:Benjamin Mako Hill/Assessment | this page on assessment]], attendance in class is expected of all participants. If you need to miss class for any reason, please contact a member of the teaching team ahead of time (email is best). Multiple unexplained absences will likely result in a lower grade or (in extreme circumstances) a failing grade. In the event of an absence, you are responsible for obtaining class notes, handouts, assignments, etc.  
As detailed in [[Teaching Assessment | my page on assessment]], attendance in class is expected of all participants. If you need to miss class for any reason, please contact a member of the teaching team ahead of time (email is best). Multiple unexplained absences will likely result in a lower grade or (in extreme circumstances) a failing grade. In the event of an absence, you are responsible for obtaining class notes, handouts, assignments, etc.  


=== Respect ===  
=== Respect ===  
Line 635: Line 571:
=== Disability and accommodations ===
=== Disability and accommodations ===


As part of ensuring that the class is as accessible as possible, the instructors are entirely comfortable with you using whatever form of note-taking method or recording is most comfortable to you, including laptops and audio recording devices. The instructors will do their best to ensure that all slides and scripts/notes are immediately available online after a lecture has concluded. In addition, if asked ahead of time we can try to record the audio of individual lectures for students who have learning differences that make audiovisual notes preferable to written ones.
As part of ensuring that the class is as accessible as possible, the instructors are entirely comfortable with you using whatever form of note-taking method or recording is most comfortable to you, including laptops and audio recording devices. The instructors will do their best to ensure that all slides and scripts/notes are immediately available online after a lecture has concluded. In addition, if asked ahead of time we can try to record the audio of individial lectures for students who have learning differences that make audiovisual notes preferable to written ones.


If you require additional accommodations, please contact Disabled Student Services: 448 Schmitz, 206-543-8924 (V/TTY). If you have a letter from DSS indicating that you have a disability which requires academic accommodations, please present the letter to the instructors so we can discuss the accommodations you might need in the class. If you have any questions about this policy, reach out to the instructors directly.
If you require additional accommodations, please contact Disabled Student Services: 448 Schmitz, 206-543-8924 (V/TTY). If you have a letter from DSS indicating that you have a disability which requires academic accommodations, please present the letter to the instructors so we can discuss the accommodations you might need in the class. If you have any questions about this policy, reach out to the instructors directly.
Line 641: Line 577:
For more information on disability accommodations, and how to apply for one, please review [http://depts.washington.edu/uwdrs/current-students/accommodations/ UW's Disability Resources for Students].
For more information on disability accommodations, and how to apply for one, please review [http://depts.washington.edu/uwdrs/current-students/accommodations/ UW's Disability Resources for Students].


=== Grades ===
Grades will be determined as follows:
* 20% Participation
* 20% Reading reflections
* 20% Midterm project
* 40% Final project
You are expected to produce work in all of the assignments that reflects the highest standards of professionalism. For written documents, this means proper spelling, grammar, and formatting.
Late assignments will not be accepted; if your assignment is late, you will receive a zero score. Again, if you run into an issue that necessitates an extension, please reach out.


[[Category:Groceryheist drafts]]
[[Category:Groceryheist drafts]]
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see CommunityData:Copyrights for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource. Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel Editing help (opens in new window)