Main Page



The Community Data Science Collective is an interdisciplinary research group made of up of faculty and students at the University of Washington Department of Communication and the Northwestern University Department of Communication Studies.

We are social scientists applying a range of quantitative and qualitative methods to the study of online communities. We seek to understand both how and why some attempts at collaborative production — like Wikipedia and Linux — build large volunteer communities and high quality work products.

Our research is particularly focused on how the design of communication and information technologies shape fundamental social outcomes with broad theoretical and practical implications — like an individual’s decision to join a community, contribute to a public good, or a group’s ability to make decisions democratically.

Our research is deeply interdisciplinary, most frequently consists of “big data” quantitative analyses, and lies at the intersection of communication, sociology, and human-computer interaction.

Workshops and Courses
In addition to research, we run workshops and teach classes. Some of that work is coordinated on this wiki. A more detailed lists of workshops and teaching material on this wikis is on our Workshops and Classes page. In this page, we only list ongoing classes and workshops.

Public Data Science Workshops
Community Data Science Workshops — The Community Data Science Workshops (CDSW) are a series of workshops designed to introduce some of the basic tools of programming and analysis of data from online communities to absolute beginners. The CDSW have been held roughly twice a year since beginning in Seattle in 2014. So far, more than 100 people have volunteered their weekends to teach more than 500 people to program in Python, to build datasets from Web APIs, and to ask and answer questions using these data.

University of Washington Courses

 * [Fall 2017] DATA512: Human Centered Data Science — Fundamental principles of data science and its human implications. Data ethics; data privacy; differential privacy; algorithmic bias; legal frameworks and intellectual property; provenance and reproducibility; data curation and preservation; user experience design and usability testing for big data; ethics of crowdwork; data communication; and societal impacts of data science.


 * [Fall 2017] COM597: Innovation Communities — A UW Communication Leadership’s elective in the “Masters in Communication in Communities and Networks” program covering using online communities to harness user innovation taught by Benjamin Mako Hill.

Northwestern Courses & Workshop

 * Bring Your Own Research Workshop (BYOR) — A research workshop for CDSC affiliates and fellow travelers at Northwestern convened by Aaron Shaw. Participants present work and provide peer feedback/accountability in weekly meetings. Most members of the group are affiliates of the Media, Technology & Society and Technology & Social Behavior programs at Northwestern and study online communities, collective action, organizations, collaboration, and related topics.

Research Resources
If you are a member of the collective, perhaps you're looking for CommunityData:Resources which includes details on email, TeX templates, documentation on our computing resources, etc.

Research News
Follow us as @comdatasci on Twitter and subscribe to the Community Data Science Collective blog.

Recent posts from the blog include:

https://blog.communitydata.cc/feed/atom/

About This Wiki
This is open to the public and hackable by all but mostly contains information that will be useful to collective members, their collaborators, people enrolled in their projects, or people interested in building off of their work. If you're interested in making a change or creating content here, generally feel empowered to Be Bold. If things don't fit, somebody who watches this wiki will be in touch.

This is mostly a normal MediaWiki although there are a few things to know:


 * There's a CAPTCHA enabled. If you create an account and then contact any collective member with the username (on or off wiki), they can turn the CAPTCHA off for you.
 * Extension:Math is installed so you can write math here. Basically you just add math by putting TeX inside  tags like this: $$\frac{\sigma}{\sqrt{n}}$$