# Main Page

CDSC members plus affiliates and guests at UW April 2019. From left: From left to right the people in the picture are: Jeremy, Nate, Charlie, Kaylea, Sejal, Jonathan, Emilia, Mako, Morten, Jim, Isaac, Salt, Abel, and Sayamindu.

The Community Data Science Collective is an interdisciplinary research group made of up of faculty and students at the University of Washington Department of Communication and the Northwestern University Department of Communication Studies.

We are social scientists applying a range of quantitative and qualitative methods to the study of online communities. We seek to understand both how and why some attempts at collaborative production — like Wikipedia and Linux — build large volunteer communities and high quality work products.

Our research is particularly focused on how the design of communication and information technologies shape fundamental social outcomes with broad theoretical and practical implications — like an individual’s decision to join a community, contribute to a public good, or a group’s ability to make decisions democratically.

Our research is deeply interdisciplinary, most frequently consists of “big data” quantitative analyses, and lies at the intersection of communication, sociology, and human-computer interaction.

## Workshops and Courses

In addition to research, we run workshops and teach classes. Some of that work is coordinated on this wiki. A more detailed lists of workshops and teaching material on this wikis is on our Workshops and Classes page. In this page, we only list ongoing classes and workshops.

### Public Data Science Workshops

Community Data Science Workshops — The Community Data Science Workshops (CDSW) are a series of workshops designed to introduce some of the basic tools of programming and analysis of data from online communities to absolute beginners. The CDSW have been held roughly twice a year since beginning in Seattle in 2014. So far, more than 100 people have volunteered their weekends to teach more than 500 people to program in Python, to build datasets from Web APIs, and to ask and answer questions using these data.

### Northwestern Courses & Workshop

• [Spring 2019] MTS 525: Statistics and Statistical Programming — A quarter-long quantitative methods course that builds a first-quarter introduction to quantitative methodology and that focuses on both the more mathematical elements of statistics as well as the nuts-and-bolts of statistical programming in the GNU R programming language. Taught by Aaron Shaw.
• [Spring 2019] MTS 503: The Practice of Scholarship — A workshop-style course dedicated to the submission of an original (lead or sole authored) piece of academic research for publication by the end of the quarter. The course and assignments require weekly writing and feedback from all participants (required of all second year Ph.D. students in the MTS and TSB Ph.D. programs). Taught by Aaron Shaw

## Research Resources

If you are a member of the collective, perhaps you're looking for CommunityData:Resources which includes details on email, TeX templates, documentation on our computing resources, etc.

## Research News

Follow us as @comdatasci on Twitter and subscribe to the Community Data Science Collective blog.

Tor users: An untapped resource for Wikipedia?
Like everyone else, Internet users who protect their privacy by using the anonymous browsing software Tor are welcome to read Wikipedia. However, when Tor users try to contribute to the self-described “encyclopedia that anybody can edit,” they typically come face-to-face with a notice explaining that their participation is not welcome. Our new paper—led by Chau …
— Benjamin Mako Hill http://mako.cc 2020-05-22
COVID-19 Digital Observatory awarded Open Innovation Grant from Protocol Labs Research
Last week, Protocol Labs Research announced their COVID-19 Open Innovation Grant recipients and we thrilled to announce that the Community Data Science Collective’s COVID-19 Digital Observatory is among the awarded projects! Protocol Labs works to improve internet technologies through open source protocols, systems, and tools. The organization initially grew out of efforts to apply blockchain …
— Aaron Shaw http://aaronshaw.org 2020-05-22
What do people do when they edit Wikipedia through Tor?
A paper recently published at CSCW describes the results of a forensic qualitative analysis of contributions made to Wikipedia through the anonymous browsing system Tor. The project was conducted collaboratively with researchers from Drexel, NYU, and the University of Washington and complements a quantitative analysis of the same data we also published to provide a …
— kaylea 2020-05-22
Sohyeon Hwang awarded NSF Graduate Research Fellowship
Congratulations to Sohyeon Hwang who has been awarded a Graduate Research Fellowship by the U.S. National Science Foundation!
— Community Data Science Collective https://communitydata.cc/ 2020-04-18

• Extension:Math is installed so you can write math here. Basically you just add math by putting TeX inside <nowiki> tags like this: $\frac{\sigma}{\sqrt{n}}$