Main Page


CDSC members plus affiliates and guests at Northwestern University in September 2019. Back row, from left to right: Aaron, Nate, Jeremy, Mako, Jim, Charlie, Regina, Salt. Front row, from left to right: Sohyeon, Kaylea, Nick, Sejal, Floor, Jackie.

The Community Data Science Collective is an interdisciplinary research group made up of faculty and students at the University of Washington Department of Communication, the Northwestern University Department of Communication Studies, the University of North Carolina School of Information and Library Science, the Carleton College Computer Science Department, and the Purdue University School of Communication.

We are social scientists applying a range of quantitative and qualitative methods to the study of online communities. We seek to understand both how and why some attempts at collaborative production, like Wikipedia and Linux, build large volunteer communities and high-quality work products.

Our research is particularly focused on how the design of communication and information technologies shapes fundamental social outcomes with broad theoretical and practical implications, such as an individual’s decision to join a community or contribute to a public good, or a group’s ability to make decisions democratically.

Our research is deeply interdisciplinary, most frequently consists of “big data” quantitative analyses, and lies at the intersection of communication, sociology, and human-computer interaction.

Workshops and Courses

In addition to research, we run workshops and teach classes. Some of that work is coordinated on this wiki. A more detailed list of the workshops and teaching material on this wiki is on our Workshops and Classes page. On this page, we list only ongoing classes and workshops.

Public Data Science Workshops

Community Data Science Workshops — The Community Data Science Workshops (CDSW) are a series of workshops designed to introduce some of the basic tools of programming and analysis of data from online communities to absolute beginners. The CDSW have been held roughly twice a year since beginning in Seattle in 2014. So far, more than 100 people have volunteered their weekends to teach more than 500 people to program in Python, to build datasets from Web APIs, and to ask and answer questions using these data.

University of Washington Courses

Northwestern Courses

  • [Winter 2020] History and Theory of Information — We live in an information age, with computers of unprecedented power in our pockets. This course seeks to understand how information shapes our lives today, and how it has in the past. It does so via an interdisciplinary inquiry into four technological infrastructures of information and communication—print, wires, airwaves, and bits. Co-taught by Aaron Shaw and Daniel Immerwahr.


Research Resources

If you are a member of the collective, perhaps you're looking for CommunityData:Resources which includes details on email, TeX templates, documentation on our computing resources, etc.

Research News

Follow us as @comdatasci on Twitter and subscribe to the Community Data Science Collective blog.

Recent posts from the blog include:

On The Challenges of Governing the Online Commons
Over the past several months (post-general exam!), I have been thinking and reading about organizational and institutional perspectives on the governance of platforms and the online communities that populate them. While much of the research on the emerging area of “platform governance” draws from legal traditions or socio-technical approaches, there is also a smaller subset …
— Zarine Kharazian http://zarine.net 2024-11-21
CDSC at CSCW 2024: Moderation, Bots, Taboos, and Governance Capture!
If you are attending the ACM conference on Computer-supported Cooperative Work and Social Computing (CSCW) this year in San José, Costa Rica, you are warmly invited to join CDSC members during our talks and other scheduled events. Please come say hi! CDSC has four papers at CSCW, which we will be presenting over …
— kaylea 2024-11-11
Dr. Yoel Roth: Online Safety and Security
On Oct. 23, 2024, Dr. Yoel Roth gave a lecture titled “Decentralizing online safety and security: The promises and perils of federated social media” hosted by the Department of Human-Centered Design and Engineering at the University of Washington, and a number of CDSC faculty and students were present and discussed issues of digital governance with …
— madisondeyo 2024-11-09
FOSSY 2024 Wrap Up: Sophia Vargas on “A review of valuation models and their application to open source models”
In the seventh talk of the Science of Community track we organized for FOSSY, Google FOSS researcher Sophia Vargas offered an overview of different strategies for measuring the value of open source (particularly in the context of a company thinking about how to engage with FOSS). Some of Sophia’s key insights are: models for measuring …
— kaylea 2024-10-15

About This Wiki

This wiki is open to the public and hackable by all, but it mostly contains information that will be useful to collective members, their collaborators, people enrolled in their projects, or people interested in building off of their work. If you're interested in making a change or creating content here, generally feel empowered to Be Bold. If things don't fit, somebody who watches this wiki will be in touch.

This is mostly a normal MediaWiki installation, although there are a few things to know:

  • There's a CAPTCHA enabled. If you create an account and then contact any collective member with the username (on or off wiki), they can turn the CAPTCHA off for you.
  • Extension:Math is installed so you can write math here. To add math, put TeX inside <math> tags like this: <math>\frac{\sigma}{\sqrt{n}}</math>