Main Page
The Community Data Science Collective is an interdisciplinary research group made up of faculty and students at the University of Washington Department of Communication, the Northwestern University Department of Communication Studies, the Carleton College Computer Science Department, and the Purdue University School of Communication.
We are social scientists applying a range of quantitative and qualitative methods to the study of online communities. We seek to understand both how and why some attempts at collaborative production — like Wikipedia and Linux — build large volunteer communities and high quality work products.
Our research is particularly focused on how the design of communication and information technologies shape fundamental social outcomes with broad theoretical and practical implications — like an individual’s decision to join a community, contribute to a public good, or a group’s ability to make decisions democratically.
Our research is deeply interdisciplinary, most frequently consists of “big data” quantitative analyses, and lies at the intersection of communication, sociology, and human-computer interaction.
To learn more about the CDSC, please check out our about page (especially the links there). Prospective students should also review these materials.
Courses
In addition to research, we teach classes and run workshops. Some of that work is coordinated on this wiki. A more detailed lists of workshops and teaching material on this wiki is on our Workshops and Classes page. In this page, we only list ongoing classes and workshops.
Purdue Courses
- [Summer 2023] Advanced Computational Communication Methods – In this class, we will investigate a number of more advanced methods or concepts not covered in the Intro to Programming and Data Science course, including SQL, computational text analysis, creating reproducible projects, and advanced visualization.
- [Spring 2023] Quantitative Methods for Communication – This course introduces students to a range of social-scientific research methods used to investigate human communication, with a focus on research design, statistics, and statistical software. Taught by Jeremy Foote and Hazel Chiu.
University of Washington Courses
- [Fall 2024] COM 481: Online Communities — An undergraduate course on online communities taught by Benjamin Mako Hill and Ellie Mercedes Ross.
- [Fall 2024] COMMLD 570A/COM 597A: Building Successful Online Communities — A course on online communities taught by Benjamin Mako Hill in both the Communication Leadership Masters Program (COMMLD) and the Department of Communication MA/PhD program (COM 597A). Registration in the latter is very limited.
Public Data Science Workshops
Community Data Science Workshops — The Community Data Science Workshops (CDSW) are a series of workshops designed to introduce some of the basic tools of programming and analysis of data from online communities to absolute beginners. The CDSW have been held six times in Seattle between 2014 and 2020. So far, more than 100 people have volunteered their weekends to teach more than 500 people to program in Python, to build datasets from Web APIs, and to ask and answer questions using these data.
Research Resources
If you are a member of the collective, perhaps you're looking for CommunityData:Resources which includes details on email, TeX templates, documentation on our computing resources, etc.
About This Wiki
This is open to the public and hackable by all but mostly contains information that will be useful to collective members, their collaborators, people enrolled in their projects, or people interested in building off of their work. If you're interested in making a change or creating content here, generally feel empowered to Be Bold. If things don't fit, somebody who watches this wiki will be in touch.
This is mostly a normal MediaWiki although there are a few things to know:
- There's a CAPTCHA enabled. If you create an account and then contact any collective member with the username (on or off wiki), they can turn the CAPTCHA off for you.
- Extension:Math is installed so you can write math here. Basically you just add math by putting TeX inside <math> tags like this: <math>\frac{\sigma}{\sqrt{n}}</math> and it will write .
Research News
Follow us as @comdatasci on Twitter and @communitydata@social.coop in the Fediverse/Mastodon and subscribe to the Community Data Science Collective blog.
Recent posts from the blog include:
- FOSSY 2024 Wrap Up: Sophia Vargas on “A review of valuation models and their application to open source models”
- In the seventh talk of the Science of Community track we organized for FOSSY, Google FOSS researcher Sophia Vargas offered an overview of different strategies for measuring the value of open source (particularly in the context of a company thinking about how to engage with FOSS). Some of Sophia’s key insights are: models for measuring …
- — kaylea 2024-10-15
- Check Out the PhD Q&A Session!
- Missed the prospective student Q&A session? Fear not, you can still hear from our faculty members, see a few examples of current students research, and listen to answers for our prospective student audience. You can find more resources about the Community Data Science Collective below: Still have questions for our group? Check out our people …
- — madisondeyo 2024-10-23
- FOSSY 2024 Wrap Up: Darius Kazemi on “Community governance models on small-to-mid-size Mastodon servers
- In the sixth talk of the Science of Community track we organized for FOSSY, independent FOSS researcher Darius Kazemi described the results of an interview study to learn from the moderation teams of decentralized social network servers. One of Darius’ key observations is the extensive compliance and legally-required work that running such a server requires. …
- — kaylea 2024-10-14
- FOSSY 2024 Wrap Up: Bogdan Vasilescu on “Navigating Dependency Abandonment”
- In the final talk of the Science of Community track we organized for FOSSY, Computer Science professor and FOSS researcher Dr. Bogdan Vasilescu described his team’s work to understand how developers think about abandoned dependencies. One of the key insights from this work is that abandonment of dependencies is quite common, but that updating a …
Continue reading "FOSSY 2024 Wrap Up: Bogdan Vasilescu on “Navigating Dependency Abandonment”"
- — kaylea 2024-10-21
- FOSSY 2024 Wrap Up: Kaylea Champion on “Research Says…..Insights on Building, Leading, and Sustaining Open Source”
- In the fifth talk of the Science of Community track we organized for FOSSY, Dr. Kaylea Champion describes a series of research results on both how to build high-quality FOSS and how to sustain a community alongside it. One of her key insights is that a great community is no guarantee of a high-quality project …
- — kaylea 2024-10-15