DSCC

From CommunityData
Revision as of 22:28, 5 April 2016 by Groceryheist (talk | contribs) (→‎Plan Workshops)

This is the page for the Data Science in Communication Collective.

TODO

  • Fill out the wiki page
  • Get accounts on the community data blog
  • Make introductory blog post
  • Plan workshops
  • Get mailing list
  • Get a WordPress subdirectory from Mako for an ordinary web page.
  • Get Twitter handle
  • Talk to students in content analysis; they might do automated content analysis.

Planned Blog Posts

  • Panama Papers Social Network Analysis - data preparation (Matthew)
  • Panama Papers Visualizing Social Network (Matthew)
  • Panama Papers Network Statistics (Matthew)
  • Change.org - scraping - API (Nate)
  • Change.org - language filtering - API / Parallelism (Nate)
  • Change.org - spam filtering - API (Nate)
  • Change.org - topic modeling - Machine Learning (Nate)
  • Change.org - signature prediction - Pandas / Machine Learning (Nate)
  • Markov Chain Twitter Bot Tutorial - API / Machine Learning (Nate)
  • Hashtag Network Visualization (Matthew)

Plan Workshops

Tuesday 12:30 - 1:30

  • Send email to CDSW students.
  • Send email to Internet Research Methods students.
  • Python Tutor - review CDSW week 1.

Week 1

Randomly assign students to small groups; randomly assign the order in which students give speeches.

  • import random
  • variable assignment
  • difference between = and ==
  • lists - setting and accessing, range slices
  • for loops - doing something for each thing in the list
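
The Week 1 exercise could be sketched roughly as follows. This is only an illustration, not a settled lesson plan: the roster names, the group size of 3, and the helper names assign_groups and speech_order are all assumptions made for the example.

```python
import random

# Hypothetical roster; these names are placeholders, not actual students.
students = ["Ana", "Ben", "Chloe", "Dev", "Elena", "Farid", "Grace"]

# Assumed group size, chosen only for illustration.
group_size = 3


def assign_groups(names, size, seed=None):
    """Shuffle a copy of names and cut it into groups of at most `size`."""
    rng = random.Random(seed)   # a seed makes the result repeatable
    shuffled = list(names)      # copy, so the original list is not changed
    rng.shuffle(shuffled)
    groups = []
    # range(start, stop, step) gives 0, size, 2*size, ...
    for start in range(0, len(shuffled), size):
        # A slice takes the items from start up to (not including) start + size.
        groups.append(shuffled[start:start + size])
    return groups


def speech_order(names, seed=None):
    """Return the names in a random speaking order."""
    rng = random.Random(seed)
    return rng.sample(names, len(names))


groups = assign_groups(students, group_size)
for number, group in enumerate(groups, start=1):
    print("Group", number, group)

order = speech_order(students)
for position, name in enumerate(order, start=1):
    print(position, name)

# "=" assigns a value to a name; "==" asks whether two values are equal.
same_length = len(order) == len(students)
print(same_length)
```

Running it several times without a seed gives different groups and orders each time; passing a seed to either helper reproduces the same assignment, which may be handy when demonstrating in front of a class.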