Social media comp chapter: Difference between revisions
From CommunityData
(6 intermediate revisions by the same user not shown) | |||
Line 3: | Line 3: | ||
=== Current to-do list === | === Current to-do list === | ||
==== Data collection and preparation ==== | |||
* Get articles about online communities as well | |||
* Get Scopus articles about "online communities" as well | |||
==== Introduction ==== | ==== Introduction ==== | ||
* | * Revise once we have completed the sections and have a story to tell about our findings. | ||
* Incorporate a bit more about how the chapter attempts to point people toward great resources and exemplary work applying computational techniques. | |||
==== Data Collection and Descriptives (Jeremy) ==== | ==== Data Collection and Descriptives (Jeremy) ==== | ||
Line 18: | Line 20: | ||
** Papers per year | ** Papers per year | ||
* Finish writing up benefits/drawbacks section | * Finish writing up benefits/drawbacks section | ||
* Incorporate mentions/citations of instructional texts/resources that interested readers can use. | |||
==== Topic Models (Jeremy) ==== | ==== Topic Models (Jeremy) ==== | ||
Line 28: | Line 31: | ||
* Write benefits/limitations of approach | * Write benefits/limitations of approach | ||
==== Prediction (Aaron) ==== | ==== Citation Prediction (Aaron) ==== | ||
* Incorporate links/citations to Statistical Learning book and related instructional resources. | |||
* Prepare data for analysis | |||
** Abstract text | |||
*** Lowercase | |||
*** Remove stop-words | |||
*** Create uni-, bi-, tri- grams. | |||
*** Determine minimum threshhold of phrase occurrence for inclusion across categories/subjects (e.g., Mitra & Gilbert say 50 in their dataset). | |||
** Control measures | |||
*** n.authors | |||
*** publication year | |||
*** publication type (conference? journal?) | |||
*** agg prior citations for authors (sqrt? log?) | |||
*** author affiliation (fixed effects? maybe just dummy for R1?) | |||
*** venue (fixed effects) | |||
*** subject area (fixed effects — various measures available) | |||
*** affiliation countries | |||
*** language | |||
==== Citation Networks (Mako) ==== | ==== Citation Networks (Mako) ==== | ||
* Mako will complete detailed outline of his section in .tex file. | * Mako will complete detailed outline of his section in .tex file. | ||
==== Conclusion ==== | |||
* Expand draft conclusion as we go. Revise upon completion. |
Latest revision as of 17:09, 29 April 2016
A book chapter that Jeremy, Mako, and Aaron are working on.
Current to-do list[edit]
Data collection and preparation[edit]
- Get Scopus articles about "online communities" as well
Introduction[edit]
- Revise once we have completed the sections and have a story to tell about our findings.
- Incorporate a bit more about how the chapter attempts to point people toward great resources and exemplary work applying computational techniques.
Data Collection and Descriptives (Jeremy)[edit]
- Write code to produce descriptive statistics
- Citation Counts
- Top Journals
- Top Countries
- Papers per year
- Finish writing up benefits/drawbacks section
- Incorporate mentions/citations of instructional texts/resources that interested readers can use.
Topic Models (Jeremy)[edit]
- Write code to produce:
- Overall topic distribution
- Topic distribution over time
- Topic distribution in top journals?
- Write discussion of findings
- Write benefits/limitations of approach
Citation Prediction (Aaron)[edit]
- Incorporate links/citations to Statistical Learning book and related instructional resources.
- Prepare data for analysis
- Abstract text
- Lowercase
- Remove stop-words
- Create uni-, bi-, tri- grams.
- Determine minimum threshhold of phrase occurrence for inclusion across categories/subjects (e.g., Mitra & Gilbert say 50 in their dataset).
- Control measures
- n.authors
- publication year
- publication type (conference? journal?)
- agg prior citations for authors (sqrt? log?)
- author affiliation (fixed effects? maybe just dummy for R1?)
- venue (fixed effects)
- subject area (fixed effects — various measures available)
- affiliation countries
- language
- Abstract text
Citation Networks (Mako)[edit]
- Mako will complete detailed outline of his section in .tex file.
Conclusion[edit]
- Expand draft conclusion as we go. Revise upon completion.