CommunityData:Wikia rises and declines tasks: Difference between revisions

From CommunityData
Line 35: Line 35:
### total wiki length '''Done'''
### total wiki length '''Done'''
### Wiki age '''Done'''
### Wiki age '''Done'''
## Adding Wikipedia Data
### Download wikipedia dumps '''Done'''
### Download wikipedia userroles data '''Done'''
### Merge large wikipedia dumps '''Done'''
### Integrate wikipedia and wikia data '''in progress'''


= Reading =
= Reading =

Revision as of 16:29, 17 July 2017

Build Dataset

  1. Collect list of 2010 wikis with wikiq dumps Done
  2. Scrape bot data for wikis Done and add to tables In progress
  3. build dataset with variables
    1. newcomer 1st edit session
      1. is reverted Done
      2. is reverted and messaged Done
      3. is reverted and messaged on article talk Done
      4. is messaged Done
      5. number of edits on wikia overall Done
      6. number of edits on wiki Done
      7. has edit other wikia wikis Done
      8. Survives (makes an edit 2-6 months after first session) Done
    1. bots
      1. ask mako for script Done
      2. tool reverts Done
      3. change in tool reverts Done
    1. Wiki level rules
      1. number of namespace 4 editors Done
      2. number of namespace 4 edits Done
      3. change in namespace 4 page length Done
      4. age of namespace 4 editors Done
      5. change in newcomer revert rate Done
      6. change in newcomer revert rate without talk page discussion Done
      7. change in newcomer revert rate without any message Done
      8. newcomer survival rate Done
    1. Wiki level controls
      1. Active editors Done
      2. edits per time Done
      3. newcomer edits per time Done
      4. number of articles
      5. total wiki length Done
      6. Wiki age Done
    1. Adding Wikipedia Data
      1. Download wikipedia dumps Done
      2. Download wikipedia userroles data Done
      3. Merge large wikipedia dumps Done
      4. Integrate wikipedia and wikia data in progress

Reading

  1. Read Geiger 2012

Models

  1. Newcomer retention
  2. rate of newcomer revert
  3. rate of rule making
  4. rate of tool assisted revert
  5. rate of newcomer messaging (following revert)