CommunityData:Wikia rises and declines tasks

Issues

Namespace 4 is a very coarse measure for the amount of governance activity.

Option 0
Treat all of namespace 4 equally for all wikis (minimal assumptions, maximally generalizable, more unknown measurement error, minimal work).
Option 1
Treat WP:ANI separately for English Wikipedia (easily done, but opens a can of worms).
Option 2
Use translation to painstakingly identify the equivalent of WP:ANI in other language Wikipedias.
Option 3
Do a different project that uses templates and patterns of use to identify organizational routines on different language editions of WP.

Go with Option 0 for now, since it treats all wikis the same without beginning a new engineering project.
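
A minimal sketch of what Option 0 amounts to, assuming revisions are available as records with hypothetical `wiki`, `namespace`, and `editor` fields (these names are illustrative and not taken from the actual wikiq output). Every namespace 4 (Project namespace) revision counts equally, with no special handling for pages such as WP:ANI:

```python
from collections import defaultdict

PROJECT_NAMESPACE = 4  # MediaWiki "Project" namespace


def namespace4_activity(revisions):
    """Return {wiki: (n_edits, n_editors)} over namespace 4 revisions.

    `revisions` is an iterable of dicts with the hypothetical keys
    "wiki", "namespace", and "editor".
    """
    edits = defaultdict(int)
    editors = defaultdict(set)
    for rev in revisions:
        if rev["namespace"] != PROJECT_NAMESPACE:
            continue  # Option 0: ignore everything outside namespace 4
        edits[rev["wiki"]] += 1
        editors[rev["wiki"]].add(rev["editor"])
    return {wiki: (edits[wiki], len(editors[wiki])) for wiki in edits}
```

Because the same rule is applied to every wiki, no per-language page lists are needed; the cost is that unrelated project pages are counted as governance activity.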

Build Dataset

  1. Collect list of 2010 wikis with wikiq dumps Done
  2. Scrape bot data for wikis Done; add to tables In progress
  3. build dataset with variables
    1. newcomer 1st edit session
      1. is reverted Done
      2. is reverted and messaged Done
      3. is reverted and messaged on article talk Done
      4. is messaged Done
      5. number of edits on Wikia overall Done
      6. number of edits on wiki Done
      7. has edited other Wikia wikis Done
      8. Survives (makes an edit 2-6 months after first session) Done
    2. bots
      1. ask Mako for script Done
      2. tool reverts Done
      3. change in tool reverts Done
    3. Wiki level rules
      1. number of namespace 4 editors Done
      2. number of namespace 4 edits Done
      3. change in namespace 4 page length Done
      4. age of namespace 4 editors Done
      5. change in newcomer revert rate Done
      6. change in newcomer revert rate without talk page discussion Done
      7. change in newcomer revert rate without any message Done
      8. newcomer survival rate Done
    4. Wiki level controls
      1. Active editors Done
      2. edits per time Done
      3. newcomer edits per time Done
      4. number of articles
      5. total wiki length Done
      6. Wiki age Done
    5. Adding Wikipedia Data
      1. Download Wikipedia dumps Done
      2. Download Wikipedia userroles data Done
      3. Merge large Wikipedia dumps Done
      4. Integrate Wikipedia and Wikia data In progress
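
The "Survives" variable and the wiki-level newcomer survival rate listed above could be computed roughly as follows. This is a sketch under stated assumptions: a month is approximated as 30 days, the 2-6 month window is measured from the end of the first edit session and is inclusive at both ends, and the function and parameter names are hypothetical rather than taken from the project's code.

```python
from datetime import datetime, timedelta

DAYS_PER_MONTH = 30  # assumption: months approximated as 30 days


def survives(first_session_end, edit_times, start_months=2, end_months=6):
    """True if any edit falls 2-6 months after the first session ended."""
    window_start = first_session_end + timedelta(days=start_months * DAYS_PER_MONTH)
    window_end = first_session_end + timedelta(days=end_months * DAYS_PER_MONTH)
    return any(window_start <= t <= window_end for t in edit_times)


def survival_rate(newcomers):
    """Share of newcomers who survive; None if there are no newcomers.

    `newcomers` is an iterable of (first_session_end, edit_times) pairs.
    """
    newcomers = list(newcomers)
    if not newcomers:
        return None
    survivors = sum(survives(end, edits) for end, edits in newcomers)
    return survivors / len(newcomers)
```

For example, a newcomer whose first session ended on `datetime(2010, 1, 1)` and who edited again on `datetime(2010, 4, 1)` would count as surviving, while one whose only later edit was on `datetime(2010, 1, 15)` would not.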

Reading

  1. Read Geiger 2012

Models

  1. Newcomer retention
  2. Rate of newcomer revert
  3. Rate of rule making
  4. Rate of tool-assisted revert
  5. Rate of newcomer messaging (following revert)