CommunityData:Wikia rises and declines tasks: Difference between revisions

From CommunityData
No edit summary
(Add issues section)
 
(11 intermediate revisions by the same user not shown)
Line 1: Line 1:
= Build Dataset =
= Issues =
# Collect list of 2010 wikis with wikiq dumps
== Namespace 4 is a very coarse measure for the amount of governance activity. ==
# Scrape bot data for wikis add add to tables
; Option 0 : treat all of namespace 4 equally for all wikis (minimal assumption, maximum generalizable, more unknown measurement error, minimal work).
; Option 1 : treat WP:ANI separately for English Wikipedia (easily done, opens can of worms)
; Option 2 : use translation to painstakingly identify WP:ANI in other language wikipedias.
; Option 3 : Do a different project that uses templates and patterns of use to identify organizational routines on different language editions of WP.
 
Go with Option 0 for now since it treats alll Wikis the same without beginning a new engineering project.
 
= Build Dataset =  
# Collect list of 2010 wikis with wikiq dumps '''Done'''
# Scrape bot data for wikis '''Done''' and add to tables '''In progress'''
# build dataset with variables  
# build dataset with variables  
## newcomer 1st edit session
## newcomer 1st edit session
### is reverted
### is reverted '''Done'''
### is deleted
### is reverted and messaged '''Done'''
### is reverted and messaged
### is reverted and messaged on article talk '''Done'''
### is reverted and messaged on article talk
### is messaged '''Done'''
### is messaged
### number of edits on wikia overall '''Done'''
### number of edits on wikia overall
### number of edits on wiki '''Done'''
### number of edits on wiki
### has edit other wikia wikis '''Done'''
### has edit other wikia wikis
### Survives (makes an edit 2-6 months after first session) '''Done'''


## bots
## bots
### ask mako for script(DONE)
### ask mako for script '''Done'''
### tool reverts
### tool reverts '''Done'''
### change in tool reverts
### change in tool reverts '''Done'''


## Wiki level rules
## Wiki level rules
### number of namespace 4 editors  
### number of namespace 4 editors '''Done'''
### number of namespace 4 edits
### number of namespace 4 edits '''Done'''
### change in namespace 4 page length  
### change in namespace 4 page length '''Done'''
### age of namespace 4 editors  
### age of namespace 4 editors '''Done'''
### change in newcomer revert rate
### change in newcomer revert rate '''Done'''
### change in newcomer revert rate without talk page discussion
### change in newcomer revert rate without talk page discussion '''Done'''
### change in newcomer revert rate without any message  
### change in newcomer revert rate without any message '''Done'''
### newcomer survival rate '''Done'''
 
## Wiki level controls
### Active editors '''Done'''
### edits per time '''Done'''
### newcomer edits per time '''Done'''
### number of articles
### total wiki length '''Done'''
### Wiki age '''Done'''


## Wiki level controls
## Adding Wikipedia Data
### Active editors
### Download wikipedia dumps '''Done'''
### edits per time
### Download wikipedia userroles data '''Done'''
### newcomer edits per time
### Merge large wikipedia dumps '''Done'''
### number of articles
### Integrate wikipedia and wikia data '''in progress'''
### total wiki length
### Wiki age


= Reading =
= Reading =

Latest revision as of 16:47, 17 July 2017

Issues[edit]

Namespace 4 is a very coarse measure for the amount of governance activity.[edit]

Option 0
treat all of namespace 4 equally for all wikis (minimal assumption, maximum generalizable, more unknown measurement error, minimal work).
Option 1
treat WP:ANI separately for English Wikipedia (easily done, opens can of worms)
Option 2
use translation to painstakingly identify WP:ANI in other language wikipedias.
Option 3
Do a different project that uses templates and patterns of use to identify organizational routines on different language editions of WP.

Go with Option 0 for now since it treats alll Wikis the same without beginning a new engineering project.

Build Dataset[edit]

  1. Collect list of 2010 wikis with wikiq dumps Done
  2. Scrape bot data for wikis Done and add to tables In progress
  3. build dataset with variables
    1. newcomer 1st edit session
      1. is reverted Done
      2. is reverted and messaged Done
      3. is reverted and messaged on article talk Done
      4. is messaged Done
      5. number of edits on wikia overall Done
      6. number of edits on wiki Done
      7. has edit other wikia wikis Done
      8. Survives (makes an edit 2-6 months after first session) Done
    1. bots
      1. ask mako for script Done
      2. tool reverts Done
      3. change in tool reverts Done
    1. Wiki level rules
      1. number of namespace 4 editors Done
      2. number of namespace 4 edits Done
      3. change in namespace 4 page length Done
      4. age of namespace 4 editors Done
      5. change in newcomer revert rate Done
      6. change in newcomer revert rate without talk page discussion Done
      7. change in newcomer revert rate without any message Done
      8. newcomer survival rate Done
    1. Wiki level controls
      1. Active editors Done
      2. edits per time Done
      3. newcomer edits per time Done
      4. number of articles
      5. total wiki length Done
      6. Wiki age Done
    1. Adding Wikipedia Data
      1. Download wikipedia dumps Done
      2. Download wikipedia userroles data Done
      3. Merge large wikipedia dumps Done
      4. Integrate wikipedia and wikia data in progress

Reading[edit]

  1. Read Geiger 2012

Models[edit]

  1. Newcomer retention
  2. rate of newcomer revert
  3. rate of rule making
  4. rate of tool assisted revert
  5. rate of newcomer messaging (following revert)