CommunityData:Wikia rises and declines tasks
From CommunityData
Revisions by Groceryheist (latest edit summary: Add issues section)
Latest revision as of 16:47, 17 July 2017
Issues
Namespace 4 is a very coarse measure of the amount of governance activity.
- Option 0
- Treat all of namespace 4 equally for all wikis (minimal assumptions, maximally generalizable, more unknown measurement error, minimal work).
- Option 1
- Treat WP:ANI separately for English Wikipedia (easily done, but opens a can of worms).
- Option 2
- Use translation to painstakingly identify the equivalents of WP:ANI in other language Wikipedias.
- Option 3
- Do a different project that uses templates and patterns of use to identify organizational routines on different language editions of Wikipedia.
Go with Option 0 for now, since it treats all wikis the same without beginning a new engineering project.
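Option 0 amounts to counting every namespace 4 revision the same way on every wiki. A minimal sketch of that counting, assuming revisions are available as (wiki, editor, namespace) records; the record layout and the sample rows are hypothetical and are not the actual wikiq output columns:

```python
from collections import defaultdict

# Namespace 4 is the project namespace in MediaWiki (e.g. "Wikipedia:" pages).
PROJECT_NAMESPACE = 4

def namespace4_activity(revisions):
    """Return {wiki: (namespace 4 edit count, distinct namespace 4 editor count)}."""
    edits = defaultdict(int)
    editors = defaultdict(set)
    for wiki, editor, namespace in revisions:
        # Option 0: no page-level distinctions, every namespace 4 edit counts once.
        if namespace == PROJECT_NAMESPACE:
            edits[wiki] += 1
            editors[wiki].add(editor)
    return {wiki: (edits[wiki], len(editors[wiki])) for wiki in edits}

# Hypothetical sample rows: (wiki, editor, namespace)
sample = [
    ("wikiA", "alice", 4),
    ("wikiA", "bob", 4),
    ("wikiA", "alice", 4),
    ("wikiA", "alice", 0),
    ("wikiB", "carol", 4),
]
activity = namespace4_activity(sample)
```

Because the filter only looks at the namespace number, the same function applies unchanged to any wiki, which is the point of Option 0. A wiki with no namespace 4 edits simply does not appear in the result.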
Build Dataset
- Collect list of 2010 wikis with wikiq dumps (Done)
- Scrape bot data for wikis (Done) and add to tables (In progress)
- Build dataset with variables
  - Newcomer 1st edit session
    - is reverted (Done)
    - is reverted and messaged (Done)
    - is reverted and messaged on article talk (Done)
    - is messaged (Done)
    - number of edits on Wikia overall (Done)
    - number of edits on wiki (Done)
    - has edited other Wikia wikis (Done)
    - survives (makes an edit 2-6 months after first session) (Done)
  - Bots
    - ask mako for script (Done)
    - tool reverts (Done)
    - change in tool reverts (Done)
  - Wiki level rules
    - number of namespace 4 editors (Done)
    - number of namespace 4 edits (Done)
    - change in namespace 4 page length (Done)
    - age of namespace 4 editors (Done)
    - change in newcomer revert rate (Done)
    - change in newcomer revert rate without talk page discussion (Done)
    - change in newcomer revert rate without any message (Done)
    - newcomer survival rate (Done)
  - Wiki level controls
    - active editors (Done)
    - edits per time (Done)
    - newcomer edits per time (Done)
    - number of articles
    - total wiki length (Done)
    - wiki age (Done)
  - Adding Wikipedia data
    - Download Wikipedia dumps (Done)
    - Download Wikipedia userroles data (Done)
    - Merge large Wikipedia dumps (Done)
    - Integrate Wikipedia and Wikia data (In progress)
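The survival variable in the list above (an edit 2-6 months after the first session) could be computed roughly as follows. This is only a sketch under stated assumptions: the page does not say whether the window is measured from the start or the end of the first session, nor how long a month is, so the code measures from the session end and uses 30-day months. The function and parameter names are hypothetical.

```python
from datetime import datetime, timedelta

# Assumption: a "month" is 30 days; the task list does not define it.
DAYS_PER_MONTH = 30

def survives(first_session_end, later_edit_times, start_months=2, end_months=6):
    """Return True if any later edit falls 2-6 months after the first session.

    Assumption: the window is measured from the end of the first edit session.
    """
    window_start = first_session_end + timedelta(days=start_months * DAYS_PER_MONTH)
    window_end = first_session_end + timedelta(days=end_months * DAYS_PER_MONTH)
    return any(window_start <= t <= window_end for t in later_edit_times)

# Hypothetical newcomer whose first session ended on 1 January 2010.
session_end = datetime(2010, 1, 1)
came_back = survives(session_end, [datetime(2010, 1, 15), datetime(2010, 4, 1)])
left_early = survives(session_end, [datetime(2010, 1, 15)])
```

Edits before the window opens (here, two weeks after the first session) do not count toward survival, so a newcomer who is active only in the first weeks is coded as not surviving.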
Reading
- Read Geiger 2012
Models
- Newcomer retention
- rate of newcomer revert
- rate of rule making
- rate of tool-assisted revert
- rate of newcomer messaging (following revert)
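Several of the quantities above are per-wiki rates. As a rough illustration, assuming each newcomer's first session has already been coded as reverted or not, a revert rate and its change between two periods might be computed like this; the function names and the choice to return None for an empty period are assumptions, not the project's actual definitions:

```python
def revert_rate(first_sessions_reverted):
    """Share of newcomer first sessions that were reverted.

    first_sessions_reverted: iterable of booleans, one per newcomer.
    Returns None when there are no newcomers in the period (assumption).
    """
    flags = list(first_sessions_reverted)
    if not flags:
        return None
    return sum(flags) / len(flags)

def change_in_rate(previous_rate, current_rate):
    """Difference between two period rates, or None if either is undefined."""
    if previous_rate is None or current_rate is None:
        return None
    return current_rate - previous_rate

# Hypothetical data for one wiki in two consecutive periods.
earlier = revert_rate([True, False, False, False])  # 1 of 4 reverted
later = revert_rate([True, False, False, True])     # 2 of 4 reverted
delta = change_in_rate(earlier, later)
```

The same pattern would apply to the messaging and tool-assisted revert rates by swapping in a different boolean per newcomer or per revert.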