Editing Wiki language research
From CommunityData
The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then publish the changes below to finish undoing the edit.
Latest revision | Your text | ||
Line 3: | Line 3: | ||
== Action Items == | == Action Items == | ||
* | * write rough draft of findings and discussion | ||
=== Undergrads === | === Undergrads === | ||
'''Bennett''' | '''Bennett''' | ||
* continue hand-coding samples | * continue hand-coding samples | ||
* take notes on interesting patters (in notes document on spreadsheets | * take notes on interesting patters (in notes document on spreadsheets | ||
'''Shane''' | |||
* | * start hand-coding samples | ||
* start writing XML parser | |||
* | |||
== | === Additional Tasks === | ||
* analyze current talk edits models using marginal effects | |||
* create models using ratios | |||
== meeting logs & notes == | |||
=== 05-22-16 === | === 05-22-16 === | ||
DG: I spent some time looking at the data distributions and ran a bunch of models on the simple EN models overnight. The data for len_1 are reallllly long-tailed with very low frequencies -- this is causing the convergence issues. Below is a table of the simple model (len_1 ~ num_editors_1), run through a series of truncated data sets. The models will converge all the way up to removing the final data point out of the 4,077,819 data points we have. In other words, I was able to get convergence by dropping a single data point. Here's a quick table of the results from running the models: | DG: I spent some time looking at the data distributions and ran a bunch of models on the simple EN models overnight. The data for len_1 are reallllly long-tailed with very low frequencies -- this is causing the convergence issues. Below is a table of the simple model (len_1 ~ num_editors_1), run through a series of truncated data sets. The models will converge all the way up to removing the final data point out of the 4,077,819 data points we have. In other words, I was able to get convergence by dropping a single data point. Here's a quick table of the results from running the models: | ||
Line 140: | Line 112: | ||
== project resources & links == | == project resources & links == | ||
'''05-16-16''' | '''05-16-16''' | ||