MTurk Workshop (CASBS 2019)/Notes

From CommunityData


The MTurk platform has an enormous bias toward supporting the "requester" side of the labor market. That said, Turkers are organized:


* [https://www.reddit.com/r/TurkerNation/ TurkerNation]
Approaches to building these vary.


=== Anatomy of MTurk request ===


<ul>
<li>(Optionally) input data that you want coded, processed, etc.</li>
<li>Tasks/HITs come in '''Batches''' which are defined as ('''Input Data CSV''' * '''Project/Design Template''')</li>
<li>Assignments (the number of workers assigned to each HIT). A specific Turker can only do a given HIT/task once.</li>
<li><p>[https://blog.mturk.com/managing-groups-of-hits-on-mechanical-turk-3194c045371f Groups of tasks]: if you republish multiple batches of the same task ''or'' different tasks with the same name/etc., they show up as the same HIT in the worker interface!</p>
<p>Be careful: this means a worker can do each HIT once per batch/group, which is similar to doing the same HIT multiple times.</p></li></ul>
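To make the batch arithmetic concrete, here is a rough sketch in Python. It assumes one HIT per row of the input CSV, and it treats the reward and the fee rate as values you supply yourself; the numbers below are placeholders for illustration, not MTurk's actual pricing. The function names are made up for this example.

```python
import csv
import io


def batch_size(input_csv_text, assignments_per_hit):
    """Return (number of HITs, total assignments) for one batch."""
    rows = list(csv.DictReader(io.StringIO(input_csv_text)))
    n_hits = len(rows)  # assumption: one HIT per data row
    return n_hits, n_hits * assignments_per_hit


def batch_cost(total_assignments, reward, fee_rate):
    """Worker rewards plus a platform fee, in dollars."""
    return round(total_assignments * reward * (1 + fee_rate), 2)


# Hypothetical input file with three rows and one column.
example_csv = (
    "image_url\n"
    "https://example.edu/a.jpg\n"
    "https://example.edu/b.jpg\n"
    "https://example.edu/c.jpg\n"
)

hits, total = batch_size(example_csv, assignments_per_hit=3)
# Placeholder reward and fee rate; check current pricing before budgeting.
cost = batch_cost(total, reward=0.10, fee_rate=0.20)
print(hits, total, cost)
```

Three rows with three assignments each means nine pieces of work to pay for, which is why the number of assignments matters as much as the number of rows when budgeting.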


=== Request workflow ===


# Define the properties and design the layout of your Project. (Create a Project)
# View a Worker’s approval rating, award bonuses, assign qualifications, or block Workers from working on your HITs. (Managing Workers)


=== Creating HITs ===


There are two ways to do this:

Although (2) is powerful, we'll focus entirely on (1) here and on two approaches that can be accomplished within (1): (A) external surveys and material, and (B) tasks created using Amazon's Crowd HTML Elements.


==== (A) External surveys ====


Sending Turkers to a survey on an external website is very popular. The challenge with involving a separate website is typically ensuring that the worker you hire/pay on MTurk is the same one who does the task on the separate website.

In either case, you can choose to merge your data from your survey website and MTurk to verify this information before approving/paying.
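One way to do that merge is sketched below in Python. It assumes that each worker pastes a completion code from the survey into the HIT, and that both systems can export CSV. The column names (''Answer.surveycode'', ''completion_code'') and the function name are assumptions for illustration; check the headers in your own exports.

```python
import csv
import io


def match_codes(mturk_csv_text, survey_csv_text,
                mturk_code_col="Answer.surveycode",
                survey_code_col="completion_code"):
    """Split MTurk assignments into (matched, unmatched) by completion code."""
    # Collect every non-empty code the survey site recorded.
    survey_codes = {
        row[survey_code_col].strip()
        for row in csv.DictReader(io.StringIO(survey_csv_text))
        if row[survey_code_col].strip()
    }
    matched, unmatched = [], []
    for row in csv.DictReader(io.StringIO(mturk_csv_text)):
        code = row[mturk_code_col].strip()
        target = matched if code in survey_codes else unmatched
        target.append(row["AssignmentId"])
    return matched, unmatched


# Hypothetical exports from MTurk and from the survey website.
mturk_export = (
    "AssignmentId,WorkerId,Answer.surveycode\n"
    "A1,W1,abc123\n"
    "A2,W2,zzz999\n"
    "A3,W3, def456 \n"
)
survey_export = (
    "response_id,completion_code\n"
    "r1,abc123\n"
    "r2,def456\n"
)

matched, unmatched = match_codes(mturk_export, survey_export)
print("matched:", matched)
print("needs a closer look:", unmatched)
```

The unmatched list is meant as a starting point for a manual check (typos in pasted codes are common), not as an automatic rejection list.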


==== (B) Crowd HTML Elements ====


The best way involves modifying a set of HTML templates on MTurk. Understanding at least some HTML is very helpful for doing this, but it is not necessary.

Here are two options:


* [https://www.w3schools.com/html/html_intro.asp W3Schools' HTML Tutorial]
* [https://developer.mozilla.org/en-US/docs/Learn/HTML/Introduction_to_HTML Mozilla's Introduction to HTML]


In many cases, modifications will be extremely minimal:


# Pick an example from the options on the website that is similar to what you want to do.
# Modify it so that ''incoming'' variables from your input file are marked like ''${variable_name}'' in the HTML. Output variables are marked in the HTML by a ''name='' attribute on the input element.
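The substitution step can be imitated locally to check a template before uploading it. The sketch below uses Python's ''string.Template'', which happens to use the same ''${variable_name}'' placeholder syntax. MTurk does the real substitution itself, so this is only a stand-in for testing. The HTML snippet, the ''image_url'' column, and the ''animal_label'' output name are made up for this example.

```python
from string import Template

# A tiny stand-in for a task template: one incoming variable
# (${image_url}) and one output variable (name="animal_label").
template_html = Template("""
<p>What animal is in this picture?</p>
<img src="${image_url}" alt="picture to label">
<input type="text" name="animal_label">
""")

# One row of a hypothetical input CSV, as a dict keyed by column header.
row = {"image_url": "https://example.edu/stimuli/cat01.jpg"}

# substitute() raises KeyError if the template uses a variable the row
# does not provide, which catches header typos early.
hit_html = template_html.substitute(row)
print(hit_html)
```

In the results file, the worker's answer would then be reported under the output name chosen in the template.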


I'll walk through an example on my own.
'''NOTE:''' If your task involves external media (images, sounds, etc.), you will need to host these yourself. MTurk does not allow you to upload this material. I typically host these on my university web servers. You will need to provide these in a sheet in the form of URLs.
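If the media files all live under one directory on your server, the sheet of URLs can be generated rather than typed. A minimal sketch in Python, assuming a hypothetical base URL and an ''image_url'' column name that matches the variable used in your template:

```python
import csv
import io
from urllib.parse import quote


def build_input_csv(base_url, filenames, column="image_url"):
    """Return CSV text with one header row and one URL per file."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([column])
    for name in filenames:
        # quote() escapes spaces and other characters unsafe in URLs.
        writer.writerow([base_url.rstrip("/") + "/" + quote(name)])
    return buffer.getvalue()


# Hypothetical hosting location and file names.
csv_text = build_input_csv(
    "https://example.edu/stimuli/",
    ["cat 01.jpg", "dog.jpg"],
)
print(csv_text)
```

It is worth opening a few of the generated URLs in a browser before publishing, since a broken link produces a HIT that workers cannot complete.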


=== Publishing HITs ===


# Publishing a batch requires providing a CSV '''input''' file that works like the input to a mail merge. The MTurk user interface provides an example version of the CSV.
# Preview the HITs.
# Publish/launch the batch!


=== Reviewing Workers' Work ===


There's an interface for reviewing work. I'll walk through the example quickly.


In general, you should ''strongly'' err on the side of paying everybody. Clean/remove data later.

The damage that can come from not paying someone who did poor work on a HIT is potentially much higher than the small amount one would save for a given HIT.


=== Ensuring quality/honesty ===


Approaches will vary heavily depending on whether there are objective answers (i.e., whether Turkers are acting as research subjects or as RAs).


* Qualifications (best)
* "Golden" questions (questions for which you know the answers)
* JavaScript web bugs and similar to track mouse movement, time on page, etc. (some of this comes built-in)
* Attention checks (seem to be discouraged?)
* Assignment of 1 task to many workers (cross-check results)
* Working with third-party labor market intermediaries w/ memory of workers across many HITs.
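As an illustration of the cross-checking idea, here is a simple majority-vote sketch in Python. It assumes you have already loaded the results as (HIT id, label) pairs, one per assignment. The agreement threshold and the function name are arbitrary choices for this example, and a majority vote is only one of several ways to reconcile answers.

```python
from collections import Counter, defaultdict


def majority_labels(assignments, min_agreement=0.5):
    """Map each HIT id to (winning label or None, share of votes)."""
    by_hit = defaultdict(list)
    for hit_id, label in assignments:
        by_hit[hit_id].append(label)

    results = {}
    for hit_id, labels in by_hit.items():
        label, count = Counter(labels).most_common(1)[0]
        share = count / len(labels)
        # Keep the label only if strictly more than min_agreement agree;
        # otherwise flag the HIT (None) for a closer look.
        results[hit_id] = (label if share > min_agreement else None, share)
    return results


# Hypothetical results: three workers on h1, two on h2.
votes = [
    ("h1", "cat"), ("h1", "cat"), ("h1", "dog"),
    ("h2", "dog"), ("h2", "cat"),
]
results = majority_labels(votes)
for hit_id, (label, share) in sorted(results.items()):
    print(hit_id, label, round(share, 2))
```

An odd number of assignments per HIT avoids ties for two-way choices. Disagreement here is a signal for data cleaning, not a reason to withhold payment.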


== Tips and tricks ==


* By default there's no way to request unique workers across batches or different tasks, but there are tricks/scripts. Here's one example: https://uniqueturker.myleott.com/
* Rather than requiring the "masters" qualification, make your qualification be a 95% approval rate and 1,000 approved assignments.
* Test things out in the [https://requester.mturk.com/developer/sandbox MTurk Requester Sandbox] first.
* '''ALWAYS PILOT!''' One great way to use MTurk is as a setting to pilot other projects!