-----------------------------------------------------------
-----  Readme.txt for ETP-gold-labels Dataset ------

This file is the Readme file for the ETP-gold-labels Dataset.  This dataset contains the crowdsource labels for ETP-gold, as annotated  on Amazon Mechanical Turk.

ETP-gold-labels Dataset is discussed in the paper:
@inproceedings{	TUD-CS-2014-0991,
	author = {Emily Jamison and Iryna Gurevych},
	title = {Needle in a Haystack: Reducing the Costs of Annotating Rare-Class Instances
in Imbalanced Datasets},
	year = {2014},
	address = {Phuket,Thailand},
	booktitle = {Proceedings of the 28th Pacific Asia Conference on Language, Information
and Computing},
	pages = {244--253},
}

Crowdsource labels are yes/canttell/no with the ETP-gold pair ID attached to the label.

Processing:
We have anonymized worker id's (but retained a 1:1 correspondance between original id's and anonymized id's).  
Also, we have redacted:
"feedback": the message we sent a worker when accepting or rejecting the HIT
"Answer.commentsname": the message the worker sent us with the HIT


Copyright: 
This dataset is released by UKP Lab, TU Darmstadt under the Creative Commons Attribution/Share-Alike License (CC-BY-SA).  UKP Lab ownership of the data originates from Amazon Mechanical Turk's Conditions of Use:
"[...] all ownership rights, including worldwide intellectual property rights, will vest with the Requester immediately upon [Turker's] performance of the Service. To the extent any such rights do not vest in Requester under applicable law, [Turker] hereby assign or exclusively grant (without the right to any compensation) all right, title and interest, including all intellectual property rights, to such work product to Requester."
https://www.mturk.com/mturk/conditionsofuse
