
Lessons Learned from a Citizen Science Project for Natural Language Processing

datacite.relation.isSupplementTo https://aclanthology.org/2023.eacl-main.261/
dc.contributor.author Klie, Jan-Christoph
dc.contributor.author Lee, Ji-Ung
dc.contributor.author Stowe, Kevin
dc.contributor.author Sahin, Gözde Gül
dc.contributor.author Moosavi, Nafise Sadat
dc.contributor.author Bates, Luke
dc.contributor.author Petrak, Dominic
dc.contributor.author Eckart de Castilho, Richard
dc.contributor.author Gurevych, Iryna
dc.date.accessioned 2023-09-08T14:20:46Z
dc.date.available 2023-09-08T14:20:46Z
dc.date.created 2023-05
dc.date.issued 2023-09-08
dc.description This is the accompanying data for our paper "Lessons Learned from a Citizen Science Project for Natural Language Processing". Many Natural Language Processing (NLP) systems use annotated corpora for training and evaluation. However, labeled data is often costly to obtain and scaling annotation projects is difficult, which is why annotation tasks are often outsourced to paid crowdworkers. Citizen Science is an alternative to crowdsourcing that is relatively unexplored in the context of NLP. To investigate whether and how well Citizen Science can be applied in this setting, we conduct an exploratory study into engaging different groups of volunteers in Citizen Science for NLP by re-annotating parts of a pre-existing crowdsourced dataset. Our results show that this can yield high-quality annotations and attract motivated volunteers, but also requires considering factors such as scalability, participation over time, and legal and ethical issues. We summarize lessons learned in the form of guidelines and provide our code and data to aid future work on Citizen Science. de_DE
dc.identifier.uri https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/3942
dc.rights.license CC-BY-NC-4.0 (https://creativecommons.org/licenses/by-nc/4.0)
dc.subject citizen science de_DE
dc.subject annotation de_DE
dc.subject nlp de_DE
dc.subject.classification 4.43-04
dc.subject.classification 4.43-05
dc.subject.ddc 004
dc.title Lessons Learned from a Citizen Science Project for Natural Language Processing de_DE
dc.type Dataset de_DE
dcterms.accessRights openAccess
person.identifier.orcid 0000-0003-0181-6450
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid 0000-0002-0332-1657
person.identifier.orcid 0000-0002-8332-307X
person.identifier.orcid 0000-0001-7715-2449
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid 0000-0003-0991-7045
person.identifier.orcid 0000-0003-2187-7621
tuda.history.classification Version=2016-2020;409-05 Interactive and Intelligent Systems, Image and Language Processing, Computer Graphics and Visualisation
tuda.unit TUDa

Files

Original bundle

Name: citizen-tudatalib.zip
Size: 76.71 MB
Format: ZIP archive