Der Login über E-Mail und Passwort wird in Kürze abgeschaltet. Für Externe steht ab sofort der Login über ORCID zur Verfügung.
The login via e-mail and password will be retired in the near future. External uses can login via ORCID from now on.
 

Turk Bootstrap Word Sense Inventory (TWSI) 2.0

datacite.relation.isReferencedBy https://www.aclweb.org/anthology/L12-1101/
dc.contributor.author Biemann, Chris
dc.date.accessioned 2021-05-17T09:24:24Z
dc.date.available 2021-05-17T09:24:24Z
dc.date.created 2010-02-01
dc.date.issued 2021-05-17
dc.description Turk Bootstrap Word Sense Inventory (TWSI) 2.0. This lexical resource, created by a crowdsourcing process using Amazon Mechanical Turk (http://www.mturk.com), encompasses a sense inventory for lexical substitution for 1,012 highly frequent English common nouns. Along with each sense, a large number of sense-annotated occurrences in context are given, as well as a weighted list of substitutions. Sense distinctions are not motivated by lexicographic considerations, but driven by substitutability: two usages belong to the same sense if their substitutions overlap considerably. After laying out the need for such a resource, the data is characterized in terms of organization and quantity. Then, we briefly describe how this data was used to create a system for lexical substitutions. Training a supervised lexical substitution system on a smaller version of the resource resulted in well over 90% acceptability for lexical substitutions provided by the system. Thus, this resource can be used to set up reliable, enabling technologies for semantic natural language processing (NLP), some of which we discuss briefly. en_US
dc.description.version 2 en_US
dc.identifier.uri https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2768
dc.language.iso en en_US
dc.rights Creative Commons Attribution Share-Alike 4.0
dc.rights.licenseother
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.subject lexical substitution en_US
dc.subject nlp
dc.subject mturk
dc.subject semantic word sense
dc.subject.classification 4.43-04
dc.subject.classification 4.43-05
dc.subject.ddc 004
dc.title Turk Bootstrap Word Sense Inventory (TWSI) 2.0 en_US
dc.type Dataset en_US
dcterms.accessRights openAccess
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
tuda.history.classification Version=2016-2020;409-05 Interaktive und intelligente Systeme, Bild- und Sprachverarbeitung, Computergraphik und Visualisierung
tuda.unit TUDa

Files

Original bundle

Now showing 1 - 1 of 1
NameDescriptionSizeFormat
TWSI2_complete.zipDataset50.43 MBZIP-Archivdateien Download