German Relatedness Datasets
Description
The datasets on this page were obtained by asking human subjects to assign a similarity or relatedness judgment to a number of German word pairs. The datasets have been used to test the performance of semantic similarity/relatedness measures.
All subjects in our experiments were native speakers of German. A judgment of 0 means “fully unsimilar/unrelated”, while a score of 4 means “fully similar/related”.
In the comma-separated dataset files, each word pair is on a single line followed by the mean judgment score and the standard deviation.
DFG subject classification
4.43-04 Künstliche Intelligenz und Maschinelle Lernverfahren4.43-05 Bild- und Sprachverarbeitung, Computergraphik und Visualisierung, Human Computer Interaction, Ubiquitous und Wearable Computing
Related Resources
- Is supplement to: DOI:10.1007/11562214_67
Collections
The following license files are associated with this item: