Context-Aware Representations for Knowledge Base Relation Extraction
Description
We provide a subcorpus of Wikipedia that was annotated with Wikidata relations using a distant supervision procedure. The corpus contains two types of annotations: entities and relations. Entity annotations were extracted from the Wikipedia linkes in the article text. Each link was converted to a Wikidata identifier using the mappings from the Wikidata itself. Additional entities were recognised using a named entity recognizer and were later linked to Wikidata. For each pair of entities in each sentence we searched for Wikidata relations that connect this pair of entities and stored all unambigious instances (only one relation is possible).
DFG subject classification
4.43-04 Künstliche Intelligenz und Maschinelle Lernverfahren4.43-05 Bild- und Sprachverarbeitung, Computergraphik und Visualisierung, Human Computer Interaction, Ubiquitous und Wearable Computing
Related Resources
- Is referenced by: DOI:10.18653/v1/D17-1188
Collections
The following license files are associated with this item: