HyperCoref Corpus Seed Pages
| datacite.relation.isSupplementTo | https://doi.org/10.18653/v1/2021.emnlp-main.38 | |
| dc.contributor.author | Bugert, Michael | |
| dc.date.accessioned | 2022-01-21T09:50:44Z | |
| dc.date.available | 2022-01-21T09:50:44Z | |
| dc.date.created | 2021 | |
| dc.date.issued | 2022-01-21 | |
| dc.description | Archive containing the seed URLs for recreating the "HyperCoref" corpus, an automatically extracted corpus of cross-document event coreference links in online news. Further details on Github: https://github.com/UKPLab/emnlp2021-hypercoref-cdcr | de_DE |
| dc.description.version | v1 | de_DE |
| dc.identifier.uri | https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/3390 | |
| dc.language.iso | en | de_DE |
| dc.rights.license | CC-BY-4.0 (https://creativecommons.org/licenses/by/4.0) | |
| dc.subject | hyperlinks | de_DE |
| dc.subject | coreference resolution | de_DE |
| dc.subject | event | de_DE |
| dc.subject | CDCR | de_DE |
| dc.subject.classification | 1.14-03 | |
| dc.subject.ddc | 400 | |
| dc.title | HyperCoref Corpus Seed Pages | de_DE |
| dc.type | Dataset | de_DE |
| dcterms.accessRights | openAccess | |
| person.identifier.orcid | #PLACEHOLDER_PARENT_METADATA_VALUE# | |
| tuda.history.classification | Version=2020-2024;104-04 Angewandte Sprachwissenschaften, Experimentelle Linguistik, Computerlinguistik | |
| tuda.unit | TUDa |
Files
Original bundle
1 - 1 of 1
| Name | Description | Size | Format | |
|---|---|---|---|---|
| hypercoref_page_index_filtered.7z | with CommonCrawl URL prefix updated after Apr 2022 | 282.35 MB | Unknown data format |
