DeuParl

dc.contributor.author Kirschner, Celina
dc.contributor.author Walter, Tobias
dc.contributor.author Eger, Steffen
dc.contributor.author Glavas, Goran
dc.contributor.author Lauscher, Anne
dc.contributor.author Ponzetto, Simone Paolo
dc.date.accessioned 2021-08-05T07:29:46Z
dc.date.available 2021-08-05T07:29:46Z
dc.date.created 2021-08-05
dc.date.issued 2021-08-05
dc.description The data is part of our JCDL paper with the title "Diachronic Analysis of German Parliamentary Proceedings: Ideological Shifts through the Lens of Political Biases". This is the raw data underlying our corpus, the German Reichs- und Bundestagsprotokolle. It was crawled from https://www.reichstagsprotokolle.de/ and https://www.bundestag.de/protokolle Code can be found here: https://github.com/umanlp/crosstemporal_bias In our revision of the data, we (i) removed XML and (ii) corrected obvious OCR errors (e.g., negation sign instead of dash in line ends). Further modifications are indicated in the accompanying paper. en_US
dc.description.version v1.0 en_US
dc.identifier.uri https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2889
dc.language.iso de en_US
dc.rights Creative Commons Attribution Share-Alike 4.0
dc.rights.licenseother
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.subject Reichstagsprotokolle, Bundestagsprotokolle, Parlamentsdebatten, Biases, Historische NLP en_US
dc.subject.classification 1.14-03
dc.subject.ddc 400
dc.title DeuParl en_US
dc.type Dataset en_US
dcterms.accessRights openAccess
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid #PLACEHOLDER_PARENT_METADATA_VALUE#
tuda.history.classification Version=2020-2024;104-04 Angewandte Sprachwissenschaften, Experimentelle Linguistik, Computerlinguistik
tuda.unit TUDa

Files

Original bundle

Now showing 1 - 2 of 2
NameDescriptionSizeFormat
reichstag_corrseg.tgz598.18 MBUnknown data format Download
BRD Protokolle-20210622T061754Z-001.zip1.95 GBZIP-Archivdateien Download