TUdatalib Upgrade

Am 2. Juni erfolgte ein TUdatalib Upgrade auf eine neue Softwareversion. Dieses Upgrade bringt wichtige Neuerungen mit sich. Eine Übersicht finden Sie in der Dokumentation
On June 2nd, TUdatalib was upgraded to a new software version. This upgrade introduced major changes to the system. Please see our documentation for an overview.

 

Darmstadt Fanfiction Corpus 1.0 (Fanfiktion.de, 2020-2023)

Abstract

Description

A corpus of mostly German-language fanfiction texts created or updated in 2020-2023 from the website Fanfiktion.de. The website was scraped every month, the monthly corpora were later merged into one. The corpus consists of four components: the texts (in .csv and .txt format), the reviews to texts updated in the selected period, the text metadata, and the user information for each author. There are 67538 texts by 22738 authors, and reviews were written on 51188 texts. The corpus allows the creation of subcorpora based on fandom, fanfiction genre, author metadata, age restriction, word or chapter count, and review or endorsement numbers. Further, the corpus can be transformed into network data using reviews as relations. ACCESS: To access to the corpus, please send a signed PDF of the Statement for the use of the Darmstadt Fanfiction Corpus in English or in German to anastasia.glawion@fau.de or thomas.weitin@tu-darmstadt.de.

Citation

Endorsement

Project(s)

Faculty

Collections

License

Except where otherwise noted, this license is described as In Copyright