Darmstadt Fanfiction Corpus 1.0 (Fanfiktion.de, 2020-2023)
Description
A corpus of mostly German-language fanfiction texts created or updated in 2020-2023 from the website Fanfiktion.de. The website was scraped every month, the monthly corpora were later merged into one. The corpus consists of four components: the texts (in .csv and .txt format), the reviews to texts updated in the selected period, the text metadata, and the user information for each author. There are 67538 texts by 22738 authors, and reviews were written on 51188 texts.
The corpus allows the creation of subcorpora based on fandom, fanfiction genre, author metadata, age restriction, word or chapter count, and review or endorsement numbers. Further, the corpus can be transformed into network data using reviews as relations.
ACCESS: To access to the corpus, please send a signed PDF of the Statement for the use of the Darmstadt Fanfiction Corpus in English or in German to anastasia.glawion@fau.de or thomas.weitin@tu-darmstadt.de.
The corpus allows the creation of subcorpora based on fandom, fanfiction genre, author metadata, age restriction, word or chapter count, and review or endorsement numbers. Further, the corpus can be transformed into network data using reviews as relations.
ACCESS: To access to the corpus, please send a signed PDF of the Statement for the use of the Darmstadt Fanfiction Corpus in English or in German to anastasia.glawion@fau.de or thomas.weitin@tu-darmstadt.de.
DFG subject classification
1.15-02 Germanistische Literatur-und Kulturwissenschaften (Neuere deutsche Literatur)URI
https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4245https://doi.org/10.48328/tudatalib-1452
Collections
The following license files are associated with this item: