Description
A demonstrator corpus containing 2077 articles from the journal Fluids (Publisher MDPI, ISSN 2311-5521) in XML-TEI format.
The corpus was created as part of the NFDI4ING S7 project that collects, for text and data mining, literature relevant for engineering sciences. This collection was performed using the infrastructure und software stack build in the "Workflow Digitale Medien" project at the University and State Library Darmstadt. JATS-XML files provided by the publisher were automatically converted to the TEI-XML files that are based on an application profile of the XML specification of the Text Encoding Initiative.
The XML-TEI files can also be retrieved via the following REST API endpoints of the eXist wdb+ system.
1. This query provides an overview of all volumes of the journal Fluids in eXist wdb+: https://exist.ulb.tu-darmstadt.de/2/r/edoc/collection/jz000014.json
2. Using the IDs of the individual volumes, the IDs of the articles contained in these volumes can be obtained with the following query. The query is an example for volume 7 with the ID jz000102: https://exist.ulb.tu-darmstadt.de/2/r/edoc/collection/jz000102.json
3. The articles can be downloaded in XML-TEI format using their IDs from the response above. The following query can be used to download the article with the ID jz000102-0010: https://exist.ulb.tu-darmstadt.de/2/g/jz000102-0010