'Your AI Text is not Mine': Redefining and Evaluating AI-generated Text Detection under Realistic Assumptions

Simple item page

dc.contributor.author	Dycke, Nils
dc.contributor.author	Sakharova, Marina
dc.contributor.author	Daheim, Nico
dc.contributor.author	Iryna, Gurevych
dc.date.accessioned	2026-06-15T14:10:12Z
dc.date.created	2026
dc.date.issued	2026-06-15
dc.description	Although it is generally agreed that AI-generated text poses a broad societal risk, there is no common understanding in the AI-generated text detection literature on what constitutes harmful use. Rather, existing datasets and approaches often define their own criteria and make their own assumptions, sometimes implicitly, and often only loosely related to real-world needs and applications. To address this gap, we here systematically define various notions of AI-generated text and their characteristics. To study these, we collect AITDNA - a new benchmark of human-machine co-constructed texts that is annotated with detailed genesis information, such as the entire edit and AI-interaction history. AITDNA is a dataset of human-AI interactions collected throughout a set of user studies. The dataset contains: 1. Full creation information for each text: raw user edits, model suggestions, user queries etc. 2. Representation of each text with respect to different notions (definitions) of AI-generated text described in the paper. Currently supported notions: - Document-level: one label per document (AI if >=50% of tokens are AI-generated) - Sentence-level: one label per sentence (AI if >=50% of tokens are AI-generated) - Token-level: one label per token - Boundary-level: divide text into N parts by finding most optimal split indices (default N = 5) - Span-level: character-level spans of same authorship (e.g. User: "GPUs are speci", AI: "alized processors",...) - Intent-based: sentence-level labels based on a pre-defined set of rules specifying allowed and forbidden types of user queries. - Content-based: sentence-level labels based on a pre-defined set of rules specifying allowed and forbidden types of model output. - Membership-based: token-level labels based on occurence of N-grams in reference human corpus (default N = 2, reference human corpus = human-only part of the dataset) The dataset is provided in the form of parquet files. A loader script is provided which can be used as: ``` from load_dataset import load_config ds = load_config(name="membership") ``` For more details, please go through the README file included.
dc.description.version	v1
dc.identifier.uri	https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/5170
dc.language.iso	en
dc.rights.license	CC-BY-SA-4.0 (https://creativecommons.org/licenses/by-sa/4.0)
dc.subject	AI-Text Detection
dc.subject.classification	4.43-04
dc.subject.ddc	004
dc.title	'Your AI Text is not Mine': Redefining and Evaluating AI-generated Text Detection under Realistic Assumptions
dc.type	Other
dcterms.accessRights	openAccess
person.identifier.orcid	#PLACEHOLDER_PARENT_METADATA_VALUE#
person.identifier.orcid	0009-0000-8966-2446
person.identifier.orcid	0009-0005-4003-0997
person.identifier.orcid	#PLACEHOLDER_PARENT_METADATA_VALUE#
tuda.agreements	true
tuda.unit	TUDa

Files

Original bundle

Now showing 1 - 1 of 1

Name	Description	Size	Format
AITDNA.zip		8.33 MB	ZIP-Archivdateien	Download

Simple item page