CORE-T: COherent REtrieval of Tables for Text-to-SQL
| dc.contributor.author | Soliman, Hassan | |
| dc.date.accessioned | 2026-01-16T20:40:20Z | |
| dc.date.created | 2026-01-16 | |
| dc.date.issued | 2026-01-16 | |
| dc.description | We present three preprocessed text-to-SQL benchmarks (BIRD, SPIDER, and MMQA). To match our open-book setting, we merged the tables from multiple databases (or from question-specific schemas, for MMQA) into a single retrieval corpus per benchmark. We provide the preprocessed data together with the corresponding SQL databases. | |
| dc.description.version | 1.0 | |
| dc.identifier.uri | https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4993 | |
| dc.language.iso | en | |
| dc.rights | CC BY-SA 4.0 | |
| dc.rights.license | other | |
| dc.rights.uri | https://creativecommons.org/licenses/by-sa/4.0/ | |
| dc.subject | Information Retrieval | |
| dc.subject | Text-to-SQL | |
| dc.subject | Multi-table Selection | |
| dc.subject.classification | 4.43-04 | |
| dc.subject.ddc | 004 | |
| dc.title | CORE-T: COherent REtrieval of Tables for Text-to-SQL | |
| dc.type | Dataset | |
| dc.type | Text | |
| dcterms.accessRights | openAccess | |
| person.identifier.orcid | 0009-0003-4574-9074 | |
| tuda.agreements | true | |
| tuda.unit | TUDa |
Files
Original bundle
| Name | Description | Size | Format |
|---|---|---|---|
| core-t-data-v1.0.zip | | 549.64 MB | ZIP archive |
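
The description above mentions merging tables from multiple databases into a single retrieval corpus per benchmark. The following is a minimal sketch of that general idea, assuming SQLite databases; it is not the authors' preprocessing code and does not reflect the actual layout of the archive. The function names (`list_tables`, `table_columns`, `build_corpus`), the `db_id.table` identifier scheme, and the entry fields are all hypothetical and chosen only for illustration.

```python
import sqlite3


def list_tables(conn):
    """Return the user-defined table names of one SQLite database, sorted."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    # Skip SQLite's internal bookkeeping tables (e.g. sqlite_sequence).
    return [name for (name,) in rows if not name.startswith("sqlite_")]


def table_columns(conn, table):
    """Return the column names of a table, in declaration order."""
    # Quote the identifier so unusual table names do not break the statement.
    quoted = '"' + table.replace('"', '""') + '"'
    # PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk).
    return [row[1] for row in conn.execute(f"PRAGMA table_info({quoted})")]


def build_corpus(databases):
    """Merge the tables of several databases into one flat retrieval corpus.

    `databases` maps a database identifier to an open sqlite3 connection.
    Each corpus entry gets an id of the (assumed) form "db_id.table" so that
    tables with the same name in different databases stay distinguishable.
    """
    corpus = []
    for db_id in sorted(databases):
        conn = databases[db_id]
        for table in list_tables(conn):
            corpus.append(
                {
                    "id": f"{db_id}.{table}",
                    "db_id": db_id,
                    "table": table,
                    "columns": table_columns(conn, table),
                }
            )
    return corpus
```

In this sketch, a retriever would index the returned entries instead of being told in advance which database a question belongs to. The database identifier is kept in each entry so that the selected tables can still be traced back to their source database when the SQL query is executed.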
