An In-Depth Benchmarking of Text-to-SQL Systems
Summary: Rigorous, multi-class Text-to-SQL benchmark covering diverse query types beyond existing datasets. Systematic evaluation of several T2SQL systems with execution time and resource usage, revealing gaps, capabilities, and open challenges in current approaches. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Orest Gkini
- 2. Theofilos Belmpas
- 3. Georgia Koutrika
- 4. Yannis Ioannidis
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,635 | A Deep Dive into Deep Learning Approaches for Text-to-SQL Systems | 2021 | SIGMOD | 6.8981006e-05 |
| 3,662 | The Dawn of Natural Language to SQL: Are We Fully Ready? | 2024 | VLDB | 6.8672143e-05 |
| 8,892 | Generation of Training Examples for Tabular Natural Language Inference | 2023 | SIGMOD | 4.4275457e-05 |
| 9,032 | Sphinteract: Resolving Ambiguities in NL2SQL Through User Interaction | 2025 | VLDB | 4.4039656e-05 |
| 9,151 | The Power of Constraints in Natural Language to SQL Translation | 2025 | VLDB | 4.3849295e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 54 | DISCOVER: Keyword Search in Relational Databases | 2002 | VLDB | 0.00066047203 |
| 206 | Constructing an Interactive Natural Language Interface for Relational Databases | 2015 | VLDB | 0.00034667032 |
| 276 | Efficient IR-Style Keyword Search over Relational Databases | 2003 | VLDB | 0.00029336949 |
| 301 | BLINKS: Ranked Keyword Searches on Graphs | 2007 | SIGMOD | 0.00028370644 |
| 535 | ATHENA: An Ontology-Driven System for Natural Language Querying over Relational Data Stores | 2016 | VLDB | 0.00020727678 |
| 1,153 | SQAK: Doing More with Keywords | 2008 | SIGMOD | 0.00013642866 |
| 1,168 | SODA: Generating SQL for Business Users | 2012 | VLDB | 0.00013541143 |
| 1,201 | SPARK: Top-k Keyword Query in Relational Databases | 2007 | SIGMOD | 0.0001334371 |
| 5,455 | Natural Language Data Management and Interfaces: Recent Development and Open Challenges | 2017 | SIGMOD | 5.4977219e-05 |
| 6,611 | MeanKS: Meaningful Keyword Search in Relational Databases with Complex Schema | 2014 | SIGMOD | 4.9950232e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,359 | Text2SQL is Not Enough: Unifying AI and Databases with TAG | 2025 | CIDR | 7.1744146e-05 |
| 10,451 | RTS+: Reliable Text to SQL | 2025 | SIGMOD | 4.1945683e-05 |
| 10 | Benchmarking Database Systems: A Systematic Approach | 1983 | VLDB | 0.0012103754 |
| 10,118 | Test Data Generation for Complex SQL Queries | 2026 | SIGMOD | 4.1945683e-05 |
| 10,249 | TACO: A Benchmark for Open-Domain Text-to-SQL with Ambiguous and Cross-Database Queries | 2026 | VLDB | 4.1945683e-05 |
| 3,635 | A Deep Dive into Deep Learning Approaches for Text-to-SQL Systems | 2021 | SIGMOD | 6.8981006e-05 |
| 10,221 | NL2SQLBench: A Modular Benchmarking Framework for LLM-Enabled NL2SQL Solutions | 2026 | VLDB | 4.1945683e-05 |
| 6,697 | The TEXTURE Benchmark: Measuring Performance of Text Queries on a Relational DBMS | 2005 | VLDB | 4.9577992e-05 |
| 984 | Natural language to SQL: Where are we today? | 2020 | VLDB | 0.00014857465 |
| 369 | Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation | 2024 | VLDB | 0.0002547515 |