NL2SQLBench: A Modular Benchmarking Framework for LLM-Enabled NL2SQL Solutions

Summary: Modular benchmark for LLM-NL2SQL, decomposing systems into schema selection, candidate generation, and query revision with fine-grained accuracy/efficiency metrics. Reveals major gaps in open-source methods, plus dataset/eval flaws (e.g., noisy gold SQL), providing a reproducible baseline for fair comparison. (summarized by gpt-5.4-mini on Apr 12 2026)

Paper ID: 14256
Venue: VLDB
Year: 2026
Pagerank: 4.1905499e-05
Overall Rank: 10,221 | 28.97%
DOI: 10.14778/3796195.3796211

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank

Outgoing Citations (Sorted by Pagerank)

Showing 13 of 13 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
366	Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation	2024	VLDB	0.00025580097
998	CodeS: Towards Building Open-source Language Models for Text-to-SQL	2024	SIGMOD	0.00014726344
2,435	ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems	2024	VLDB	8.8218963e-05
2,987	NL2SQL is a solved problem... Not!	2024	CIDR	7.77529e-05
3,666	The Dawn of Natural Language to SQL: Are We Fully Ready?	2024	VLDB	6.8606092e-05
3,862	OpenSearch-SQL: Enhancing Text-to-SQL with Dynamic Few-shot and Consistency Alignment	2025	SIGMOD	6.68436e-05
3,978	OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale	2025	VLDB	6.5662694e-05
4,908	Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL	2024	VLDB	5.835596e-05
6,762	Natural Language Querying of Complex Business Intelligence Queries	2019	SIGMOD	4.929789e-05
7,137	Automated Validating and Fixing of Text-to-SQL Translation with Execution Consistency	2025	SIGMOD	4.8165495e-05
8,365	NeurDB: On the Design and Implementation of an AI-powered Autonomous Database	2025	CIDR	4.5305127e-05
9,035	Sphinteract: Resolving Ambiguities in NL2SQL Through User Interaction	2025	VLDB	4.3997447e-05
9,250	Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models	2024	VLDB	4.3648789e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
7,351	Reliable Text-to-SQL with Adaptive Abstention	2025	SIGMOD	4.7484027e-05
10,268	OpenSQL: Data-Efficient Text-to-SQL for Open-Source LLMs via Synthesized Intermediate Supervision	2026	VLDB	4.1905499e-05
9,973	BenchPress: A Human-in-the-Loop Annotation System for Rapid Text-to-SQL Benchmark Curation	2026	CIDR	4.1905499e-05
2,435	ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems	2024	VLDB	8.8218963e-05
10,841	Natural Language to SQL: State of the Art and Open Problems	2025	VLDB	4.1905499e-05
973	Natural language to SQL: Where are we today?	2020	VLDB	0.0001488435
366	Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation	2024	VLDB	0.00025580097
2,987	NL2SQL is a solved problem... Not!	2024	CIDR	7.77529e-05
4,908	Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL	2024	VLDB	5.835596e-05
3,666	The Dawn of Natural Language to SQL: Are We Fully Ready?	2024	VLDB	6.8606092e-05