Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL
Summary: ZeroNL2SQL: fine-tuned SLMs (with novel database serialization and question-aware schema alignment) generate SQL sketches, then LLMs fill missing clauses and values. Multi-level value matching plus execution-based selection yields SOTA zero-shot NL2SQL (+5.5–20% execution accuracy). (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Ju Fan
- 2. Zihui Gu
- 3. Songyue Zhang
- 4. Yuxin Zhang
- 5. Zui Chen
- 6. Lei Cao
- 7. Guoliang Li
- 8. Samuel Madden
- 9. Xiaoyong Du
- 10. Nan Tang
Incoming Citations (Sorted by Pagerank)
Showing 15 of 15 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 369 | Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation | 2024 | VLDB | 0.0002547515 |
| 998 | CodeS: Towards Building Open-source Language Models for Text-to-SQL | 2024 | SIGMOD | 0.00014729379 |
| 1,541 | Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes | 2023 | CIDR | 0.00011456579 |
| 2,945 | Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning | 2023 | SIGMOD | 7.8377395e-05 |
| 4,212 | Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration | 2023 | SIGMOD | 6.3555142e-05 |
| 6,569 | Domain Adaptation for Deep Entity Resolution | 2022 | SIGMOD | 5.0065379e-05 |
| 11,347 | OpenTFV: An Open Domain Table-Based Fact Verification System | 2022 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,945 | Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning | 2023 | SIGMOD | 7.8377395e-05 |
| 369 | Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation | 2024 | VLDB | 0.0002547515 |
| 7,354 | Reliable Text-to-SQL with Adaptive Abstention | 2025 | SIGMOD | 4.7529612e-05 |
| 984 | Natural language to SQL: Where are we today? | 2020 | VLDB | 0.00014857465 |
| 1,732 | CatSQL: Towards Real World Natural Language to SQL Applications | 2023 | VLDB | 0.00010732004 |
| 9,151 | The Power of Constraints in Natural Language to SQL Translation | 2025 | VLDB | 4.3849295e-05 |
| 3,662 | The Dawn of Natural Language to SQL: Are We Fully Ready? | 2024 | VLDB | 6.8672143e-05 |
| 2,988 | NL2SQL is a solved problem... Not! | 2024 | CIDR | 7.7761714e-05 |
| 10,837 | Natural Language to SQL: State of the Art and Open Problems | 2025 | VLDB | 4.1945683e-05 |
| 10,221 | NL2SQLBench: A Modular Benchmarking Framework for LLM-Enabled NL2SQL Solutions | 2026 | VLDB | 4.1945683e-05 |