Back to papers
SQL-Factory: A Multi-Agent Framework for High-Quality and Large-Scale SQL Generation
Summary: SQL-Factory: a modular multi-agent pipeline splitting SQL synthesis into Generation (diverse structures via a strong LLM), Expansion (cheap scaling via a lightweight LLM), and Management (adaptive scheduling and quality control). Generates >300k diverse SQLs for <$200 API cost and boosts downstream Text-to-SQL performance.
(summarized by gpt-5-mini on Mar 13 2026)
- Paper ID
- 14319
- Venue
- VLDB
- Year
- 2026
- Pagerank
- 4.427232e-05
- Overall Rank
- 8,896 | 38.12%
- DOI
-
10.14778/3778092.3778093
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 183 |
Automatic Database Management System Tuning Through Large-scale Machine Learning |
2017 |
SIGMOD |
0.00036721403 |
| 369 |
Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation |
2024 |
VLDB |
0.0002547515 |
| 406 |
Massive Stochastic Testing of SQL |
1998 |
VLDB |
0.00024053686 |
| 514 |
An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning |
2019 |
SIGMOD |
0.0002124895 |
| 659 |
The Making of TPC-DS |
2006 |
VLDB |
0.00018500853 |
| 998 |
CodeS: Towards Building Open-source Language Models for Text-to-SQL |
2024 |
SIGMOD |
0.00014729379 |
| 1,956 |
D-Bot: Database Diagnosis System using Large Language Models |
2024 |
VLDB |
9.960627e-05 |
| 2,433 |
ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems |
2024 |
VLDB |
8.8285962e-05 |
| 2,985 |
DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems |
2021 |
VLDB |
7.7795847e-05 |
| 3,978 |
OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale |
2025 |
VLDB |
6.5725884e-05 |
| 4,417 |
Robust Query Driven Cardinality Estimation under Changing Workloads |
2023 |
VLDB |
6.2037371e-05 |
| 4,661 |
PreQR: Pre-training Representation for SQL Understanding |
2022 |
SIGMOD |
6.0137947e-05 |
| 5,033 |
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis |
2024 |
SIGMOD |
5.7486224e-05 |
| 5,371 |
LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning |
2022 |
SIGMOD |
5.5428776e-05 |
| 5,401 |
ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads |
2024 |
VLDB |
5.5285035e-05 |
| 7,221 |
Speeding Up End-to-end Query Execution via Learning-based Progressive Cardinality Estimation |
2023 |
SIGMOD |
4.797194e-05 |
| 8,892 |
Generation of Training Examples for Tabular Natural Language Inference |
2023 |
SIGMOD |
4.4275457e-05 |
| 9,352 |
Db2une: Tuning Under Pressure via Deep Learning |
2024 |
VLDB |
4.3522361e-05 |
| 9,618 |
A New Paradigm in Tuning Learned Indexes: A Reinforcement Learning Enhanced Approach |
2025 |
SIGMOD |
4.3173366e-05 |
Semantically Similar Papers