Database Paper Browser

Back to papers

SQL-Factory: A Multi-Agent Framework for High-Quality and Large-Scale SQL Generation

Summary: SQL-Factory: a modular multi-agent pipeline splitting SQL synthesis into Generation (diverse structures via a strong LLM), Expansion (cheap scaling via a lightweight LLM), and Management (adaptive scheduling and quality control). Generates >300k diverse SQLs for <$200 API cost and boosts downstream Text-to-SQL performance. (summarized by gpt-5-mini on Mar 13 2026)

Paper ID
14319
Venue
VLDB
Year
2026
Pagerank
4.427232e-05
Overall Rank
8,896 | 38.12%
DOI
10.14778/3778092.3778093

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank Citing Paper Year Venue Pagerank
10,242 SQL-Exchange: Transforming SQL Queries Across Domains 2026 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 19 of 19 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
183 Automatic Database Management System Tuning Through Large-scale Machine Learning 2017 SIGMOD 0.00036721403
369 Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation 2024 VLDB 0.0002547515
406 Massive Stochastic Testing of SQL 1998 VLDB 0.00024053686
514 An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning 2019 SIGMOD 0.0002124895
659 The Making of TPC-DS 2006 VLDB 0.00018500853
998 CodeS: Towards Building Open-source Language Models for Text-to-SQL 2024 SIGMOD 0.00014729379
1,956 D-Bot: Database Diagnosis System using Large Language Models 2024 VLDB 9.960627e-05
2,433 ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems 2024 VLDB 8.8285962e-05
2,985 DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems 2021 VLDB 7.7795847e-05
3,978 OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale 2025 VLDB 6.5725884e-05
4,417 Robust Query Driven Cardinality Estimation under Changing Workloads 2023 VLDB 6.2037371e-05
4,661 PreQR: Pre-training Representation for SQL Understanding 2022 SIGMOD 6.0137947e-05
5,033 FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis 2024 SIGMOD 5.7486224e-05
5,371 LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning 2022 SIGMOD 5.5428776e-05
5,401 ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads 2024 VLDB 5.5285035e-05
7,221 Speeding Up End-to-end Query Execution via Learning-based Progressive Cardinality Estimation 2023 SIGMOD 4.797194e-05
8,892 Generation of Training Examples for Tabular Natural Language Inference 2023 SIGMOD 4.4275457e-05
9,352 Db2une: Tuning Under Pressure via Deep Learning 2024 VLDB 4.3522361e-05
9,618 A New Paradigm in Tuning Learned Indexes: A Reinforcement Learning Enhanced Approach 2025 SIGMOD 4.3173366e-05
Previous Page 1 / 1 Next

Semantically Similar Papers