Back to papers
LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning
Summary: Constraint-aware SQL generation via RL: LearnedSQLGen steers query synthesis toward constraint satisfaction using exploration–exploitation and execution feedback. Finite-state machine enforces valid SQL; benchmarks show ~30% accuracy gain and 10–35× speedups.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 6466
- Venue
- SIGMOD
- Year
- 2022
- Pagerank
- 5.5428776e-05
- Overall Rank
- 5,371 | 62.64%
- DOI
-
10.1145/3514221.3526155
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 3,727 |
Cost-based or Learning-based? A Hybrid Query Optimizer for Query Plan Selection |
2022 |
VLDB |
6.8141709e-05 |
| 6,750 |
Breaking It Down: An In-depth Study of Index Advisors |
2024 |
VLDB |
4.9392771e-05 |
| 8,103 |
Grep: A Graph Learning Based Database Partitioning System |
2023 |
SIGMOD |
4.5852201e-05 |
| 8,636 |
WISK: A Workload-aware Learned Index for Spatial Keyword Queries |
2023 |
SIGMOD |
4.4801284e-05 |
| 8,896 |
SQL-Factory: A Multi-Agent Framework for High-Quality and Large-Scale SQL Generation |
2026 |
VLDB |
4.427232e-05 |
| 8,969 |
A Learned Query Rewrite System |
2023 |
VLDB |
4.4189226e-05 |
| 9,392 |
Demonstrating SQLBarber: Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads |
2025 |
SIGMOD |
4.3441378e-05 |
| 9,902 |
Robustness of Updatable Learning-based Index Advisors against Poisoning Attack |
2024 |
SIGMOD |
4.258022e-05 |
| 10,156 |
Divo: Learning a Stable and Effective Query Optimizer with a Diverse Workload |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,212 |
SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,707 |
PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking |
2025 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 27 of 27 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 300 |
Deep Learning for Entity Matching: A Design Space Exploration |
2018 |
SIGMOD |
0.00028441466 |
| 406 |
Massive Stochastic Testing of SQL |
1998 |
VLDB |
0.00024053686 |
| 514 |
An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning |
2019 |
SIGMOD |
0.0002124895 |
| 782 |
QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning |
2019 |
VLDB |
0.00016729063 |
| 826 |
ALEX: An Updatable Adaptive Learned Index |
2020 |
SIGMOD |
0.00016224841 |
| 1,019 |
Robust Estimation of Resource Consumption for SQL Queries using Statistical Techniques |
2012 |
VLDB |
0.00014625603 |
| 1,478 |
Learning Multi-dimensional Indexes |
2020 |
SIGMOD |
0.00011762542 |
| 1,638 |
Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation |
2022 |
VLDB |
0.00011049779 |
| 1,827 |
An Inquiry into Machine Learning-based Automatic Configuration Tuning Services on Real-World Database Management Systems |
2021 |
VLDB |
0.00010390548 |
| 2,277 |
Generating Targeted Queries for Database Testing |
2008 |
SIGMOD |
9.1241198e-05 |
| 2,606 |
Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn |
2019 |
CIDR |
8.4645832e-05 |
| 2,985 |
DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems |
2021 |
VLDB |
7.7795847e-05 |
| 3,248 |
A Learned Query Rewrite System using Monte Carlo Tree Search |
2022 |
VLDB |
7.3258782e-05 |
| 3,409 |
SQLCheck: Automated Detection and Diagnosis of SQL Anti-Patterns |
2020 |
SIGMOD |
7.1270252e-05 |
| 3,473 |
AI Meets Database: AI4DB and DB4AI |
2021 |
SIGMOD |
7.062864e-05 |
| 3,580 |
Query Performance Prediction for Concurrent Queries using Graph Embedding |
2020 |
VLDB |
6.9500996e-05 |
| 3,789 |
DIAMetrics: Benchmarking Query Engines at Scale |
2020 |
VLDB |
6.7644737e-05 |
| 4,543 |
FACE: A Normalizing Flow based Cardinality Estimator |
2022 |
VLDB |
6.1011198e-05 |
| 4,590 |
MB2: Decomposed Behavior Modeling for Self-Driving Database Management Systems |
2021 |
SIGMOD |
6.0620053e-05 |
| 4,623 |
Automated Generation of Materialized Views in Oracle |
2020 |
VLDB |
6.0411909e-05 |
| 4,644 |
A genetic approach for random testing of database systems |
2007 |
VLDB |
6.0259936e-05 |
| 5,381 |
Selective Data Acquisition in the Wild for Model Charging |
2022 |
VLDB |
5.5399508e-05 |
| 5,810 |
Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data |
2020 |
SIGMOD |
5.3178017e-05 |
| 5,861 |
Machine Learning for Databases |
2021 |
VLDB |
5.298883e-05 |
| 6,230 |
Learned Approximate Query Processing: Make it Light, Accurate and Fast |
2021 |
CIDR |
5.145989e-05 |
| 7,309 |
DBMind: A Self-Driving Platform in openGauss |
2021 |
VLDB |
4.766574e-05 |
| 7,575 |
Human-in-the-loop Outlier Detection |
2020 |
SIGMOD |
4.7068909e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 10,018 |
GenJoin: Conditional Generative Plan-to-Plan Query Optimizer that Learns from Subplan Hints |
2026 |
SIGMOD |
4.1945683e-05 |
| 3,472 |
LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency |
2025 |
VLDB |
7.0639229e-05 |
| 8,699 |
Supporting Database Constraints in Synthetic Data Generation based on Generative Adversarial Networks |
2020 |
SIGMOD |
4.465684e-05 |
| 2,156 |
SkinnerDB: Regret-Bounded Query Evaluation via Reinforcement Learning |
2018 |
VLDB |
9.4170209e-05 |
| 7,008 |
Is Your Learned Query Optimizer Behaving As You Expect? A Machine Learning Perspective |
2024 |
VLDB |
4.8643538e-05 |
| 3,449 |
Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation |
2022 |
VLDB |
7.0824319e-05 |
| 5,473 |
Facilitating SQL Query Composition and Analysis |
2020 |
SIGMOD |
5.4885366e-05 |
| 8,896 |
SQL-Factory: A Multi-Agent Framework for High-Quality and Large-Scale SQL Generation |
2026 |
VLDB |
4.427232e-05 |
| 2,291 |
Data Generation using Declarative Constraints |
2011 |
SIGMOD |
9.0926719e-05 |
| 3,658 |
Towards a Hands-Free Query Optimizer through Deep Learning |
2019 |
CIDR |
6.8704209e-05 |