Back to papers
DeepDB: Learn from Data, not from Queries!
Summary: Data-driven learned DBMS components bypass workload-driven training, enabling changes in workload or data without retraining. Empirically higher accuracy and better generalization to unseen queries than state-of-the-art learned components.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 12289
- Venue
- VLDB
- Year
- 2020
- Pagerank
- 0.00019235898
- Overall Rank
- 608 | 95.78%
- DOI
-
10.14778/3384345.3384349
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 21 of 121 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 10,018 |
GenJoin: Conditional Generative Plan-to-Plan Query Optimizer that Learns from Subplan Hints |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,096 |
NeuSO: Neural Optimizer for Subgraph Queries |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,149 |
CorrBound: Cardinality Estimation Accounting for Inter- and Intra-relation Correlations |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,174 |
IDAP++: Advancing Divergence-Aware Pruning with Joint Filter and Layer Optimization |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,197 |
Qualitative Join Discovery in Data Lakes using Examples |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,219 |
Practical Parameterized Query Optimization via Efficient Plan Reuse and List-wise Ranking |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,481 |
FAAQP: Fast and Accurate Approximate Query Processing based on Bitmap-augmented Sum-Product Network |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,565 |
Holistic query Approximation via RL Modeling |
2025 |
VLDB |
4.1945683e-05 |
| 10,590 |
ACE: A Cardinality Estimator for Set-Valued Queries |
2025 |
VLDB |
4.1945683e-05 |
| 10,619 |
Data-Agnostic Cardinality Learning from Imperfect Workloads |
2025 |
VLDB |
4.1945683e-05 |
| 10,627 |
Robust Plan Evaluation based on Approximate Probabilistic Machine Learning |
2025 |
VLDB |
4.1945683e-05 |
| 10,639 |
Cardinality Estimation for Having-Clauses |
2025 |
VLDB |
4.1945683e-05 |
| 10,699 |
The Accuracy of Cardinality Estimators: Unraveling the Evaluation Result Conundrum |
2025 |
VLDB |
4.1945683e-05 |
| 10,724 |
Privacy-Enhanced Database Synthesis for Benchmark Publishing |
2025 |
VLDB |
4.1945683e-05 |
| 10,833 |
Cardinality Estimation for Similarity Search on High-Dimensional Data Objects: The Impact of Reference Objects |
2025 |
VLDB |
4.1945683e-05 |
| 10,859 |
Graph Transformers for Query Plan Representation: Potentials and Challenges |
2025 |
VLDB |
4.1945683e-05 |
| 10,942 |
Sub-optimal Join Order Identification with L1-error |
2024 |
SIGMOD |
4.1945683e-05 |
| 11,190 |
Efficient and Effective Cardinality Estimation for Skyline Family |
2023 |
SIGMOD |
4.1945683e-05 |
| 11,194 |
A Step Toward Deep Online Aggregation |
2023 |
SIGMOD |
4.1945683e-05 |
| 11,427 |
Accelerating Complex Analytics using Speculation |
2021 |
CIDR |
4.1945683e-05 |
| 11,453 |
XLJoins |
2021 |
SIGMOD |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 25 of 25 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 71 |
How Good Are Query Optimizers, Really? |
2016 |
VLDB |
0.00059038975 |
| 102 |
The Case for Learned Index Structures |
2018 |
SIGMOD |
0.00049545203 |
| 204 |
Learned Cardinalities: Estimating Correlated Joins with Deep Learning |
2019 |
CIDR |
0.00034784455 |
| 333 |
Neo: A Learned Query Optimizer |
2019 |
VLDB |
0.00027206884 |
| 372 |
Selectivity Estimation using Probabilistic Models |
2001 |
SIGMOD |
0.00025354779 |
| 405 |
Approximate Query Processing Using Wavelets |
2000 |
VLDB |
0.00024057494 |
| 758 |
Deep Unsupervised Cardinality Estimation |
2020 |
VLDB |
0.0001706608 |
| 806 |
An End-to-End Learning-based Cost Estimator |
2020 |
VLDB |
0.00016434274 |
| 943 |
Wander Join: Online Aggregation via Random Walks |
2016 |
SIGMOD |
0.00015145883 |
| 980 |
BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models |
2008 |
VLDB |
0.00014879747 |
| 1,105 |
Cardinality Estimation Done Right: Index-Based Join Sampling |
2017 |
CIDR |
0.00013990395 |
| 1,204 |
VerdictDB: Universalizing Approximate Query Processing |
2018 |
SIGMOD |
0.00013319541 |
| 1,254 |
Selectivity Estimation for Range Predicates using Lightweight Models |
2019 |
VLDB |
0.00013027411 |
| 1,569 |
Querying Continuous Functions in a Database System |
2008 |
SIGMOD |
0.0001132337 |
| 2,083 |
Towards a Learning Optimizer for Shared Clouds |
2019 |
VLDB |
9.5834572e-05 |
| 2,129 |
IDEBench: A Benchmark for Interactive Data Exploration |
2020 |
SIGMOD |
9.480002e-05 |
| 2,501 |
DBEst: Revisiting Approximate Query Processing Engines with Machine Learning Models |
2019 |
SIGMOD |
8.6453446e-05 |
| 2,588 |
Database Learning: Toward a Database that Becomes Smarter Every Time |
2017 |
SIGMOD |
8.4909562e-05 |
| 2,669 |
A Black-Box Approach to Query Cardinality Estimation |
2007 |
CIDR |
8.3389856e-05 |
| 2,841 |
Selectivity Estimation in Extensible Databases - A Neural Network Approach |
1998 |
VLDB |
8.0287389e-05 |
| 2,865 |
Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations |
2019 |
SIGMOD |
7.9862595e-05 |
| 4,164 |
SlimShot: In-Database Probabilistic Inference for Knowledge Bases |
2016 |
VLDB |
6.3923099e-05 |
| 5,266 |
Probabilistic Databases with MarkoViews |
2012 |
VLDB |
5.5972559e-05 |
| 7,434 |
Local Structure and Determinism in Probabilistic Databases |
2012 |
SIGMOD |
4.7314358e-05 |
| 8,067 |
Learning Statistical Models from Relational Data |
2011 |
SIGMOD |
4.5937196e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 5,637 |
Database Workload Characterization with Query Plan Encoders |
2022 |
VLDB |
5.3979505e-05 |
| 6,230 |
Learned Approximate Query Processing: Make it Light, Accurate and Fast |
2021 |
CIDR |
5.145989e-05 |
| 5,473 |
Facilitating SQL Query Composition and Analysis |
2020 |
SIGMOD |
5.4885366e-05 |
| 13,229 |
Using Deep Learning Models to Replace Large Materialized Views in Relational Database |
2021 |
CIDR |
- |
| 2,364 |
Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries |
2020 |
SIGMOD |
8.9554751e-05 |
| 9,120 |
Deep Query Optimization |
2019 |
SIGMOD |
4.392741e-05 |
| 9,852 |
Machine Unlearning in Learned Databases: An Experimental Analysis |
2024 |
SIGMOD |
4.2714575e-05 |
| 884 |
Plan-Structured Deep Neural Network Models for Query Performance Prediction |
2019 |
VLDB |
0.00015654004 |
| 3,658 |
Towards a Hands-Free Query Optimizer through Deep Learning |
2019 |
CIDR |
6.8704209e-05 |
| 9,892 |
DBMS Fitting: Why should we learn what we already know? |
2020 |
CIDR |
4.261445e-05 |