DeepDB: Learn from Data, not from Queries!

Summary: Data-driven learned DBMS components bypass workload-driven training, enabling changes in workload or data without retraining. Empirically higher accuracy and better generalization to unseen queries than state-of-the-art learned components. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 12289
Venue: VLDB
Year: 2020
Pagerank: 0.00019235898
Overall Rank: 608 | 95.78%
DOI: 10.14778/3384345.3384349

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 21 of 121 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
10,018	GenJoin: Conditional Generative Plan-to-Plan Query Optimizer that Learns from Subplan Hints	2026	SIGMOD	4.1945683e-05
10,096	NeuSO: Neural Optimizer for Subgraph Queries	2026	SIGMOD	4.1945683e-05
10,149	CorrBound: Cardinality Estimation Accounting for Inter- and Intra-relation Correlations	2026	SIGMOD	4.1945683e-05
10,174	IDAP++: Advancing Divergence-Aware Pruning with Joint Filter and Layer Optimization	2026	SIGMOD	4.1945683e-05
10,197	Qualitative Join Discovery in Data Lakes using Examples	2026	SIGMOD	4.1945683e-05
10,219	Practical Parameterized Query Optimization via Efficient Plan Reuse and List-wise Ranking	2026	SIGMOD	4.1945683e-05
10,481	FAAQP: Fast and Accurate Approximate Query Processing based on Bitmap-augmented Sum-Product Network	2025	SIGMOD	4.1945683e-05
10,565	Holistic query Approximation via RL Modeling	2025	VLDB	4.1945683e-05
10,590	ACE: A Cardinality Estimator for Set-Valued Queries	2025	VLDB	4.1945683e-05
10,619	Data-Agnostic Cardinality Learning from Imperfect Workloads	2025	VLDB	4.1945683e-05
10,627	Robust Plan Evaluation based on Approximate Probabilistic Machine Learning	2025	VLDB	4.1945683e-05
10,639	Cardinality Estimation for Having-Clauses	2025	VLDB	4.1945683e-05
10,699	The Accuracy of Cardinality Estimators: Unraveling the Evaluation Result Conundrum	2025	VLDB	4.1945683e-05
10,724	Privacy-Enhanced Database Synthesis for Benchmark Publishing	2025	VLDB	4.1945683e-05
10,833	Cardinality Estimation for Similarity Search on High-Dimensional Data Objects: The Impact of Reference Objects	2025	VLDB	4.1945683e-05
10,859	Graph Transformers for Query Plan Representation: Potentials and Challenges	2025	VLDB	4.1945683e-05
10,942	Sub-optimal Join Order Identification with L1-error	2024	SIGMOD	4.1945683e-05
11,190	Efficient and Effective Cardinality Estimation for Skyline Family	2023	SIGMOD	4.1945683e-05
11,194	A Step Toward Deep Online Aggregation	2023	SIGMOD	4.1945683e-05
11,427	Accelerating Complex Analytics using Speculation	2021	CIDR	4.1945683e-05
11,453	XLJoins	2021	SIGMOD	4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 25 of 25 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
71	How Good Are Query Optimizers, Really?	2016	VLDB	0.00059038975
102	The Case for Learned Index Structures	2018	SIGMOD	0.00049545203
204	Learned Cardinalities: Estimating Correlated Joins with Deep Learning	2019	CIDR	0.00034784455
333	Neo: A Learned Query Optimizer	2019	VLDB	0.00027206884
372	Selectivity Estimation using Probabilistic Models	2001	SIGMOD	0.00025354779
405	Approximate Query Processing Using Wavelets	2000	VLDB	0.00024057494
758	Deep Unsupervised Cardinality Estimation	2020	VLDB	0.0001706608
806	An End-to-End Learning-based Cost Estimator	2020	VLDB	0.00016434274
943	Wander Join: Online Aggregation via Random Walks	2016	SIGMOD	0.00015145883
980	BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models	2008	VLDB	0.00014879747
1,105	Cardinality Estimation Done Right: Index-Based Join Sampling	2017	CIDR	0.00013990395
1,204	VerdictDB: Universalizing Approximate Query Processing	2018	SIGMOD	0.00013319541
1,254	Selectivity Estimation for Range Predicates using Lightweight Models	2019	VLDB	0.00013027411
1,569	Querying Continuous Functions in a Database System	2008	SIGMOD	0.0001132337
2,083	Towards a Learning Optimizer for Shared Clouds	2019	VLDB	9.5834572e-05
2,129	IDEBench: A Benchmark for Interactive Data Exploration	2020	SIGMOD	9.480002e-05
2,501	DBEst: Revisiting Approximate Query Processing Engines with Machine Learning Models	2019	SIGMOD	8.6453446e-05
2,588	Database Learning: Toward a Database that Becomes Smarter Every Time	2017	SIGMOD	8.4909562e-05
2,669	A Black-Box Approach to Query Cardinality Estimation	2007	CIDR	8.3389856e-05
2,841	Selectivity Estimation in Extensible Databases - A Neural Network Approach	1998	VLDB	8.0287389e-05
2,865	Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations	2019	SIGMOD	7.9862595e-05
4,164	SlimShot: In-Database Probabilistic Inference for Knowledge Bases	2016	VLDB	6.3923099e-05
5,266	Probabilistic Databases with MarkoViews	2012	VLDB	5.5972559e-05
7,434	Local Structure and Determinism in Probabilistic Databases	2012	SIGMOD	4.7314358e-05
8,067	Learning Statistical Models from Relational Data	2011	SIGMOD	4.5937196e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
5,637	Database Workload Characterization with Query Plan Encoders	2022	VLDB	5.3979505e-05
6,230	Learned Approximate Query Processing: Make it Light, Accurate and Fast	2021	CIDR	5.145989e-05
5,473	Facilitating SQL Query Composition and Analysis	2020	SIGMOD	5.4885366e-05
13,229	Using Deep Learning Models to Replace Large Materialized Views in Relational Database	2021	CIDR	-
2,364	Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries	2020	SIGMOD	8.9554751e-05
9,120	Deep Query Optimization	2019	SIGMOD	4.392741e-05
9,852	Machine Unlearning in Learned Databases: An Experimental Analysis	2024	SIGMOD	4.2714575e-05
884	Plan-Structured Deep Neural Network Models for Query Performance Prediction	2019	VLDB	0.00015654004
3,658	Towards a Hands-Free Query Optimizer through Deep Learning	2019	CIDR	6.8704209e-05
9,892	DBMS Fitting: Why should we learn what we already know?	2020	CIDR	4.261445e-05