Database Paper Browser

A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation

Summary: UAE, a unified deep autoregressive model, learns the data distribution from both data and query signals for cardinality estimation. Differentiable progressive sampling via the Gumbel-Softmax trick makes learning from queries possible; UAE reports single-digit tail errors and improves accuracy while remaining efficient. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 6110
Venue: SIGMOD
Year: 2021
Pagerank: 6.6271553e-05
Overall Rank: 3,924 | 72.71%
DOI: 10.1145/3448016.3452830

Incoming Citations (Sorted by Pagerank)

Showing 34 of 34 citing papers.

Rank Citing Paper Year Venue Pagerank
1,638 Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation 2022 VLDB 0.00011049779
3,001 Neural Subgraph Counting with Wasserstein Estimator 2022 SIGMOD 7.7404487e-05
3,266 Learned Cardinality Estimation: An In-depth Study 2022 SIGMOD 7.3074684e-05
3,990 FactorJoin: A New Cardinality Estimation Framework for Join Queries 2023 SIGMOD 6.5581983e-05
4,417 Robust Query Driven Cardinality Estimation under Changing Workloads 2023 VLDB 6.2037371e-05
5,368 Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing 2022 VLDB 5.5457532e-05
5,401 ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads 2024 VLDB 5.5285035e-05
5,861 Machine Learning for Databases 2021 VLDB 5.298883e-05
5,942 SAM: Database Generation from Query Workloads with Supervised Autoregressive Models 2022 SIGMOD 5.2634242e-05
5,972 SafeBound: A Practical System for Generating Cardinality Bounds 2023 SIGMOD 5.2474768e-05
7,123 ASM: Harmonizing Autoregressive Model, Sampling, and Multi-dimensional Statistics Merging for Cardinality Estimation 2024 SIGMOD 4.8251036e-05
7,126 Debunking the Myth of Join Ordering: Toward Robust SQL Analytics 2025 SIGMOD 4.8232367e-05
7,221 Speeding Up End-to-end Query Execution via Learning-based Progressive Cardinality Estimation 2023 SIGMOD 4.797194e-05
7,828 Modeling Shifting Workloads for Learned Database Systems 2024 SIGMOD 4.6407986e-05
7,854 dbET: Execution Time Distribution-based Plan Selection 2023 SIGMOD 4.6350172e-05
8,220 PerfGuard: Deploying ML-for-Systems without Performance Regressions, Almost! 2021 VLDB 4.5557328e-05
8,617 A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning 2024 VLDB 4.4846425e-05
9,107 NeuroSketch: Fast and Approximate Evaluation of Range Aggregate Queries with Neural Networks 2023 SIGMOD 4.3950706e-05
9,317 Are Joins over LSM-trees Ready? Take RocksDB as an Example 2025 VLDB 4.3556432e-05
9,345 LIMAO: A Framework for Lifelong Modular Learned Query Optimization 2025 VLDB 4.3536343e-05
9,485 Spatial Query Optimization With Learning 2024 VLDB 4.3341665e-05
9,621 ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-oriented Sample Size Allocation and Data Generation 2023 VLDB 4.3167167e-05
9,691 Selectivity Estimation for Queries Containing Predicates over Set-Valued Attributes 2023 SIGMOD 4.3035354e-05
9,747 Still Asking: How Good Are Query Optimizers, Really? 2025 VLDB 4.2897489e-05
9,812 A Practical Theory of Generalization in Selectivity Learning 2025 VLDB 4.2783272e-05
9,825 Athena: An Effective Learning-based Framework for Query Optimizer Performance Improvement 2025 SIGMOD 4.2751057e-05
9,878 PRICE: A Pretrained Model for Cross-Database Cardinality Estimation 2025 VLDB 4.2656547e-05
9,960 An Elephant Under The Microscope: Analyzing The Interaction Of Optimizer Components In PostgreSQL 2025 SIGMOD 4.2294678e-05
10,197 Qualitative Join Discovery in Data Lakes using Examples 2026 SIGMOD 4.1945683e-05
10,219 Practical Parameterized Query Optimization via Efficient Plan Reuse and List-wise Ranking 2026 SIGMOD 4.1945683e-05
10,590 ACE: A Cardinality Estimator for Set-Valued Queries 2025 VLDB 4.1945683e-05
10,619 Data-Agnostic Cardinality Learning from Imperfect Workloads 2025 VLDB 4.1945683e-05
10,859 Graph Transformers for Query Plan Representation: Potentials and Challenges 2025 VLDB 4.1945683e-05
11,190 Efficient and Effective Cardinality Estimation for Skyline Family 2023 SIGMOD 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 36 of 36 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
64 Improved Histograms for Selectivity Estimation of Range Predicates 1996 SIGMOD 0.00063612837
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
92 Practical Selectivity Estimation through Adaptive Sampling 1990 SIGMOD 0.00051315959
182 LEO - DB2's LEarning Optimizer 2001 VLDB 0.00036962631
204 Learned Cardinalities: Estimating Correlated Joins with Deep Learning 2019 CIDR 0.00034784455
224 CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies 2004 SIGMOD 0.00032746205
372 Selectivity Estimation using Probabilistic Models 2001 SIGMOD 0.00025354779
512 STHoles: A Multidimensional Workload-Aware Histogram 2001 SIGMOD 0.00021380733
529 Self-tuning Histograms: Building Histograms Without Looking at Data 1999 SIGMOD 0.00020828852
608 DeepDB: Learn from Data, not from Queries! 2020 VLDB 0.00019235898
629 Preventing Bad Plans by Bounding the Impact of Cardinality Estimation Errors 2009 VLDB 0.00018942366
758 Deep Unsupervised Cardinality Estimation 2020 VLDB 0.0001706608
806 An End-to-End Learning-based Cost Estimator 2020 VLDB 0.00016434274
811 On the Relative Cost of Sampling for Join Selectivity Estimation 1994 PODS 0.00016425612
842 Independence is Good: Dependency-Based Histogram Synopses for High-Dimensional Data 2001 SIGMOD 0.00016031973
852 Dynamic Multidimensional Histograms 2002 SIGMOD 0.00015941524
897 Selectivity Estimation and Query Optimization in Large Databases with Highly Skewed Distributions of Column Values 1988 VLDB 0.00015528028
910 NeuroCard: One Cardinality Estimator for All Tables 2021 VLDB 0.00015423056
996 Approximating Multi-Dimensional Aggregate Range Queries Over Real Attributes 2000 SIGMOD 0.00014741524
1,105 Cardinality Estimation Done Right: Index-Based Join Sampling 2017 CIDR 0.00013990395
1,120 Global Optimization of Histograms 2001 SIGMOD 0.00013856211
1,254 Selectivity Estimation for Range Predicates using Lightweight Models 2019 VLDB 0.00013027411
1,369 Random Sampling over Joins Revisited 2018 SIGMOD 0.00012339777
1,547 Lightweight Graphical Models for Selectivity Estimation Without Independence Assumptions 2011 VLDB 0.00011442359
1,737 QuickSel: Quick Selectivity Learning with Mixture Models 2020 SIGMOD 0.00010720294
2,083 Towards a Learning Optimizer for Shared Clouds 2019 VLDB 9.5834572e-05
2,137 SASH: A Self-Adaptive Histogram Set for Dynamically Changing Workloads 2003 VLDB 9.4719326e-05
2,142 Pessimistic Cardinality Estimation: Tighter Upper Bounds for Intermediate Join Cardinalities 2019 SIGMOD 9.4507296e-05
2,165 Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation 2015 SIGMOD 9.389622e-05
2,291 Data Generation using Declarative Constraints 2011 SIGMOD 9.0926719e-05
2,364 Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries 2020 SIGMOD 8.9554751e-05
2,969 Estimating Join Selectivities using Bandwidth-Optimized Kernel Density Models 2017 VLDB 7.7974762e-05
3,053 Multiple Join Size Estimation by Virtual Domains (extended abstract) 1993 PODS 7.64969e-05
3,593 Graph-Based Synopses for Relational Selectivity Estimation 2006 SIGMOD 6.9385476e-05
4,517 Generating Databases for Query Workloads 2010 VLDB 6.1178732e-05
6,493 Joins on Samples: A Theoretical Guide for Practitioners 2020 VLDB 5.0424713e-05

Semantically Similar Papers