Database Paper Browser

Back to papers

Cardinality Estimation Done Right: Index-Based Join Sampling

Summary: Index-based join sampling: a main-memory cardinality estimator that uses existing indexes to sample join results and produce accurate multi-table cardinalities. Low, configurable sampling overhead substantially improves estimates and end-to-end plan quality and integrates easily into existing systems. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
313
Venue
CIDR
Year
2017
Pagerank
0.00013990395
Overall Rank
1,105 | 92.32%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 52 citing papers.

Rank Citing Paper Year Venue Pagerank
204 Learned Cardinalities: Estimating Correlated Joins with Deep Learning 2019 CIDR 0.00034784455
608 DeepDB: Learn from Data, not from Queries! 2020 VLDB 0.00019235898
758 Deep Unsupervised Cardinality Estimation 2020 VLDB 0.0001706608
806 An End-to-End Learning-based Cost Estimator 2020 VLDB 0.00016434274
910 NeuroCard: One Cardinality Estimator for All Tables 2021 VLDB 0.00015423056
1,638 Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation 2022 VLDB 0.00011049779
1,703 Are We Ready For Learned Cardinality Estimation? 2021 VLDB 0.00010836769
1,981 Improved Selectivity Estimation by Combining Knowledge from Sampling and Synopses 2018 VLDB 9.8687545e-05
2,142 Pessimistic Cardinality Estimation: Tighter Upper Bounds for Intermediate Join Cardinalities 2019 SIGMOD 9.4507296e-05
2,364 Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries 2020 SIGMOD 8.9554751e-05
2,762 FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation 2021 VLDB 8.1585394e-05
2,783 Flow-Loss: Learning Cardinality Estimates That Matter 2021 VLDB 8.1293383e-05
2,969 Estimating Join Selectivities using Bandwidth-Optimized Kernel Density Models 2017 VLDB 7.7974762e-05
3,266 Learned Cardinality Estimation: An In-depth Study 2022 SIGMOD 7.3074684e-05
3,449 Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation 2022 VLDB 7.0824319e-05
3,499 Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation 2021 VLDB 7.0376445e-05
3,511 Accurate Summary-based Cardinality Estimation Through the Lens of Cardinality Estimation Graphs 2022 VLDB 7.0254052e-05
3,702 Every Row Counts: Combining Sketches and Sampling for Accurate Group-By Result Estimates 2019 CIDR 6.8295759e-05
3,824 Correlation Sketches for Approximate Join-Correlation Queries 2021 SIGMOD 6.7260705e-05
3,885 Density-optimized Intersection-free Mapping and Matrix Multiplication for Join-Project Operations 2022 VLDB 6.6674822e-05
3,924 A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation 2021 SIGMOD 6.6271553e-05
3,954 Efficiently Approximating Selectivity Functions using Low Overhead Regression Models 2020 VLDB 6.5926838e-05
3,990 FactorJoin: A New Cardinality Estimation Framework for Join Queries 2023 SIGMOD 6.5581983e-05
4,359 Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning 2021 VLDB 6.2569955e-05
4,523 Simplicity Done Right for Join Ordering 2021 CIDR 6.1135504e-05
4,543 FACE: A Normalizing Flow based Cardinality Estimator 2022 VLDB 6.1011198e-05
4,694 Scalable Reservoir Sampling on Many-Core CPUs 2019 SIGMOD 5.9944898e-05
4,833 MNC: Structure-Exploiting Sparsity Estimation for Matrix Expressions 2019 SIGMOD 5.8916346e-05
5,880 COMPASS: Online Sketch-based Query Optimization for In-Memory Databases 2021 SIGMOD 5.2898074e-05
5,930 FASTgres: Making Learned Query Optimizer Hinting Effective 2023 VLDB 5.2682075e-05
6,493 Joins on Samples: A Theoretical Guide for Practitioners 2020 VLDB 5.0424713e-05
6,704 Combining Sampling and Synopses with Worst-Case Optimal Runtime and Quality Guarantees for Graph Pattern Cardinality Estimation 2021 SIGMOD 4.9554912e-05
6,898 Disclosure-Compliant Query Answering 2024 SIGMOD 4.8925595e-05
7,123 ASM: Harmonizing Autoregressive Model, Sampling, and Multi-dimensional Statistics Merging for Cardinality Estimation 2024 SIGMOD 4.8251036e-05
7,221 Speeding Up End-to-end Query Execution via Learning-based Progressive Cardinality Estimation 2023 SIGMOD 4.797194e-05
7,714 Identifying Insufficient Data Coverage in Databases with Multiple Relations 2020 VLDB 4.6700455e-05
7,854 dbET: Execution Time Distribution-based Plan Selection 2023 SIGMOD 4.6350172e-05
8,047 Thrifty Query Execution via Incrementability 2020 SIGMOD 4.5983505e-05
8,127 Robust Query Processing: Mission Possible 2020 VLDB 4.579056e-05
8,350 alpha to omega: The Greek Alphabet of Sampling 2020 CIDR 4.5404832e-05
9,380 Small Selectivities Matter: Lifting the Burden of Empty Samples 2021 SIGMOD 4.3461329e-05
9,662 Efficient Query Re-optimization with Judicious Subquery Selections 2023 SIGMOD 4.3097631e-05
9,878 PRICE: A Pretrained Model for Cross-Database Cardinality Estimation 2025 VLDB 4.2656547e-05
9,960 An Elephant Under The Microscope: Analyzing The Interaction Of Optimizer Components In PostgreSQL 2025 SIGMOD 4.2294678e-05
10,149 CorrBound: Cardinality Estimation Accounting for Inter- and Intra-relation Correlations 2026 SIGMOD 4.1945683e-05
10,197 Qualitative Join Discovery in Data Lakes using Examples 2026 SIGMOD 4.1945683e-05
10,859 Graph Transformers for Query Plan Representation: Potentials and Challenges 2025 VLDB 4.1945683e-05
11,056 Agile-Ant: Self-managing Distributed Cache Management for Cost Optimization of Big Data Applications 2024 VLDB 4.1945683e-05
11,084 Presto’s History-based Query Optimizer 2024 VLDB 4.1945683e-05
11,341 Juggler: Autonomous Cost Optimization and Performance Prediction of Big Data Applications 2022 SIGMOD 4.1945683e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 20 of 20 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
92 Practical Selectivity Estimation through Adaptive Sampling 1990 SIGMOD 0.00051315959
99 On the Propagation of Errors in the Size of Join Results 1991 SIGMOD 0.00050022914
115 Eddies: Continuously Adaptive Query Processing 2000 SIGMOD 0.00046221215
182 LEO - DB2's LEarning Optimizer 2001 VLDB 0.00036962631
418 Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age 2014 SIGMOD 0.00023729211
553 Bifocal Sampling for Skew-Resistant Join Size Estimation 1996 SIGMOD 0.00020272061
650 Robust Query Processing through Progressive Optimization 2004 SIGMOD 0.00018659177
1,272 Proactive Re-Optimization 2005 SIGMOD 0.00012920076
1,341 Dynamic Programming Strikes Back 2008 SIGMOD 0.00012486285
1,758 Sampling-Based Query Re-Optimization 2016 SIGMOD 0.00010655546
2,356 Consistently Estimating the Selectivity of Conjuncts of Predicates 2005 VLDB 8.9620762e-05
2,377 CS2: A New Database Synopsis for Query Estimation 2013 SIGMOD 8.9402115e-05
2,631 Plan Bouquets: Query Processing without Selectivity Estimation 2014 SIGMOD 8.4101843e-05
2,742 Cache-Efficient Aggregation: Hashing Is Sorting 2015 SIGMOD 8.1906104e-05
2,785 Counter Strike: Generic Top-Down Join Enumeration for Hypergraphs 2013 VLDB 8.1286814e-05
4,262 Efficient Processing of Window Functions in Analytical SQL Queries 2015 VLDB 6.3117226e-05
4,617 Adaptive Query Processing in the Looking Glass 2005 CIDR 6.0446738e-05
4,738 Query Simplification: Graceful Degradation for Join-Order Optimization 2009 SIGMOD 5.9600502e-05
6,874 ROX: Run-time Optimization of XQueries 2009 SIGMOD 4.8978984e-05
Previous Page 1 / 1 Next

Semantically Similar Papers