Database Paper Browser

Back to papers

Efficiently Approximating Selectivity Functions using Low Overhead Regression Models

Summary: Introduces incremental data generation with approximate labels to train low-overhead selectivity regression models. Extends to select-project-join with ranges and IN clauses, yielding 95th percentile error 10–100x lower than baselines. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12110
Venue
VLDB
Year
2020
Pagerank
6.5926838e-05
Overall Rank
3,954 | 72.50%
DOI
10.14778/3407790.3407820

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 24 of 24 citing papers.

Rank Citing Paper Year Venue Pagerank
1,703 Are We Ready For Learned Cardinality Estimation? 2021 VLDB 0.00010836769
2,783 Flow-Loss: Learning Cardinality Estimates That Matter 2021 VLDB 8.1293383e-05
2,985 DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems 2021 VLDB 7.7795847e-05
3,499 Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation 2021 VLDB 7.0376445e-05
3,750 Data Acquisition for Improving Machine Learning Models 2021 VLDB 6.7895763e-05
4,417 Robust Query Driven Cardinality Estimation under Changing Workloads 2023 VLDB 6.2037371e-05
4,434 Lightweight and Accurate Cardinality Estimation by Neural Network Gaussian Process 2022 SIGMOD 6.1929999e-05
5,645 Warper: Efficiently Adapting Learned Cardinality Estimators to Data and Workload Drifts 2022 SIGMOD 5.3923454e-05
6,040 Steering Query Optimizers: A Practical Take on Big Data Workloads 2021 SIGMOD 5.2412035e-05
6,667 Leveraging Query Logs and Machine Learning for Parametric Query Optimization 2022 VLDB 4.9688874e-05
7,221 Speeding Up End-to-end Query Execution via Learning-based Progressive Cardinality Estimation 2023 SIGMOD 4.797194e-05
7,296 Multi-Tenant Cloud Data Services: State-of-the-Art, Challenges and Opportunities 2022 SIGMOD 4.7723197e-05
7,610 Learning to be a Statistician: Learned Estimator for Number of Distinct Values 2022 VLDB 4.6965039e-05
7,854 dbET: Execution Time Distribution-based Plan Selection 2023 SIGMOD 4.6350172e-05
8,041 DISTILL: Low-Overhead Data-Driven Techniques for Filtering and Costing Indexes for Scalable Index Tuning 2022 VLDB 4.5998045e-05
8,220 PerfGuard: Deploying ML-for-Systems without Performance Regressions, Almost! 2021 VLDB 4.5557328e-05
9,006 Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems 2024 VLDB 4.4101482e-05
9,345 LIMAO: A Framework for Lifelong Modular Learned Query Optimization 2025 VLDB 4.3536343e-05
9,662 Efficient Query Re-optimization with Judicious Subquery Selections 2023 SIGMOD 4.3097631e-05
9,691 Selectivity Estimation for Queries Containing Predicates over Set-Valued Attributes 2023 SIGMOD 4.3035354e-05
9,812 A Practical Theory of Generalization in Selectivity Learning 2025 VLDB 4.2783272e-05
9,878 PRICE: A Pretrained Model for Cross-Database Cardinality Estimation 2025 VLDB 4.2656547e-05
10,293 Vodka: Rethink Benchmarking Philosophy in HTAP Systems 2026 VLDB 4.1945683e-05
10,619 Data-Agnostic Cardinality Learning from Imperfect Workloads 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 29 of 29 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1 Access Path Selection in a Relational Database Management System 1979 SIGMOD 0.0040449103
18 On Random Sampling over Joins 1999 SIGMOD 0.00092385438
204 Learned Cardinalities: Estimating Correlated Joins with Deep Learning 2019 CIDR 0.00034784455
211 Join Synopses for Approximate Query Answering 1999 SIGMOD 0.00033981214
372 Selectivity Estimation using Probabilistic Models 2001 SIGMOD 0.00025354779
512 STHoles: A Multidimensional Workload-Aware Histogram 2001 SIGMOD 0.00021380733
529 Self-tuning Histograms: Building Histograms Without Looking at Data 1999 SIGMOD 0.00020828852
608 DeepDB: Learn from Data, not from Queries! 2020 VLDB 0.00019235898
629 Preventing Bad Plans by Bounding the Impact of Cardinality Estimation Errors 2009 VLDB 0.00018942366
735 Umbra: A Disk-Based System with In-Memory Performance 2020 CIDR 0.00017452467
758 Deep Unsupervised Cardinality Estimation 2020 VLDB 0.0001706608
790 Exploiting Statistics on Query Expressions for Optimization 2002 SIGMOD 0.0001663283
1,105 Cardinality Estimation Done Right: Index-Based Join Sampling 2017 CIDR 0.00013990395
1,193 Join Size Estimation Subject to Filter Conditions 2015 VLDB 0.00013414989
1,254 Selectivity Estimation for Range Predicates using Lightweight Models 2019 VLDB 0.00013027411
1,260 Dynamic Sample Selection for Approximate Query Processing 2003 SIGMOD 0.00012993347
1,737 QuickSel: Quick Selectivity Learning with Mixture Models 2020 SIGMOD 0.00010720294
1,981 Improved Selectivity Estimation by Combining Knowledge from Sampling and Synopses 2018 VLDB 9.8687545e-05
2,083 Towards a Learning Optimizer for Shared Clouds 2019 VLDB 9.5834572e-05
2,165 Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation 2015 SIGMOD 9.389622e-05
2,254 Two-Level Sampling for Join Size Estimation 2017 SIGMOD 9.1897043e-05
2,377 CS2: A New Database Synopsis for Query Estimation 2013 SIGMOD 8.9402115e-05
2,669 A Black-Box Approach to Query Cardinality Estimation 2007 CIDR 8.3389856e-05
2,841 Selectivity Estimation in Extensible Databases - A Neural Network Approach 1998 VLDB 8.0287389e-05
3,397 Statistics on Views 2003 VLDB 7.1437062e-05
3,408 Query Optimizers: Time to Rethink the Contract? 2009 SIGMOD 7.1288167e-05
5,150 Efficient Join Synopsis Maintenance for Data Warehouse 2020 SIGMOD 5.6626586e-05
5,668 A Pay-As-You-Go Framework for Query Execution Feedback 2008 VLDB 5.3806337e-05
5,905 Exploiting Ordered Dictionaries to Efficiently Construct Histograms with Q-Error Guarantees in SAP HANA 2014 SIGMOD 5.2788785e-05
Previous Page 1 / 1 Next

Semantically Similar Papers