Back to papers
Selectivity Estimation and Query Optimization in Large Databases with Highly Skewed Distributions of Column Values
Summary: Selectivity estimation for predicates in skewed distributions (Zipf) where histograms falter, hindering plan choice. Proposes user-defined, skew-aware selectivity estimators; evaluation on a bibliographic corpus shows gains vs uniform/histogram baselines.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 7855
- Venue
- VLDB
- Year
- 1988
- Pagerank
- 0.00015528028
- Overall Rank
- 897 | 93.77%
- DOI
-
-
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 18 of 18 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 92 |
Practical Selectivity Estimation through Adaptive Sampling |
1990 |
SIGMOD |
0.00051315959 |
| 182 |
LEO - DB2's LEarning Optimizer |
2001 |
VLDB |
0.00036962631 |
| 252 |
Adaptive Selectivity Estimation Using Query Feedback |
1994 |
SIGMOD |
0.00030632263 |
| 688 |
Estimating the Size of Generalized Transitive Closures |
1989 |
VLDB |
0.00018134733 |
| 762 |
Query Size Estimation by Adaptive Sampling (Extended Abstract) |
1990 |
PODS |
0.00017036868 |
| 861 |
A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins |
1991 |
VLDB |
0.00015848554 |
| 1,020 |
An Instant and Accurate Size Estimation Method for Joins and Selection in a Retrieval-Intensive Environment |
1993 |
SIGMOD |
0.00014624893 |
| 1,737 |
QuickSel: Quick Selectivity Learning with Mixture Models |
2020 |
SIGMOD |
0.00010720294 |
| 2,356 |
Consistently Estimating the Selectivity of Conjuncts of Predicates |
2005 |
VLDB |
8.9620762e-05 |
| 2,455 |
Optimizing Boolean Expressions in Object Bases |
1992 |
VLDB |
8.7770449e-05 |
| 3,053 |
Multiple Join Size Estimation by Virtual Domains (extended abstract) |
1993 |
PODS |
7.64969e-05 |
| 3,924 |
A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation |
2021 |
SIGMOD |
6.6271553e-05 |
| 5,532 |
A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew |
2015 |
SIGMOD |
5.4548897e-05 |
| 6,741 |
DEX: Scalable Range Indexing on Disaggregated Memory |
2024 |
VLDB |
4.9432931e-05 |
| 9,790 |
Chimera: A system design of dual storage and traversal-join unified query processing for SQL/PGQ |
2025 |
VLDB |
4.2818172e-05 |
| 9,812 |
A Practical Theory of Generalization in Selectivity Learning |
2025 |
VLDB |
4.2783272e-05 |
| 10,619 |
Data-Agnostic Cardinality Learning from Imperfect Workloads |
2025 |
VLDB |
4.1945683e-05 |
| 12,941 |
Evaluating the Size of Queries on Relational Databases with non Uniform Distribution and Stochastic Dependence |
1989 |
SIGMOD |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers