Database Paper Browser


Selectivity Estimation and Query Optimization in Large Databases with Highly Skewed Distributions of Column Values

Summary: Addresses selectivity estimation for predicates over columns with highly skewed (Zipf-like) value distributions, where histogram-based estimates falter and hinder plan choice. Proposes user-defined, skew-aware selectivity estimators; an evaluation on a bibliographic corpus shows gains over uniform and histogram baselines. (summarized by gpt-5-nano on Feb 09 2026)
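As a rough illustration of why a uniform-value assumption misleads an optimizer on skewed data, the minimal sketch below compares the true selectivity of an equality predicate under a Zipf distribution with the uniform estimate 1/n. It is not the paper's estimator: the function names, the exponent s = 1.0, and the domain size of 1000 distinct values are hypothetical choices made only for this example.

```python
def zipf_frequencies(n_distinct, s=1.0):
    """Relative frequency of each value by rank (1 = most common) under Zipf's law.

    Assumption for illustration: frequency of rank r is proportional to 1 / r**s.
    """
    weights = [1.0 / (rank ** s) for rank in range(1, n_distinct + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def uniform_selectivity(n_distinct):
    """Uniform assumption: every distinct value is equally likely."""
    return 1.0 / n_distinct


def estimation_error_factor(true_sel, est_sel):
    """How many times off the estimate is, in either direction (always >= 1)."""
    return max(true_sel / est_sel, est_sel / true_sel)


if __name__ == "__main__":
    n = 1000  # hypothetical number of distinct column values
    freqs = zipf_frequencies(n, s=1.0)
    uniform = uniform_selectivity(n)
    # Compare a very common value, a moderately common one, and the rarest one.
    for rank in (1, 10, n):
        true_sel = freqs[rank - 1]
        factor = estimation_error_factor(true_sel, uniform)
        print(f"rank {rank:>4}: true={true_sel:.6f} "
              f"uniform={uniform:.6f} off by {factor:.1f}x")
```

Under these assumptions the uniform estimate understates the selectivity of the most frequent value and overstates that of the rarest one, which is the kind of error that can push an optimizer toward the wrong access path.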

Paper ID: 7855
Venue: VLDB
Year: 1988
Pagerank: 0.00015528028
Overall Rank: 897 | 93.77%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 18 of 18 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
92 | Practical Selectivity Estimation through Adaptive Sampling | 1990 | SIGMOD | 0.00051315959
182 | LEO - DB2's LEarning Optimizer | 2001 | VLDB | 0.00036962631
252 | Adaptive Selectivity Estimation Using Query Feedback | 1994 | SIGMOD | 0.00030632263
688 | Estimating the Size of Generalized Transitive Closures | 1989 | VLDB | 0.00018134733
762 | Query Size Estimation by Adaptive Sampling (Extended Abstract) | 1990 | PODS | 0.00017036868
861 | A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins | 1991 | VLDB | 0.00015848554
1,020 | An Instant and Accurate Size Estimation Method for Joins and Selection in a Retrieval-Intensive Environment | 1993 | SIGMOD | 0.00014624893
1,737 | QuickSel: Quick Selectivity Learning with Mixture Models | 2020 | SIGMOD | 0.00010720294
2,356 | Consistently Estimating the Selectivity of Conjuncts of Predicates | 2005 | VLDB | 8.9620762e-05
2,455 | Optimizing Boolean Expressions in Object Bases | 1992 | VLDB | 8.7770449e-05
3,053 | Multiple Join Size Estimation by Virtual Domains (extended abstract) | 1993 | PODS | 7.64969e-05
3,924 | A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation | 2021 | SIGMOD | 6.6271553e-05
5,532 | A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew | 2015 | SIGMOD | 5.4548897e-05
6,741 | DEX: Scalable Range Indexing on Disaggregated Memory | 2024 | VLDB | 4.9432931e-05
9,790 | Chimera: A system design of dual storage and traversal-join unified query processing for SQL/PGQ | 2025 | VLDB | 4.2818172e-05
9,812 | A Practical Theory of Generalization in Selectivity Learning | 2025 | VLDB | 4.2783272e-05
10,619 | Data-Agnostic Cardinality Learning from Imperfect Workloads | 2025 | VLDB | 4.1945683e-05
12,941 | Evaluating the Size of Queries on Relational Databases with non Uniform Distribution and Stochastic Dependence | 1989 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
1 | Access Path Selection in a Relational Database Management System | 1979 | SIGMOD | 0.0040449103
28 | Accurate Estimation Of The Number Of Tuples Satisfying A Condition | 1984 | SIGMOD | 0.00080435857
44 | The Design Of Postgres | 1986 | SIGMOD | 0.00071838587
68 | The Database Language GEM | 1983 | SIGMOD | 0.00060795269
3,079 | Extended User-Defined Indexing with Application to Textual Databases | 1988 | VLDB | 7.6037529e-05

Semantically Similar Papers