Database Paper Browser

Back to papers

Lightweight Cardinality Estimation in LSM-based Systems

Summary: Lightweight statistics for LSM stores; piggybacks on flush/merge to stay updated under high ingestion. Uses equi-width/equi-height histograms and wavelets for cardinality estimates, implemented on Apache AsterixDB with accuracy and overhead evaluation. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5481
Venue
SIGMOD
Year
2018
Pagerank
5.4539235e-05
Overall Rank
5,535 | 61.50%
DOI
10.1145/3183713.3183761

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 12 of 12 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 24 of 24 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1 Access Path Selection in a Relational Database Management System 1979 SIGMOD 0.0040449103
64 Improved Histograms for Selectivity Estimation of Range Predicates 1996 SIGMOD 0.00063612837
70 Hive - A Warehousing Solution Over a Map-Reduce Framework 2009 VLDB 0.00059533166
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
99 On the Propagation of Errors in the Size of Join Results 1991 SIGMOD 0.00050022914
126 Space-Efficient Online Computation of Quantile Summaries 2001 SIGMOD 0.00044744986
182 LEO - DB2's LEarning Optimizer 2001 VLDB 0.00036962631
222 Wavelet-Based Histograms for Selectivity Estimation 1998 SIGMOD 0.00032828302
327 Balancing Histogram Optimality and Practicality for Query Result Size Estimation 1995 SIGMOD 0.00027308479
344 Surfing Wavelets on Streams: One-Pass Summaries for Approximate Aggregate Queries 2001 VLDB 0.00026702512
367 Sequential Sampling Procedures For Query Size Estimation 1992 SIGMOD 0.00025509745
405 Approximate Query Processing Using Wavelets 2000 VLDB 0.00024057494
429 The Aqua Approximate Query Answering System 1999 SIGMOD 0.00023476494
454 An Overview of Query Optimization in Relational Systems 1998 PODS 0.00022734812
476 Impala: A Modern, Open-Source SQL Engine for Hadoop 2015 CIDR 0.00022226941
512 STHoles: A Multidimensional Workload-Aware Histogram 2001 SIGMOD 0.00021380733
529 Self-tuning Histograms: Building Histograms Without Looking at Data 1999 SIGMOD 0.00020828852
1,127 Dynamic Maintenance of Wavelet-Based Histograms 2000 VLDB 0.00013819179
1,438 AsterixDB: A Scalable, Open Source BDMS 2014 VLDB 0.00011973592
2,021 Storage Management in AsterixDB 2014 VLDB 9.7601304e-05
3,013 Cardinality Estimation Using Sample Views with Quality Assurance 2007 SIGMOD 7.7137441e-05
3,066 HAWQ: A Massively Parallel Processing SQL Engine in Hadoop 2014 SIGMOD 7.6221974e-05
5,262 SnappyData: A Hybrid Transactional Analytical Store Built On Spark 2016 SIGMOD 5.5977349e-05
7,415 Efficient and Scalable Statistics Gathering for Large Databases in Oracle 11g 2008 SIGMOD 4.7355557e-05
Previous Page 1 / 1 Next

Semantically Similar Papers