Database Paper Browser

Back to papers

Fast Incremental Maintenance of Approximate Histograms

Summary: Introduces sampling-based incremental maintenance of approximate histograms to keep estimates up-to-date with minimal overhead. A backing random sample drives updates, enabling highly accurate equi-depth and Compressed histograms with orders-of-magnitude gains over prior methods. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
8447
Venue
VLDB
Year
1997
Pagerank
0.00029656549
Overall Rank
269 | 98.14%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 46 of 46 citing papers.

Rank Citing Paper Year Venue Pagerank
126 Space-Efficient Online Computation of Quantile Summaries 2001 SIGMOD 0.00044744986
182 LEO - DB2's LEarning Optimizer 2001 VLDB 0.00036962631
184 New Sampling-Based Summary Statistics for Improving Approximate Query Answers 1998 SIGMOD 0.00036625711
211 Join Synopses for Approximate Query Answering 1999 SIGMOD 0.00033981214
308 Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports 2001 VLDB 0.00028142852
325 The History of Histograms (abridged) 2003 VLDB 0.00027378328
344 Surfing Wavelets on Streams: One-Pass Summaries for Approximate Aggregate Queries 2001 VLDB 0.00026702512
361 Histogram-Based Approximation of Set-Valued Query Answers 1999 VLDB 0.00025775749
429 The Aqua Approximate Query Answering System 1999 SIGMOD 0.00023476494
443 Random Sampling Techniques for Space Efficient Online Computation of Order Statistics of Large Datasets 1999 SIGMOD 0.00022996573
449 Approximate Query Processing: Taming the TeraBytes! A Tutorial 2001 VLDB 0.00022846068
454 An Overview of Query Optimization in Relational Systems 1998 PODS 0.00022734812
512 STHoles: A Multidimensional Workload-Aware Histogram 2001 SIGMOD 0.00021380733
529 Self-tuning Histograms: Building Histograms Without Looking at Data 1999 SIGMOD 0.00020828852
530 Random Sampling for Histogram Construction: How much is enough? 1998 SIGMOD 0.00020803682
549 Tracking Join and Self-Join Sizes in Limited Storage 1999 PODS 0.00020376603
619 On Computing Correlated Aggregates Over Continual Data Streams 2001 SIGMOD 0.00019066583
852 Dynamic Multidimensional Histograms 2002 SIGMOD 0.00015941524
865 What’s Hot and What’s Not: Tracking Most Frequent Items Dynamically 2003 PODS 0.00015808172
956 How to Summarize the Universe: Dynamic Maintenance of Quantiles 2002 VLDB 0.00015066967
967 Aqua: A Fast Decision Support System Using Approximate Query Answers 1999 VLDB 0.00014959939
1,064 Processing Complex Aggregate Queries over Data Streams 2002 SIGMOD 0.00014356481
1,127 Dynamic Maintenance of Wavelet-Based Histograms 2000 VLDB 0.00013819179
1,260 Dynamic Sample Selection for Approximate Query Processing 2003 SIGMOD 0.00012993347
1,443 Compressing SQL Workloads 2002 SIGMOD 0.00011947004
1,695 Combining Histograms and Parametric Curve Fitting for Feedback-Driven Query Result-Size Estimation 1999 VLDB 0.00010882793
1,797 Effective Use of Block-Level Sampling in Statistics Estimation 2004 SIGMOD 0.00010523169
2,282 Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Inverse Sampling 2005 VLDB 9.1073603e-05
2,377 CS2: A New Database Synopsis for Query Estimation 2013 SIGMOD 8.9402115e-05
2,665 Statistical Learning Techniques for Costing XML Queries 2005 VLDB 8.3498101e-05
2,878 Sampling Time-Based Sliding Windows in Bounded Space 2008 SIGMOD 7.9706235e-05
3,486 Holistic UDAFs at Streaming Speeds 2004 SIGMOD 7.0502199e-05
3,651 Conditional Selectivity for Statistics on Query Expressions 2004 SIGMOD 6.8768678e-05
3,719 Space efficiency in Synopsis construction algorithms 2005 VLDB 6.8204683e-05
3,878 Data Canopy: Accelerating Exploratory Statistical Analysis 2017 SIGMOD 6.6731435e-05
3,922 Pushing Data-Induced Predicates Through Joins in Big-Data Clusters 2020 VLDB 6.6291079e-05
4,146 Selectivity Estimation for Spatio-Temporal Queries to Moving Objects 2002 SIGMOD 6.4100417e-05
5,632 Bloom Histogram: Path Selectivity Estimation for XML Data with Updates 2004 VLDB 5.4014372e-05
5,879 Fast and Near–Optimal Algorithms for Approximating Distributions by Histograms 2015 PODS 5.2908101e-05
6,444 Evaluating Interactive Data Systems: Workloads, Metrics, and Guidelines 2018 SIGMOD 5.059132e-05
6,491 Robust Estimation With Sampling and Approximate Pre-Aggregation 2003 VLDB 5.0429323e-05
6,637 Approximating and Testing k-Histogram Distributions in Sub-linear Time 2012 PODS 4.9816401e-05
7,581 Synopses for Query Optimization: A Space-Complexity Perspective 2004 PODS 4.7057641e-05
9,950 Distributed Wavelet Thresholding for Maximum Error Metrics 2016 SIGMOD 4.2421586e-05
11,434 Data-Independent Space Partitionings for Summaries 2021 PODS 4.1945683e-05
11,821 Are Few Bins Enough: Testing Histogram Distributions 2016 PODS 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 4 of 4 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers