Database Paper Browser

Back to papers

New Sampling-Based Summary Statistics for Improving Approximate Query Answers

Summary: Introduces concise samples and counting samples—two sampling-based summary statistics for fast approximate answers. Demonstrates fast incremental maintenance across distributions, outperforming standard sample views in view-size efficiency; enables hot-list query speedups under continuous insertions. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
3025
Venue
SIGMOD
Year
1998
Pagerank
0.00036625711
Overall Rank
184 | 98.73%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 58 citing papers.

Rank Citing Paper Year Venue Pagerank
166 Approximate Frequency Counts over Data Streams 2002 VLDB 0.00039361552
211 Join Synopses for Approximate Query Answering 1999 SIGMOD 0.00033981214
273 Approximate Computation of Multidimensional Aggregates of Sparse Data Using Wavelets 1999 SIGMOD 0.00029390945
308 Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports 2001 VLDB 0.00028142852
344 Surfing Wavelets on Streams: One-Pass Summaries for Approximate Aggregate Queries 2001 VLDB 0.00026702512
361 Histogram-Based Approximation of Set-Valued Query Answers 1999 VLDB 0.00025775749
443 Random Sampling Techniques for Space Efficient Online Computation of Order Statistics of Large Datasets 1999 SIGMOD 0.00022996573
449 Approximate Query Processing: Taming the TeraBytes! A Tutorial 2001 VLDB 0.00022846068
467 Evaluating Probabilistic Queries over Imprecise Data 2003 SIGMOD 0.00022443768
477 Model-Driven Data Acquisition in Sensor Networks 2004 VLDB 0.00022221803
549 Tracking Join and Self-Join Sizes in Limited Storage 1999 PODS 0.00020376603
739 Congressional Samples for Approximate Answering of Group-By Queries 2000 SIGMOD 0.00017401518
745 Distributed Top-K Monitoring 2003 SIGMOD 0.00017330487
781 Spectral Bloom Filters 2003 SIGMOD 0.00016741046
848 Approximate Counts and Quantiles over Sliding Windows 2004 PODS 0.0001597308
865 What’s Hot and What’s Not: Tracking Most Frequent Items Dynamically 2003 PODS 0.00015808172
943 Wander Join: Online Aggregation via Random Walks 2016 SIGMOD 0.00015145883
967 Aqua: A Fast Decision Support System Using Approximate Query Answers 1999 VLDB 0.00014959939
1,120 Global Optimization of Histograms 2001 SIGMOD 0.00013856211
1,335 ICICLES: Self-tuning Samples for Approximate Query Answering 2000 VLDB 0.00012502131
1,464 Online Aggregation for Large MapReduce Jobs 2011 VLDB 0.00011865546
1,531 Online Dynamic Reordering for Interactive Data Processing 1999 VLDB 0.00011482597
1,695 Combining Histograms and Parametric Curve Fitting for Feedback-Driven Query Result-Size Estimation 1999 VLDB 0.00010882793
1,773 Offering a Precision-Performance Tradeoff for Aggregation Queries over Replicated Data 2000 VLDB 0.00010609478
1,909 SciBORQ: Scientific data management with Bounds On Runtime and Quality 2011 CIDR 0.00010121304
2,087 Answering Aggregation Queries in a Secure System Model 2007 VLDB 9.5732194e-05
2,118 Using Probabilistic Models for Data Management in Acquisitional Environments 2005 CIDR 9.5100739e-05
2,282 Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Inverse Sampling 2005 VLDB 9.1073603e-05
2,662 Dwarf: Shrinking the PetaCube 2002 SIGMOD 8.3532302e-05
2,759 A Simpler and More Efficient Deterministic Scheme for Finding Frequent Items over Sliding Windows 2006 PODS 8.1636123e-05
2,789 Optimal Sampling from Sliding Windows 2009 PODS 8.1249652e-05
3,051 Partial Results in Database Systems 2014 SIGMOD 7.6512591e-05
3,271 Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation 2018 SIGMOD 7.2968732e-05
3,928 Tighter Estimation using Bottom-k Sketches 2008 VLDB 6.6254568e-05
3,944 AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics 2018 SIGMOD 6.6078243e-05
4,177 Density Biased Sampling: An Improved Method for Data Mining and Clustering 2000 SIGMOD 6.3835403e-05
4,350 On Biased Reservoir Sampling in the Presence of Stream Evolution 2006 VLDB 6.2645054e-05
4,723 Exact and Approximate Aggregation in Constraint Query Languages 1999 PODS 5.9714196e-05
4,955 Estimating arbitrary subset sums with few probes 2005 PODS 5.8053317e-05
5,579 XWAVE: Optimal and Approximate Extended Wavelets for Streaming Data 2004 VLDB 5.4245689e-05
5,796 Finding Frequent Items in Probabilistic Data 2008 SIGMOD 5.3240234e-05
6,190 Maintaining Bernoulli Samples over Evolving Multisets 2007 PODS 5.1645517e-05
6,286 A Dip in the Reservoir: Maintaining Sample Synopses of Evolving Datasets 2006 VLDB 5.1280225e-05
6,491 Robust Estimation With Sampling and Approximate Pre-Aggregation 2003 VLDB 5.0429323e-05
6,838 Capturing Data Uncertainty in High-Volume Stream Processing 2009 CIDR 4.9109732e-05
7,351 Distributed Outlier Detection using Compressive Sensing 2015 SIGMOD 4.7545562e-05
7,547 Sketching Unaggregated Data Streams for Subpopulation-Size Queries 2007 PODS 4.7144329e-05
8,240 Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters 2019 VLDB 4.5522563e-05
8,470 Sampling Big Ideas in Query Optimization 2023 PODS 4.5038423e-05
8,474 Adaptive Index Structures 2002 VLDB 4.5029015e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers