New Sampling-Based Summary Statistics for Improving Approximate Query Answers
Summary: Introduces concise samples and counting samples, two sampling-based summary statistics for fast approximate query answers. Shows that both can be maintained incrementally and quickly across data distributions, that they use a given view size more efficiently than a standard sample, and that they speed up hot-list queries under continuous insertions. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 8 of 58 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,792 | Database Optimization for the Cloud: Where Costs, Partial Results, and Consumer Choice Meet | 2015 | CIDR | 4.4506724e-05 |
| 9,950 | Distributed Wavelet Thresholding for Maximum Error Metrics | 2016 | SIGMOD | 4.2421586e-05 |
| 10,353 | Perfect Sampling in Turnstile Streams Beyond Small Moments | 2025 | PODS | 4.1945683e-05 |
| 10,586 | GREAT: Generalized Reservoir Sampling based Triangle Counting Estimation over Streaming Graphs | 2025 | VLDB | 4.1945683e-05 |
| 10,927 | Computing A Well-Representative Summary of Conjunctive Query Results | 2024 | PODS | 4.1945683e-05 |
| 11,320 | Truly Perfect Samplers for Data Streams and Sliding Windows | 2022 | PODS | 4.1945683e-05 |
| 11,897 | Capturing the Laws of (Data) Nature | 2015 | CIDR | 4.1945683e-05 |
| 12,344 | Composable, Scalable, and Accurate Weight Summarization of Unaggregated Data Sets | 2009 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 14 | Online Aggregation | 1997 | SIGMOD | 0.0010801504 |
| 59 | Sampling-Based Estimation of the Number of Distinct Values of an Attribute | 1995 | VLDB | 0.00064501896 |
| 64 | Improved Histograms for Selectivity Estimation of Range Predicates | 1996 | SIGMOD | 0.00063612837 |
| 269 | Fast Incremental Maintenance of Approximate Histograms | 1997 | VLDB | 0.00029656549 |
| 327 | Balancing Histogram Optimality and Practicality for Query Result Size Estimation | 1995 | SIGMOD | 0.00027308479 |
| 357 | Random Sampling from B+ trees | 1989 | VLDB | 0.00026020098 |
| 523 | Recovering Information from Summary Data | 1997 | VLDB | 0.00021089782 |
| 657 | Dynamic Itemset Counting and Implication Rules for Market Basket Data | 1997 | SIGMOD | 0.00018553891 |
| 808 | Universality of Serial Histograms | 1993 | VLDB | 0.00016432772 |
| 3,966 | Random Sampling from Pseudo-Ranked B+ Trees | 1992 | VLDB | 6.580483e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 39 | Statistical Estimators for Relational Algebra Expressions | 1988 | PODS | 0.00074745564 |
| 211 | Join Synopses for Approximate Query Answering | 1999 | SIGMOD | 0.00033981214 |
| 46 | Simple Random Sampling from Relational Databases | 1986 | VLDB | 0.00070894702 |
| 2,808 | A Robust, Optimization-Based Approach for Approximate Answering of Aggregate Queries | 2001 | SIGMOD | 8.0870741e-05 |
| 92 | Practical Selectivity Estimation through Adaptive Sampling | 1990 | SIGMOD | 0.00051315959 |
| 361 | Histogram-Based Approximation of Set-Valued Query Answers | 1999 | VLDB | 0.00025775749 |
| 2,580 | Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee | 2016 | SIGMOD | 8.5058814e-05 |
| 8,240 | Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters | 2019 | VLDB | 4.5522563e-05 |
| 8,605 | Structure-Aware Sampling: Flexible and Accurate Summarization | 2011 | VLDB | 4.4865144e-05 |
| 269 | Fast Incremental Maintenance of Approximate Histograms | 1997 | VLDB | 0.00029656549 |