Effective Use of Block-Level Sampling in Statistics Estimation
Summary: Block-level sampling is efficient but error-prone for statistics. Proposes a two-phase adaptive histogram algorithm using a phase-1 sample and a subset-extraction technique to adapt estimators to block-level distinct-value data; experiments show accuracy and speed gains. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 16 of 16 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 28 | Accurate Estimation Of The Number Of Tuples Satisfying A Condition | 1984 | SIGMOD | 0.00080435857 |
| 39 | Statistical Estimators for Relational Algebra Expressions | 1988 | PODS | 0.00074745564 |
| 59 | Sampling-Based Estimation of the Number of Distinct Values of an Attribute | 1995 | VLDB | 0.00064501896 |
| 64 | Improved Histograms for Selectivity Estimation of Range Predicates | 1996 | SIGMOD | 0.00063612837 |
| 92 | Practical Selectivity Estimation through Adaptive Sampling | 1990 | SIGMOD | 0.00051315959 |
| 269 | Fast Incremental Maintenance of Approximate Histograms | 1997 | VLDB | 0.00029656549 |
| 315 | Error-Constrained COUNT Query Evaluation in Relational Databases | 1991 | SIGMOD | 0.0002802103 |
| 367 | Sequential Sampling Procedures For Query Size Estimation | 1992 | SIGMOD | 0.00025509745 |
| 378 | Towards Estimation Error Guarantees for Distinct Values | 2000 | PODS | 0.0002497492 |
| 530 | Random Sampling for Histogram Construction: How much is enough? | 1998 | SIGMOD | 0.00020803682 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 39 | Statistical Estimators for Relational Algebra Expressions | 1988 | PODS | 0.00074745564 |
| 2,995 | A Sampling Algebra for Aggregate Estimation | 2013 | VLDB | 7.7587199e-05 |
| 12,108 | Space-Efficient Estimation of Statistics over Sub-Sampled Streams | 2012 | PODS | 4.1945683e-05 |
| 5,879 | Fast and Near–Optimal Algorithms for Approximating Distributions by Histograms | 2015 | PODS | 5.2908101e-05 |
| 308 | Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports | 2001 | VLDB | 0.00028142852 |
| 530 | Random Sampling for Histogram Construction: How much is enough? | 1998 | SIGMOD | 0.00020803682 |
| 9,523 | Analysis and Application of Adaptive Sampling | 2000 | PODS | 4.331052e-05 |
| 12,166 | Get the Most out of Your Sample: Optimal Unbiased Estimators using Partial Information | 2011 | PODS | 4.1945683e-05 |
| 59 | Sampling-Based Estimation of the Number of Distinct Values of an Attribute | 1995 | VLDB | 0.00064501896 |
| 378 | Towards Estimation Error Guarantees for Distinct Values | 2000 | PODS | 0.0002497492 |