On Synopses for Distinct-Value Estimation Under Multiset Operations
Summary: DV estimation via scalable synopsis store: partition synopses computed in parallel, combinable for unions, intersections, and differences. Order-statistics-based estimators are unbiased; a Cohen-driven limit theorem sizes synopses, reducing cost and boosting accuracy. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Kevin Beyer
- 2. Peter J. Haas
- 3. Berthold Reinwald
- 4. Yannis Sismanis
- 5. Rainer Gemulla
Incoming Citations (Sorted by Pagerank)
Showing 34 of 34 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1 | Access Path Selection in a Relational Database Management System | 1979 | SIGMOD | 0.0040449103 |
| 308 | Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports | 2001 | VLDB | 0.00028142852 |
| 325 | The History of Histograms (abridged) | 2003 | VLDB | 0.00027378328 |
| 378 | Towards Estimation Error Guarantees for Distinct Values | 2000 | PODS | 0.0002497492 |
| 475 | Mining Database Structure; Or, How to Build a Data Quality Browser | 2002 | SIGMOD | 0.00022303253 |
| 593 | Storage Estimation for Multidimensional Aggregates in the Presence of Hierarchies | 1996 | VLDB | 0.00019536993 |
| 2,045 | Multi-Dimensional Clustering: A New Data Layout Scheme in DB2 | 2003 | SIGMOD | 9.6939983e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,304 | Bayesian Sketches for Volume Estimation in Data Streams | 2023 | VLDB | 4.1945683e-05 |
| 956 | How to Summarize the Universe: Dynamic Maintenance of Quantiles | 2002 | VLDB | 0.00015066967 |
| 3,271 | Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation | 2018 | SIGMOD | 7.2968732e-05 |
| 12,166 | Get the Most out of Your Sample: Optimal Unbiased Estimators using Partial Information | 2011 | PODS | 4.1945683e-05 |
| 1,683 | Cardinality Estimation: An Experimental Survey | 2018 | VLDB | 0.00010922679 |
| 59 | Sampling-Based Estimation of the Number of Distinct Values of an Attribute | 1995 | VLDB | 0.00064501896 |
| 12,531 | Join-Distinct Aggregate Estimation over Update Streams | 2005 | PODS | 4.1945683e-05 |
| 308 | Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports | 2001 | VLDB | 0.00028142852 |
| 378 | Towards Estimation Error Guarantees for Distinct Values | 2000 | PODS | 0.0002497492 |
| 6,244 | Approximate Distinct Counts for Billions of Datasets | 2019 | SIGMOD | 5.139669e-05 |