A Simple and Efficient Estimation Method for Stream Expression Cardinalities
Summary: Proposes a statistical model and a simple estimator for set-expression cardinalities over distributed streams via a continuous Flajolet–Martin sketch. For two streams, it matches MLE efficiency; with many streams, it stays simple and memory is O(delta^-2 |S|^-1 N log log N), beating prior sketches. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Aiyou Chen
- 2. Jin Cao
- 3. Tian Bu
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 308 | Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports | 2001 | VLDB | 0.00028142852 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,905 | Randomized Error Removal for Online Spread Estimation in Data Streaming | 2021 | VLDB | 5.8398332e-05 |
| 5,977 | Understanding Cardinality Estimation using Entropy Maximization | 2010 | PODS | 5.2455909e-05 |
| 12,108 | Space-Efficient Estimation of Statistics over Sub-Sampled Streams | 2012 | PODS | 4.1945683e-05 |
| 3,102 | Processing Set Expressions over Continuous Update Streams | 2003 | SIGMOD | 7.5586568e-05 |
| 11,304 | Bayesian Sketches for Volume Estimation in Data Streams | 2023 | VLDB | 4.1945683e-05 |
| 1,683 | Cardinality Estimation: An Experimental Survey | 2018 | VLDB | 0.00010922679 |
| 8,697 | Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries | 2024 | SIGMOD | 4.4657888e-05 |
| 6,244 | Approximate Distinct Counts for Billions of Datasets | 2019 | SIGMOD | 5.139669e-05 |
| 8,451 | Efficient framework for operating on data sketches | 2023 | VLDB | 4.5086031e-05 |
| 5,673 | Distributed Set-Expression Cardinality Estimation | 2004 | VLDB | 5.3780919e-05 |