Back to papers
A Universal Sketch for Estimating Heavy Hitters and Per-Element Frequency Moments in Data Streams with Bounded Deletions
Summary: Introduces Removable Augmented Sketch and Removable Universal Sketch for bounded-deletion turnstile streams, enabling online, memory-efficient estimation of heavy hitters, per-element frequencies, frequency moments, and distribution. Augments with Online Moment Estimator and compressed counters to boost accuracy and memory, delivering 16–69% F1 gains and up to 3e4× throughput in moment estimation.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 6973
- Venue
- SIGMOD
- Year
- 2024
- Pagerank
- 4.1945683e-05
- Overall Rank
- 10,983 | 23.60%
- DOI
-
10.1145/3698799
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 28 of 28 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 1 |
Access Path Selection in a Relational Database Management System |
1979 |
SIGMOD |
0.0040449103 |
| 71 |
How Good Are Query Optimizers, Really? |
2016 |
VLDB |
0.00059038975 |
| 166 |
Approximate Frequency Counts over Data Streams |
2002 |
VLDB |
0.00039361552 |
| 549 |
Tracking Join and Self-Join Sizes in Limited Storage |
1999 |
PODS |
0.00020376603 |
| 1,584 |
Augmented Sketch: Faster and More Accurate Stream Processing |
2016 |
SIGMOD |
0.00011255801 |
| 1,941 |
Cold Filter: A Meta-Framework for Faster and More Accurate Stream Processing |
2018 |
SIGMOD |
0.00010017745 |
| 2,884 |
BPTree: an ℓ2 Heavy Hitters Algorithm Using Constant Memory |
2017 |
PODS |
7.9620506e-05 |
| 2,953 |
Moment-Based Quantile Sketches for Efficient High Cardinality Aggregation Queries |
2018 |
VLDB |
7.8267643e-05 |
| 3,271 |
Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation |
2018 |
SIGMOD |
7.2968732e-05 |
| 3,319 |
Sketching Linear Classifiers over Data Streams |
2018 |
SIGMOD |
7.226439e-05 |
| 3,990 |
FactorJoin: A New Cardinality Estimation Framework for Join Queries |
2023 |
SIGMOD |
6.5581983e-05 |
| 4,905 |
Randomized Error Removal for Online Spread Estimation in Data Streaming |
2021 |
VLDB |
5.8398332e-05 |
| 5,151 |
String Similarity Measures and Joins with Synonyms |
2013 |
SIGMOD |
5.6609851e-05 |
| 5,401 |
ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads |
2024 |
VLDB |
5.5285035e-05 |
| 5,627 |
KLL± Approximate Quantile Sketches over Dynamic Datasets |
2021 |
VLDB |
5.403782e-05 |
| 5,880 |
COMPASS: Online Sketch-based Query Optimization for In-Memory Databases |
2021 |
SIGMOD |
5.2898074e-05 |
| 5,972 |
SafeBound: A Practical System for Generating Cardinality Bounds |
2023 |
SIGMOD |
5.2474768e-05 |
| 6,593 |
Out of Many We are One: Measuring Item Batch with Clock-Sketch |
2021 |
SIGMOD |
4.9999287e-05 |
| 7,062 |
EAGr: Supporting Continuous Ego-centric Aggregate Queries over Large Dynamic Graphs |
2014 |
SIGMOD |
4.8462038e-05 |
| 7,912 |
Mining Quality Phrases from Massive Text Corpora |
2015 |
SIGMOD |
4.6183486e-05 |
| 8,203 |
SpaceSaving±: An Optimal Algorithm for Frequency Estimation and Frequent Items in the Bounded-Deletion Model |
2022 |
VLDB |
4.5596344e-05 |
| 8,380 |
Single Update Sketch with Variable Counter Structure |
2023 |
VLDB |
4.5310997e-05 |
| 8,812 |
Memory-Efficient and Flexible Detection of Heavy Hitters in High-Speed Networks |
2023 |
SIGMOD |
4.4438508e-05 |
| 8,829 |
A Distributed System for Large-scale n-gram Language Models at Tencent |
2019 |
VLDB |
4.4406886e-05 |
| 9,019 |
STAR: A Distributed Stream Warehouse System for Spatial Data |
2020 |
SIGMOD |
4.4082606e-05 |
| 9,227 |
Panakos: Chasing the Tails for Multidimensional Data Streams |
2023 |
VLDB |
4.3692732e-05 |
| 9,350 |
Tracking Set Correlations at Large Scale |
2014 |
SIGMOD |
4.3525655e-05 |
| 9,858 |
VIP Hashing - Adapting to Skew in Popularity of Data on the Fly |
2022 |
VLDB |
4.269353e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 1,629 |
Space-optimal Heavy Hitters with Strong Error Bounds |
2009 |
PODS |
0.00011085267 |
| 8,697 |
Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries |
2024 |
SIGMOD |
4.4657888e-05 |
| 10,034 |
SieveSketch: A Fine-grained and Adaptive Sketch Framework for Accurate Frequency Estimation |
2026 |
SIGMOD |
4.1945683e-05 |
| 8,062 |
Together is Better: Heavy Hitters Quantile Estimation |
2023 |
SIGMOD |
4.5943269e-05 |
| 11,440 |
Frequent Elements with Witnesses in Data Streams |
2021 |
PODS |
4.1945683e-05 |
| 5,369 |
Pyramid Sketch: a Sketch Framework for Frequency Estimation of Data Streams |
2017 |
VLDB |
5.5434712e-05 |
| 1,584 |
Augmented Sketch: Faster and More Accurate Stream Processing |
2016 |
SIGMOD |
0.00011255801 |
| 3,271 |
Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation |
2018 |
SIGMOD |
7.2968732e-05 |
| 3,385 |
Estimating Statistical Aggregates on Probabilistic Data Streams |
2007 |
PODS |
7.1580391e-05 |
| 12,108 |
Space-Efficient Estimation of Statistics over Sub-Sampled Streams |
2012 |
PODS |
4.1945683e-05 |