Database Paper Browser

Back to papers

A Universal Sketch for Estimating Heavy Hitters and Per-Element Frequency Moments in Data Streams with Bounded Deletions

Summary: Introduces Removable Augmented Sketch and Removable Universal Sketch for bounded-deletion turnstile streams, enabling online, memory-efficient estimation of heavy hitters, per-element frequencies, frequency moments, and distribution. Augments with Online Moment Estimator and compressed counters to boost accuracy and memory, delivering 16–69% F1 gains and up to 3e4× throughput in moment estimation. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6973
Venue
SIGMOD
Year
2024
Pagerank
4.1945683e-05
Overall Rank
10,983 | 23.60%
DOI
10.1145/3698799

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 28 of 28 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1 Access Path Selection in a Relational Database Management System 1979 SIGMOD 0.0040449103
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
166 Approximate Frequency Counts over Data Streams 2002 VLDB 0.00039361552
549 Tracking Join and Self-Join Sizes in Limited Storage 1999 PODS 0.00020376603
1,584 Augmented Sketch: Faster and More Accurate Stream Processing 2016 SIGMOD 0.00011255801
1,941 Cold Filter: A Meta-Framework for Faster and More Accurate Stream Processing 2018 SIGMOD 0.00010017745
2,884 BPTree: an ℓ2 Heavy Hitters Algorithm Using Constant Memory 2017 PODS 7.9620506e-05
2,953 Moment-Based Quantile Sketches for Efficient High Cardinality Aggregation Queries 2018 VLDB 7.8267643e-05
3,271 Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation 2018 SIGMOD 7.2968732e-05
3,319 Sketching Linear Classifiers over Data Streams 2018 SIGMOD 7.226439e-05
3,990 FactorJoin: A New Cardinality Estimation Framework for Join Queries 2023 SIGMOD 6.5581983e-05
4,905 Randomized Error Removal for Online Spread Estimation in Data Streaming 2021 VLDB 5.8398332e-05
5,151 String Similarity Measures and Joins with Synonyms 2013 SIGMOD 5.6609851e-05
5,401 ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads 2024 VLDB 5.5285035e-05
5,627 KLL± Approximate Quantile Sketches over Dynamic Datasets 2021 VLDB 5.403782e-05
5,880 COMPASS: Online Sketch-based Query Optimization for In-Memory Databases 2021 SIGMOD 5.2898074e-05
5,972 SafeBound: A Practical System for Generating Cardinality Bounds 2023 SIGMOD 5.2474768e-05
6,593 Out of Many We are One: Measuring Item Batch with Clock-Sketch 2021 SIGMOD 4.9999287e-05
7,062 EAGr: Supporting Continuous Ego-centric Aggregate Queries over Large Dynamic Graphs 2014 SIGMOD 4.8462038e-05
7,912 Mining Quality Phrases from Massive Text Corpora 2015 SIGMOD 4.6183486e-05
8,203 SpaceSaving±: An Optimal Algorithm for Frequency Estimation and Frequent Items in the Bounded-Deletion Model 2022 VLDB 4.5596344e-05
8,380 Single Update Sketch with Variable Counter Structure 2023 VLDB 4.5310997e-05
8,812 Memory-Efficient and Flexible Detection of Heavy Hitters in High-Speed Networks 2023 SIGMOD 4.4438508e-05
8,829 A Distributed System for Large-scale n-gram Language Models at Tencent 2019 VLDB 4.4406886e-05
9,019 STAR: A Distributed Stream Warehouse System for Spatial Data 2020 SIGMOD 4.4082606e-05
9,227 Panakos: Chasing the Tails for Multidimensional Data Streams 2023 VLDB 4.3692732e-05
9,350 Tracking Set Correlations at Large Scale 2014 SIGMOD 4.3525655e-05
9,858 VIP Hashing - Adapting to Skew in Popularity of Data on the Fly 2022 VLDB 4.269353e-05
Previous Page 1 / 1 Next

Semantically Similar Papers