Database Paper Browser

Back to papers

Finding Frequent Items in Data Streams

Summary: Unifies leading frequent-item algorithms for data streams under a common framework; baseline implementations and uniform empirical comparison. Shows performance variation; achieves high accuracy with tens of KB and M items/sec on commodity hardware. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
9801
Venue
VLDB
Year
2008
Pagerank
0.00016109621
Overall Rank
835 | 94.20%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 28 of 28 citing papers.

Rank Citing Paper Year Venue Pagerank
402 Mergeable Summaries 2012 PODS 0.00024196343
1,584 Augmented Sketch: Faster and More Accurate Stream Processing 2016 SIGMOD 0.00011255801
1,629 Space-optimal Heavy Hitters with Strong Error Bounds 2009 PODS 0.00011085267
1,941 Cold Filter: A Meta-Framework for Faster and More Accurate Stream Processing 2018 SIGMOD 0.00010017745
2,254 Two-Level Sampling for Join Size Estimation 2017 SIGMOD 9.1897043e-05
2,437 gSketch: On Query Estimation in Graph Streams 2012 VLDB 8.8231651e-05
3,220 Structural Trend Analysis for Online Social Networks 2011 VLDB 7.3531514e-05
3,271 Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation 2018 SIGMOD 7.2968732e-05
3,319 Sketching Linear Classifiers over Data Streams 2018 SIGMOD 7.226439e-05
4,076 Quantiles over Data Streams: An Experimental Study 2013 SIGMOD 6.4680854e-05
4,190 Randomized Algorithms for Tracking Distributed Count, Frequencies, and Ranks 2012 PODS 6.3739017e-05
4,249 Optimal Tracking of Distributed Heavy Hitters and Quantiles 2009 PODS 6.3245666e-05
5,163 Finding Persistent Items in Data Streams 2017 VLDB 5.6550193e-05
5,369 Pyramid Sketch: a Sketch Framework for Frequency Estimation of Data Streams 2017 VLDB 5.5434712e-05
5,903 Building Wavelet Histograms on Large Data in MapReduce 2012 VLDB 5.2791351e-05
6,418 An Optimal Algorithm for l1-Heavy Hitters in Insertion Streams and Related Problems 2016 PODS 5.0696932e-05
6,495 Sampling Based Algorithms for Quantile Computation in Sensor Networks 2011 SIGMOD 5.0413486e-05
6,599 Local Differentially Private Heavy Hitter Detection in Data Streams with Bounded Memory 2024 SIGMOD 4.9973567e-05
7,784 Authenticated Online Data Integration Services 2015 SIGMOD 4.6517065e-05
7,880 Thread Cooperation in Multicore Architectures for Frequency Counting over Multiple Data Streams 2009 VLDB 4.6291185e-05
7,914 Efficient Approximate Algorithms for Empirical Entropy and Mutual Information 2021 SIGMOD 4.6179608e-05
7,929 Optimal Approximate Matrix Multiplication over Sliding Windows 2026 VLDB 4.613363e-05
8,203 SpaceSaving±: An Optimal Algorithm for Frequency Estimation and Frequent Items in the Bounded-Deletion Model 2022 VLDB 4.5596344e-05
9,227 Panakos: Chasing the Tails for Multidimensional Data Streams 2023 VLDB 4.3692732e-05
10,659 Cuckoo Heavy Keeper and the balancing act of maintaining heavy hitters in stream processing 2025 VLDB 4.1945683e-05
10,712 DobLIX: A Dual-Objective Learned Index for Log-Structured Merge Trees 2025 VLDB 4.1945683e-05
11,502 In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All 2021 VLDB 4.1945683e-05
11,797 Runtime Optimization of Join Location in Parallel Data Management Systems 2017 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers