Database Paper Browser

Back to papers

Approximate Frequency Counts over Data Streams

Summary: Memory-efficient streaming frequency-count algorithms with provable error bounds for threshold-exceeding items. Handles both singleton-item streams (IP monitoring) and set-valued streams; includes a single-pass, optimized frequent itemset computation. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
8857
Venue
VLDB
Year
2002
Pagerank
0.00039361552
Overall Rank
166 | 98.85%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 74 citing papers.

Rank Citing Paper Year Venue Pagerank
43 Models and Issues in Data Stream Systems 2002 PODS 0.00072723062
325 The History of Histograms (abridged) 2003 VLDB 0.00027378328
745 Distributed Top-K Monitoring 2003 SIGMOD 0.00017330487
781 Spectral Bloom Filters 2003 SIGMOD 0.00016741046
835 Finding Frequent Items in Data Streams 2008 VLDB 0.00016109621
848 Approximate Counts and Quantiles over Sliding Windows 2004 PODS 0.0001597308
865 What’s Hot and What’s Not: Tracking Most Frequent Items Dynamically 2003 PODS 0.00015808172
1,092 E-Store: Fine-Grained Elastic Partitioning for Distributed Transaction Processing Systems 2015 VLDB 0.00014135961
1,323 Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters 2016 SIGMOD 0.00012601997
1,392 Sketching Streams Through the Net: Distributed Approximate Query Tracking 2005 VLDB 0.00012229045
1,472 Space Efficient Mining of Multigraph Streams 2005 PODS 0.00011828662
1,629 Space-optimal Heavy Hitters with Strong Error Bounds 2009 PODS 0.00011085267
1,640 Communication-Efficient Distributed Monitoring of Thresholded Counts 2006 SIGMOD 0.0001104808
1,941 Cold Filter: A Meta-Framework for Faster and More Accurate Stream Processing 2018 SIGMOD 0.00010017745
2,178 Tributaries and Deltas: Efficient and Robust Aggregation in Sensor Network Streams 2005 SIGMOD 9.3559565e-05
2,232 Effective Phrase Prediction 2007 VLDB 9.2293508e-05
2,282 Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Inverse Sampling 2005 VLDB 9.1073603e-05
2,437 gSketch: On Query Estimation in Graph Streams 2012 VLDB 8.8231651e-05
2,607 Graph Stream Summarization: From Big Bang to Big Crunch 2016 SIGMOD 8.4630211e-05
2,789 Optimal Sampling from Sliding Windows 2009 PODS 8.1249652e-05
2,920 A Geometric Approach to Monitoring Threshold Functions Over Distributed Data Streams 2006 SIGMOD 7.9001024e-05
2,931 Holistic Aggregates in a Networked World: Distributed Tracking of Approximate Quantiles 2005 SIGMOD 7.8697258e-05
3,041 Sketching Probabilistic Data Streams 2007 SIGMOD 7.6697078e-05
3,256 Multidimensional Content eXploration 2008 VLDB 7.3158557e-05
3,271 Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation 2018 SIGMOD 7.2968732e-05
3,319 Sketching Linear Classifiers over Data Streams 2018 SIGMOD 7.226439e-05
3,486 Holistic UDAFs at Streaming Speeds 2004 SIGMOD 7.0502199e-05
3,660 Space Complexity of Hierarchical Heavy Hitters in Multi-Dimensional Data Streams 2005 PODS 6.8691367e-05
3,838 Approximately Detecting Duplicates for Streaming Data using Stable Bloom Filters 2006 SIGMOD 6.7134945e-05
3,860 Fast Data Stream Algorithms using Associative Memories 2007 SIGMOD 6.6902516e-05
4,190 Randomized Algorithms for Tracking Distributed Count, Frequencies, and Ranks 2012 PODS 6.3739017e-05
4,249 Optimal Tracking of Distributed Heavy Hitters and Quantiles 2009 PODS 6.3245666e-05
4,334 Diamond in the Rough: Finding Hierarchical Heavy Hitters in Multi-Dimensional Data 2004 SIGMOD 6.2798179e-05
4,350 On Biased Reservoir Sampling in the Presence of Stream Evolution 2006 VLDB 6.2645054e-05
4,449 False Positive or False Negative: Mining Frequent Itemsets from High Speed Transactional Data Streams 2004 VLDB 6.1780147e-05
4,618 Approximate Frequency Counts over Data Streams 2012 VLDB 6.0446717e-05
5,051 Shape Sensitive Geometric Monitoring 2008 PODS 5.7340225e-05
5,117 Sampling Algorithms in a Stream Operator 2005 SIGMOD 5.6825418e-05
5,163 Finding Persistent Items in Data Streams 2017 VLDB 5.6550193e-05
5,457 Fast and Approximate Stream Mining of Quantiles and Frequencies Using Graphics Processors 2005 SIGMOD 5.4970777e-05
5,496 Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams 2016 SIGMOD 5.4757316e-05
5,713 Remembrance of Streams Past: Overload-Sensitive Management of Archived Streams 2004 VLDB 5.3581653e-05
5,796 Finding Frequent Items in Probabilistic Data 2008 SIGMOD 5.3240234e-05
6,244 Approximate Distinct Counts for Billions of Datasets 2019 SIGMOD 5.139669e-05
6,342 A Regression-Based Temporal Pattern Mining Scheme for Data Streams 2003 VLDB 5.1034654e-05
6,362 SLEUTH: Single-publisher attack detection Using correlation Hunting 2008 VLDB 5.0953013e-05
6,368 Pre-training Summarization Models of Structured Datasets for Cardinality Estimation 2022 VLDB 5.0937722e-05
6,418 An Optimal Algorithm for l1-Heavy Hitters in Insertion Streams and Related Problems 2016 PODS 5.0696932e-05
6,431 Finding Global Icebergs over Distributed Data Sets 2006 PODS 5.0654592e-05
6,599 Local Differentially Private Heavy Hitter Detection in Data Streams with Bounded Memory 2024 SIGMOD 4.9973567e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 16 of 16 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers