Optimal Sampling From Distributed Streams
Summary: Communication-optimal protocols for uniform sampling (with/without replacement) from k distributed streams, handling full-stream and sliding-window (last W items or last w time units) cases. Achieve minimal per-item time and space (optimal or near-optimal up to logs) with matching lower bounds. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Graham Cormode
- 2. S. Muthukrishnan
- 3. Ke Yi
- 4. Qin Zhang
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,040 | Graph Sketches: Sparsification, Spanners, and Subgraphs | 2012 | PODS | 0.00014488943 |
| 1,094 | Tight Bounds for Lp Samplers, Finding Duplicates in Streams, and Related Problems | 2011 | PODS | 0.00014129658 |
| 3,063 | Sliding Window-based Approximate Triangle Counting over Streaming Graphs with Duplicate Edges | 2021 | SIGMOD | 7.6321424e-05 |
| 3,313 | Quality and Efficiency in Kernel Density Estimates for Large Data | 2013 | SIGMOD | 7.2381634e-05 |
| 6,716 | Continuous Distributed Counting for Non-monotonic Streams | 2012 | PODS | 4.9507254e-05 |
| 11,320 | Truly Perfect Samplers for Data Streams and Sliding Windows | 2022 | PODS | 4.1945683e-05 |
| 11,970 | Stratified-Sampling over Social Networks Using MapReduce | 2014 | SIGMOD | 4.1945683e-05 |
| 12,054 | Utility-Maximizing Event Stream Suppression | 2013 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 126 | Space-Efficient Online Computation of Quantile Summaries | 2001 | SIGMOD | 0.00044744986 |
| 745 | Distributed Top-K Monitoring | 2003 | SIGMOD | 0.00017330487 |
| 1,392 | Sketching Streams Through the Net: Distributed Approximate Query Tracking | 2005 | VLDB | 0.00012229045 |
| 1,640 | Communication-Efficient Distributed Monitoring of Thresholded Counts | 2006 | SIGMOD | 0.0001104808 |
| 2,789 | Optimal Sampling from Sliding Windows | 2009 | PODS | 8.1249652e-05 |
| 2,878 | Sampling Time-Based Sliding Windows in Bounded Space | 2008 | SIGMOD | 7.9706235e-05 |
| 2,920 | A Geometric Approach to Monitoring Threshold Functions Over Distributed Data Streams | 2006 | SIGMOD | 7.9001024e-05 |
| 2,931 | Holistic Aggregates in a Networked World: Distributed Tracking of Approximate Quantiles | 2005 | SIGMOD | 7.8697258e-05 |
| 4,249 | Optimal Tracking of Distributed Heavy Hitters and Quantiles | 2009 | PODS | 6.3245666e-05 |
| 5,051 | Shape Sensitive Geometric Monitoring | 2008 | PODS | 5.7340225e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,789 | Optimal Sampling from Sliding Windows | 2009 | PODS | 8.1249652e-05 |
| 8,470 | Sampling Big Ideas in Query Optimization | 2023 | PODS | 4.5038423e-05 |
| 1,392 | Sketching Streams Through the Net: Distributed Approximate Query Tracking | 2005 | VLDB | 0.00012229045 |
| 4,718 | Weighted Reservoir Sampling from Distributed Streams | 2019 | PODS | 5.9749691e-05 |
| 745 | Distributed Top-K Monitoring | 2003 | SIGMOD | 0.00017330487 |
| 5,117 | Sampling Algorithms in a Stream Operator | 2005 | SIGMOD | 5.6825418e-05 |
| 7,334 | Streaming in a Connected World: Querying and Tracking Distributed Data Streams | 2007 | SIGMOD | 4.7604215e-05 |
| 1,003 | Adaptive Filters for Continuous Queries over Distributed Data Streams | 2003 | SIGMOD | 0.00014698435 |
| 7,834 | Sketch-based Querying of Distributed Sliding-Window Data Streams | 2012 | VLDB | 4.6382551e-05 |
| 11,853 | Scalable Approximate Query Tracking over Highly Distributed Data Streams | 2016 | SIGMOD | 4.1945683e-05 |