Statistical Analysis of Sketch Estimators
Summary: A statistical analysis of linear sketch estimators for streaming and distributed aggregates, with an empirical comparison of Fast-AGMS, Count-Min, and other sketches. The paper finds that the theory underestimates practical performance and that Fast-AGMS is often the best or near-best estimator across problem classes, a result intended to guide practitioners. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Florin Rusu
2. Alin Dobra
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 835 | Finding Frequent Items in Data Streams | 2008 | VLDB | 0.00016109621 |
| 3,614 | Persistent Data Sketching | 2015 | SIGMOD | 6.9147318e-05 |
| 5,880 | COMPASS: Online Sketch-based Query Optimization for In-Memory Databases | 2021 | SIGMOD | 5.2898074e-05 |
| 7,164 | SKT: A One-Pass Multi-Sketch Data Analytics Accelerator | 2021 | VLDB | 4.8131514e-05 |
| 9,041 | TreeSensing: Linearly Compressing Sketches with Flexibility | 2023 | SIGMOD | 4.4039656e-05 |
| 9,082 | JoinSketch: A Sketch Algorithm for Accurate and Unbiased Inner-Product Estimation | 2023 | SIGMOD | 4.3998984e-05 |
| 11,338 | AutoMon: Automatic Distributed Monitoring for Arbitrary Multivariate Functions | 2022 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 217 | Ripple Joins for Online Aggregation | 1999 | SIGMOD | 0.00033536712 |
| 1,392 | Sketching Streams Through the Net: Distributed Approximate Query Tracking | 2005 | VLDB | 0.00012229045 |
| 3,543 | Approximation Techniques for Spatial Data | 2004 | SIGMOD | 6.9917053e-05 |
| 6,511 | Fast Range-Summable Random Variables for Efficient Aggregate Estimation | 2006 | SIGMOD | 5.032518e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,699 | Sketch-based Geometric Monitoring of Distributed Stream Queries | 2013 | VLDB | 4.6746076e-05 |
| 2,437 | gSketch: On Query Estimation in Graph Streams | 2012 | VLDB | 8.8231651e-05 |
| 6,244 | Approximate Distinct Counts for Billions of Datasets | 2019 | SIGMOD | 5.139669e-05 |
| 6,511 | Fast Range-Summable Random Variables for Efficient Aggregate Estimation | 2006 | SIGMOD | 5.032518e-05 |
| 7,834 | Sketch-based Querying of Distributed Sliding-Window Data Streams | 2012 | VLDB | 4.6382551e-05 |
| 11,304 | Bayesian Sketches for Volume Estimation in Data Streams | 2023 | VLDB | 4.1945683e-05 |
| 8,451 | Efficient framework for operating on data sketches | 2023 | VLDB | 4.5086031e-05 |
| 3,271 | Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation | 2018 | SIGMOD | 7.2968732e-05 |
| 8,697 | Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries | 2024 | SIGMOD | 4.4657888e-05 |
| 1,064 | Processing Complex Aggregate Queries over Data Streams | 2002 | SIGMOD | 0.00014356481 |