Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation
Summary: Presents a sketch for disaggregated subset sum and heavy-hitter estimation that gives unbiased, high-accuracy estimates of sums under arbitrary filters. On i.i.d. data it yields consistent estimates of heavy-hitter proportions; on non-i.i.d. data it outperforms uniform sampling and rivals priority sampling. It also extends to distributed settings. (summarized by gpt-5-nano on Feb 09 2026)
Authors
- Daniel Ting
Incoming Citations (Sorted by Pagerank)
Showing 16 of 16 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 166 | Approximate Frequency Counts over Data Streams | 2002 | VLDB | 0.00039361552 |
| 184 | New Sampling-Based Summary Statistics for Improving Approximate Query Answers | 1998 | SIGMOD | 0.00036625711 |
| 402 | Mergeable Summaries | 2012 | PODS | 0.00024196343 |
| 835 | Finding Frequent Items in Data Streams | 2008 | VLDB | 0.00016109621 |
| 1,193 | Join Size Estimation Subject to Filter Conditions | 2015 | VLDB | 0.00013414989 |
| 5,496 | Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams | 2016 | SIGMOD | 5.4757316e-05 |
| 7,547 | Sketching Unaggregated Data Streams for Subpopulation-Size Queries | 2007 | PODS | 4.7144329e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,244 | Approximate Distinct Counts for Billions of Datasets | 2019 | SIGMOD | 5.139669e-05 |
| 4,237 | Statistical Analysis of Sketch Estimators | 2007 | SIGMOD | 6.3333486e-05 |
| 8,452 | On the algebra of data sketches | 2021 | VLDB | 4.5086031e-05 |
| 1,064 | Processing Complex Aggregate Queries over Data Streams | 2002 | SIGMOD | 0.00014356481 |
| 10,983 | A Universal Sketch for Estimating Heavy Hitters and Per-Element Frequency Moments in Data Streams with Bounded Deletions | 2024 | SIGMOD | 4.1945683e-05 |
| 12,108 | Space-Efficient Estimation of Statistics over Sub-Sampled Streams | 2012 | PODS | 4.1945683e-05 |
| 7,547 | Sketching Unaggregated Data Streams for Subpopulation-Size Queries | 2007 | PODS | 4.7144329e-05 |
| 3,385 | Estimating Statistical Aggregates on Probabilistic Data Streams | 2007 | PODS | 7.1580391e-05 |
| 3,928 | Tighter Estimation using Bottom-k Sketches | 2008 | VLDB | 6.6254568e-05 |
| 12,344 | Composable, Scalable, and Accurate Weight Summarization of Unaggregated Data Sets | 2009 | VLDB | 4.1945683e-05 |