Database Paper Browser

Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation

Summary: The paper presents a sketch for the disaggregated subset sum problem and heavy-hitter estimation that yields unbiased, high-accuracy sum estimates under arbitrary filters. On i.i.d. data it gives consistent estimates of heavy-hitter proportions; on non-i.i.d. data it outperforms uniform sampling and rivals priority sampling. Distributed extensions are also covered. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 5479
Venue: SIGMOD
Year: 2018
Pagerank: 7.2968732e-05
Overall Rank: 3,271 | 77.25%
DOI: 10.1145/3183713.3183759

[Chart: Incoming Non-self Citations Over Time]

Incoming Citations (Sorted by Pagerank)

Showing 16 of 16 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
2,142 | Pessimistic Cardinality Estimation: Tighter Upper Bounds for Intermediate Join Cardinalities | 2019 | SIGMOD | 9.4507296e-05
3,751 | BurstSketch: Finding Bursts in Data Streams | 2021 | SIGMOD | 6.7888099e-05
6,244 | Approximate Distinct Counts for Billions of Datasets | 2019 | SIGMOD | 5.139669e-05
6,790 | On-Off Sketch: A Fast and Accurate Sketch on Persistence | 2021 | VLDB | 4.9251439e-05
7,732 | Double-Anonymous Sketch: Achieving Top-K-fairness for Finding Global Top-K Frequent Items | 2023 | SIGMOD | 4.6657123e-05
7,870 | LadderFilter: Filtering Infrequent Items with Small Memory and Time Overhead | 2023 | SIGMOD | 4.6308128e-05
8,203 | SpaceSaving±: An Optimal Algorithm for Frequency Estimation and Frequent Items in the Bounded-Deletion Model | 2022 | VLDB | 4.5596344e-05
8,250 | Stingy Sketch: A Sketch Framework for Accurate and Fast Frequency Estimation | 2022 | VLDB | 4.5506131e-05
8,673 | CoopStore: Optimizing Precomputed Summaries for Aggregation | 2020 | VLDB | 4.4709116e-05
9,082 | JoinSketch: A Sketch Algorithm for Accurate and Unbiased Inner-Product Estimation | 2023 | SIGMOD | 4.3998984e-05
9,227 | Panakos: Chasing the Tails for Multidimensional Data Streams | 2023 | VLDB | 4.3692732e-05
9,402 | CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models | 2024 | SIGMOD | 4.3441378e-05
9,962 | Adaptive threshold sampling | 2022 | SIGMOD | 4.2294678e-05
10,386 | Pandora: An Efficient and Rapid Solution for Persistence-Based Tasks in High-Speed Data Streams | 2025 | SIGMOD | 4.1945683e-05
10,983 | A Universal Sketch for Estimating Heavy Hitters and Per-Element Frequency Moments in Data Streams with Bounded Deletions | 2024 | SIGMOD | 4.1945683e-05
11,364 | MinMax Sampling: A Near-optimal Global Summary for Aggregation in the Wide Area | 2022 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
