JoinSketch: A Sketch Algorithm for Accurate and Unbiased Inner-Product Estimation
Summary: JoinSketch is a multi-component sketch for accurate, unbiased inner-product estimation in data management tasks (join size, stream similarity, cosine similarity) under skewed data. It provably achieves lower variance than AGMS/Fast-AGMS with ~10× accuracy gains and comparable throughput; code is open source. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Feiyu Wang
- 2. Qizhi Chen
- 3. Yuanpeng Li
- 4. Tong Yang
- 5. Yaofeng Tu
- 6. Lian Yu
- 7. Bin Cui
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,697 | Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries | 2024 | SIGMOD | 4.4657888e-05 |
| 10,149 | CorrBound: Cardinality Estimation Accounting for Inter- and Intra-relation Correlations | 2026 | SIGMOD | 4.1945683e-05 |
| 10,981 | Enabling Adaptive Sampling for Intra-Window Join: Simultaneously Optimizing Quantity and Quality | 2024 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 22 of 22 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,141 | ClusterJoin: A Similarity Joins Framework using Map-Reduce | 2014 | VLDB | 7.4829448e-05 |
| 1,193 | Join Size Estimation Subject to Filter Conditions | 2015 | VLDB | 0.00013414989 |
| 10,930 | Similarity Joins of Sparse Features | 2024 | SIGMOD | 4.1945683e-05 |
| 1,584 | Augmented Sketch: Faster and More Accurate Stream Processing | 2016 | SIGMOD | 0.00011255801 |
| 5,200 | SetSketch: Filling the Gap between MinHash and HyperLogLog | 2021 | VLDB | 5.6337581e-05 |
| 12,531 | Join-Distinct Aggregate Estimation over Update Streams | 2005 | PODS | 4.1945683e-05 |
| 11,025 | Sampling Methods for Inner Product Sketching | 2024 | VLDB | 4.1945683e-05 |
| 9,628 | Approximate Sketches | 2024 | SIGMOD | 4.3143499e-05 |
| 8,697 | Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries | 2024 | SIGMOD | 4.4657888e-05 |
| 11,168 | Weighted Minwise Hashing Beats Linear Sketching for Inner Product Estimation | 2023 | PODS | 4.1945683e-05 |