Tracking Join and Self-Join Sizes in Limited Storage
Summary: Approximate self-join sizes under insertions/deletions in tiny space; tug-of-war sketches outperform sample-and-count and plain sampling for skew detection and self-join estimation. Proves lower bounds showing sampling-based compact join signatures are essentially optimal absent assumptions, but proposes tug-of-war-based join signatures whose error scales with relations' self-join sizes and can significantly beat sampling. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Noga Alon
- 2. Phillip B. Gibbons
- 3. Yossi Matias
- 4. Mario Szegedy
Incoming Citations (Sorted by Pagerank)
Showing 41 of 41 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 99 | On the Propagation of Errors in the Size of Join Results | 1991 | SIGMOD | 0.00050022914 |
| 18 | On Random Sampling over Joins | 1999 | SIGMOD | 0.00092385438 |
| 8,959 | Reservoir Sampling over Joins | 2024 | SIGMOD | 4.4206222e-05 |
| 811 | On the Relative Cost of Sampling for Join Selectivity Estimation | 1994 | PODS | 0.00016425612 |
| 11,446 | Index-Based Join Size Estimation Using Adaptive Sampling | 2021 | SIGMOD | 4.1945683e-05 |
| 5,104 | Guaranteeing the O~(AGM/OUT) Runtime for Uniform Sampling and Size Estimation over Joins | 2023 | PODS | 5.6946113e-05 |
| 92 | Practical Selectivity Estimation through Adaptive Sampling | 1990 | SIGMOD | 0.00051315959 |
| 1,193 | Join Size Estimation Subject to Filter Conditions | 2015 | VLDB | 0.00013414989 |
| 762 | Query Size Estimation by Adaptive Sampling (Extended Abstract) | 1990 | PODS | 0.00017036868 |
| 2,254 | Two-Level Sampling for Join Size Estimation | 2017 | SIGMOD | 9.1897043e-05 |