On Joining and Caching Stochastic Streams
Summary: A statistics-aware caching framework for joining stochastic streams with bounded memory aims to maximize the expected number of output tuples. It shows that full lookahead can be suboptimal, derives a dominance condition between candidate tuples, and provides a heuristic that matches optimal behavior under that condition, with empirical gains and a reduction of static paging to stream joins. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Junyi Xie
- 2. Jun Yang
- 3. Yuguo Chen
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,133 | Memory-Limited Execution of Windowed Stream Joins | 2004 | VLDB | 6.4196026e-05 |
| 5,150 | Efficient Join Synopsis Maintenance for Data Warehouse | 2020 | SIGMOD | 5.6626586e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 18 | On Random Sampling over Joins | 1999 | SIGMOD | 0.00092385438 |
| 194 | Query Processing, Resource Management, and Approximation in a Data Stream Management System | 2003 | CIDR | 0.00035426067 |
| 726 | Load Shedding in a Data Stream Manager | 2003 | VLDB | 0.00017511209 |
| 1,717 | Approximate Join Processing Over Data Streams | 2003 | SIGMOD | 0.00010793312 |
| 2,404 | Maintaining Variance and k–Medians over Data Stream Windows | 2003 | PODS | 8.8837279e-05 |
| 2,448 | Multi-Dimensional Regression Analysis of Time-Series Data Streams | 2002 | VLDB | 8.8032353e-05 |
| 4,133 | Memory-Limited Execution of Windowed Stream Joins | 2004 | VLDB | 6.4196026e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,275 | Adopting Worst-Case Optimal Joins in Relational Database Systems | 2020 | VLDB | 9.1262202e-05 |
| 3,041 | Sketching Probabilistic Data Streams | 2007 | SIGMOD | 7.6697078e-05 |
| 8,959 | Reservoir Sampling over Joins | 2024 | SIGMOD | 4.4206222e-05 |
| 10,967 | Low-Latency Adaptive Distributed Stream Join System Based on a Flexible Join Model | 2024 | SIGMOD | 4.1945683e-05 |
| 1,064 | Processing Complex Aggregate Queries over Data Streams | 2002 | SIGMOD | 0.00014356481 |
| 7,967 | Stochastic Consistency, and Scalable Pull-Based Caching for Erratic Data Stream Sources | 2004 | VLDB | 4.613363e-05 |
| 3,656 | Processing Sliding Window Multi-Joins in Continuous Queries over Data Streams | 2003 | VLDB | 6.8714509e-05 |
| 12,531 | Join-Distinct Aggregate Estimation over Update Streams | 2005 | PODS | 4.1945683e-05 |
| 1,717 | Approximate Join Processing Over Data Streams | 2003 | SIGMOD | 0.00010793312 |
| 4,133 | Memory-Limited Execution of Windowed Stream Joins | 2004 | VLDB | 6.4196026e-05 |