A Dip in the Reservoir: Maintaining Sample Synopses of Evolving Datasets
Summary: Proposes Random Pairing (RP): bounded-size sample maintenance for evolving data; extends reservoir sampling to deletions. Stable data: RP yields fast samples; growing data: minimal-time resize touches base; experiments confirm speed and stability. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Rainer Gemulla
- 2. Wolfgang Lehner
- 3. Peter J. Haas
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,266 | Estimating the Confidence of Conditional Functional Dependencies | 2009 | SIGMOD | 9.1540815e-05 |
| 3,013 | Cardinality Estimation Using Sample Views with Quality Assurance | 2007 | SIGMOD | 7.7137441e-05 |
| 6,190 | Maintaining Bernoulli Samples over Evolving Multisets | 2007 | PODS | 5.1645517e-05 |
| 7,415 | Efficient and Scalable Statistics Gathering for Large Databases in Oracle 11g | 2008 | SIGMOD | 4.7355557e-05 |
| 8,240 | Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters | 2019 | VLDB | 4.5522563e-05 |
| 12,203 | Resiliency-Aware Data Management | 2011 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,470 | Sampling Big Ideas in Query Optimization | 2023 | PODS | 4.5038423e-05 |
| 92 | Practical Selectivity Estimation through Adaptive Sampling | 1990 | SIGMOD | 0.00051315959 |
| 4,694 | Scalable Reservoir Sampling on Many-Core CPUs | 2019 | SIGMOD | 5.9944898e-05 |
| 18 | On Random Sampling over Joins | 1999 | SIGMOD | 0.00092385438 |
| 6,190 | Maintaining Bernoulli Samples over Evolving Multisets | 2007 | PODS | 5.1645517e-05 |
| 2,368 | Online Maintenance of Very Large Random Samples | 2004 | SIGMOD | 8.9501526e-05 |
| 269 | Fast Incremental Maintenance of Approximate Histograms | 1997 | VLDB | 0.00029656549 |
| 46 | Simple Random Sampling from Relational Databases | 1986 | VLDB | 0.00070894702 |
| 8,959 | Reservoir Sampling over Joins | 2024 | SIGMOD | 4.4206222e-05 |
| 4,350 | On Biased Reservoir Sampling in the Presence of Stream Evolution | 2006 | VLDB | 6.2645054e-05 |