Streaming Quotient Filter: A Near Optimal Approximate Duplicate Detection Approach for Data Streams
Summary: Proposes Streaming Quotient Filter (SQF), a signature-based data structure with eviction for real-time, memory-efficient duplicate detection on unbounded streams. Offers near-zero FP/FN; Dynamic SQF for evolving streams, with parallel implementation. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Sourav Dutta
- 2. Ankur Narang
- 3. Suman K. Bera
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,446 | Stable Learned Bloom Filters for Data Streams | 2020 | VLDB | 6.1800659e-05 |
| 11,222 | A Learned Cuckoo Filter for Approximate Membership Queries over Variable-sized Sliding Windows on Data Streams | 2023 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 619 | On Computing Correlated Aggregates Over Continual Data Streams | 2001 | SIGMOD | 0.00019066583 |
| 781 | Spectral Bloom Filters | 2003 | SIGMOD | 0.00016741046 |
| 1,248 | Don't Thrash: How to Cache Your Hash on Flash | 2012 | VLDB | 0.00013046661 |
| 2,589 | DogmatiX Tracks down Duplicates in XML | 2005 | SIGMOD | 8.4847146e-05 |
| 3,838 | Approximately Detecting Duplicates for Streaming Data using Stable Bloom Filters | 2006 | SIGMOD | 6.7134945e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,879 | Approximately Counting Triangles in Large Graph Streams Including Edge Duplicates with a Fixed Memory Usage | 2018 | VLDB | 5.8575676e-05 |
| 10,198 | Quantile Estimation with Duplicates | 2026 | SIGMOD | 4.1945683e-05 |
| 2,843 | A General-Purpose Counting Filter: Making Every Bit Count | 2017 | SIGMOD | 8.0257314e-05 |
| 11,833 | Streaming Algorithms for Robust Distinct Elements | 2016 | SIGMOD | 4.1945683e-05 |
| 1,717 | Approximate Join Processing Over Data Streams | 2003 | SIGMOD | 0.00010793312 |
| 4,994 | Stacked Filters: Learning to Filter by Structure | 2021 | VLDB | 5.78027e-05 |
| 11,222 | A Learned Cuckoo Filter for Approximate Membership Queries over Variable-sized Sliding Windows on Data Streams | 2023 | SIGMOD | 4.1945683e-05 |
| 4,446 | Stable Learned Bloom Filters for Data Streams | 2020 | VLDB | 6.1800659e-05 |
| 8,957 | Adaptive Quotient Filters | 2024 | SIGMOD | 4.4211093e-05 |
| 3,838 | Approximately Detecting Duplicates for Streaming Data using Stable Bloom Filters | 2006 | SIGMOD | 6.7134945e-05 |