An Optimal Algorithm for the Distinct Elements Problem
Summary: Presents the first algorithm to achieve the information-theoretically optimal space bound of O(ε^-2 + log n) bits for (1±ε)-approximation of the number of distinct elements in a data stream, closing a long line of research on the problem. The algorithm also attains worst-case O(1) update and reporting time and extends to Hamming-norm estimation. (summarized by gpt-5-mini on Feb 09 2026)
Authors
1. Daniel M. Kane
2. Jelani Nelson
3. David P. Woodruff
Incoming Citations (Sorted by Pagerank)
Showing 27 of 27 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1 | Access Path Selection in a Relational Database Management System | 1979 | SIGMOD | 0.0040449103 |
| 308 | Distinct Sampling for Highly-Accurate Answers to Distinct Values Queries and Event Reports | 2001 | VLDB | 0.00028142852 |
| 429 | The Aqua Approximate Query Answering System | 1999 | SIGMOD | 0.00023476494 |
| 475 | Mining Database Structure; Or, How to Build a Data Quality Browser | 2002 | SIGMOD | 0.00022303253 |
| 593 | Storage Estimation for Multidimensional Aggregates in the Presence of Hierarchies | 1996 | VLDB | 0.00019536993 |
| 727 | On Synopses for Distinct-Value Estimation Under Multiset Operations | 2007 | SIGMOD | 0.00017508726 |
| 2,045 | Multi-Dimensional Clustering: A New Data Layout Scheme in DB2 | 2003 | SIGMOD | 9.6939983e-05 |
| 3,050 | Comparing Data Streams Using Hamming Norms (How to Zero In) | 2002 | VLDB | 7.6512619e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,145 | Tight Trade-offs for the Maximum k-Coverage Problem in the General Streaming Model | 2019 | PODS | 4.8179617e-05 |
| 3,102 | Processing Set Expressions over Continuous Update Streams | 2003 | SIGMOD | 7.5586568e-05 |
| 12,475 | A Simple and Efficient Estimation Method for Stream Expression Cardinalities | 2007 | VLDB | 4.1945683e-05 |
| 6,418 | An Optimal Algorithm for l1-Heavy Hitters in Insertion Streams and Related Problems | 2016 | PODS | 5.0696932e-05 |
| 10,901 | Streaming Algorithms with Few State Changes | 2024 | PODS | 4.1945683e-05 |
| 835 | Finding Frequent Items in Data Streams | 2008 | VLDB | 0.00016109621 |
| 3,050 | Comparing Data Streams Using Hamming Norms (How to Zero In) | 2002 | VLDB | 7.6512619e-05 |
| 12,531 | Join-Distinct Aggregate Estimation over Update Streams | 2005 | PODS | 4.1945683e-05 |
| 11,440 | Frequent Elements with Witnesses in Data Streams | 2021 | PODS | 4.1945683e-05 |
| 11,833 | Streaming Algorithms for Robust Distinct Elements | 2016 | SIGMOD | 4.1945683e-05 |