Space-efficient Substring Occurrence Estimation
Summary: Presents space-optimal substring-occurrence estimators: for additive error l, an index of Θ(|T| log σ / l) bits answers Count≈_l(P) queries, enabling compact selectivity estimation via compressed text indexing. Also gives a frequency-aware Count≥_l structure that is exact for patterns occurring at least l times, with space that scales with the number of frequent patterns. (summarized by gpt-5-mini on Feb 09 2026)
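To make the two query guarantees in the summary concrete, here is a minimal Python sketch of the contracts only, not of the paper's index. The function names are hypothetical. The tolerance `abs(estimate - true_count) <= l` is one assumed reading of "additive error l", and the paper's precise definition may be tighter. The behaviour of Count≥_l below the threshold (any value smaller than l) is also an assumption. The space helper drops the constant hidden by the Θ notation.

```python
import math


def count_occurrences(text: str, pattern: str) -> int:
    """Exact number of (possibly overlapping) occurrences of pattern in text."""
    if not pattern:
        raise ValueError("pattern must be non-empty")
    count = 0
    start = text.find(pattern)
    while start != -1:
        count += 1
        # Advance by one position so overlapping matches are counted too.
        start = text.find(pattern, start + 1)
    return count


def satisfies_additive_error(estimate: int, true_count: int, l: int) -> bool:
    """Assumed contract for Count≈_l: the estimate is within l of the truth."""
    return abs(estimate - true_count) <= l


def satisfies_threshold_exactness(estimate: int, true_count: int, l: int) -> bool:
    """Assumed contract for Count≥_l: exact when the pattern occurs at least
    l times; otherwise any answer below l is accepted."""
    if true_count >= l:
        return estimate == true_count
    return 0 <= estimate < l


def space_bits_order(text_len: int, sigma: int, l: int) -> float:
    """|T| * log2(sigma) / l, ignoring the constant hidden by Theta."""
    return text_len * math.log2(sigma) / l
```

For example, with |T| = 1024 symbols over an alphabet of size 256 and l = 8, `space_bits_order(1024, 256, 8)` evaluates to 1024.0, an eightfold reduction from the 8192 bits of the raw text, up to the hidden constant.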
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,496 | Dynamic Data Structures for Document Collections and Graphs | 2015 | PODS | 4.4981899e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,146 | Estimating Alphanumeric Selectivity in the Presence of Wildcards | 1996 | SIGMOD | 0.00013679782 |
| 1,379 | Substring Selectivity Estimation | 1999 | PODS | 0.00012286879 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,379 | Substring Selectivity Estimation | 1999 | PODS | 0.00012286879 |
| 6,097 | Two-dimensional Substring Indexing | 2001 | PODS | 5.2119402e-05 |
| 1,146 | Estimating Alphanumeric Selectivity in the Presence of Wildcards | 1996 | SIGMOD | 0.00013679782 |
| 2,171 | Selectivity Estimation For Boolean Queries | 2000 | PODS | 9.3807165e-05 |
| 7,708 | Efficient Top-k Algorithms for Approximate Substring Matching | 2013 | SIGMOD | 4.6721808e-05 |
| 10,346 | Differentially Private Substring and Document Counting | 2025 | PODS | 4.1945683e-05 |
| 12,295 | Secondary Indexing in One Dimension: Beyond B-trees and Bitmap Indexes | 2009 | PODS | 4.1945683e-05 |
| 9,724 | Text Indexing for Long Patterns: Anchors are All you Need | 2023 | VLDB | 4.295436e-05 |
| 9,945 | SSCard: Substring Cardinality Estimation using Suffix Tree-Guided Learned FM-Index | 2026 | SIGMOD | 4.2432653e-05 |
| 4,333 | An Efficient Index Structure for String Databases | 2001 | VLDB | 6.2805237e-05 |