Multi-Dimensional Substring Selectivity Estimation
Summary: Multi-dimensional count-suffix trees for substring selectivity in cross-attribute queries. A probabilistic, space-efficient construction builds pruned trees directly; estimators GNO and MO, with MO leveraging all maximal multi-dimensional substrings to capture inter-dimension correlations and outperforming GNO empirically. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. H. V. Jagadish
- 2. Raymond T. Ng
- 3. Olga Kapitskaia
- 4. Divesh Srivastava
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 325 | The History of Histograms (abridged) | 2003 | VLDB | 0.00027378328 |
| 1,046 | Estimating the Selectivity of XML Path Expressions for Internet Scale Applications | 2001 | VLDB | 0.00014462307 |
| 1,184 | On Effective Multi-Dimensional Indexing for Strings | 2000 | SIGMOD | 0.00013455208 |
| 1,737 | QuickSel: Quick Selectivity Learning with Mixture Models | 2020 | SIGMOD | 0.00010720294 |
| 2,171 | Selectivity Estimation For Boolean Queries | 2000 | PODS | 9.3807165e-05 |
| 4,438 | Selectivity Estimation for Fuzzy String Predicates in Large Data Sets | 2005 | VLDB | 6.1898903e-05 |
| 7,742 | CXHist : An On-line Classification-Based Histogram for XML String Selectivity Estimation | 2005 | VLDB | 4.6628263e-05 |
| 12,648 | Searching on the Secondary Structure of Protein Sequences | 2002 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 0 of 0 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,945 | SSCard: Substring Cardinality Estimation using Suffix Tree-Guided Learned FM-Index | 2026 | SIGMOD | 4.2432653e-05 |
| 5,813 | Space-efficient Substring Occurrence Estimation | 2011 | PODS | 5.3170565e-05 |
| 1,046 | Estimating the Selectivity of XML Path Expressions for Internet Scale Applications | 2001 | VLDB | 0.00014462307 |
| 1,241 | Multi-dimensional Selectivity Estimation Using Compressed Histogram Information | 1999 | SIGMOD | 0.00013097578 |
| 6,097 | Two-dimensional Substring Indexing | 2001 | PODS | 5.2119402e-05 |
| 1,146 | Estimating Alphanumeric Selectivity in the Presence of Wildcards | 1996 | SIGMOD | 0.00013679782 |
| 1,184 | On Effective Multi-Dimensional Indexing for Strings | 2000 | SIGMOD | 0.00013455208 |
| 3,226 | Extending Q-Grams to Estimate Selectivity of String Matching with Low Edit Distance | 2007 | VLDB | 7.3433307e-05 |
| 2,171 | Selectivity Estimation For Boolean Queries | 2000 | PODS | 9.3807165e-05 |
| 1,379 | Substring Selectivity Estimation | 1999 | PODS | 0.00012286879 |