Database Paper Browser

Back to papers

Substring Selectivity Estimation

Summary: Introduces MO (Maximal Overlap), a substring selectivity estimator using pruned count-suffix trees that leverages all maximal query substrings to produce better estimates. Proves MO dominates KVI under short‑memory strings and gives MOC/MOLC algs trading accuracy vs cost. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
1179
Venue
PODS
Year
1999
Pagerank
0.00012286879
Overall Rank
1,379 | 90.41%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 17 of 17 citing papers.

Rank Citing Paper Year Venue Pagerank
325 The History of Histograms (abridged) 2003 VLDB 0.00027378328
1,046 Estimating the Selectivity of XML Path Expressions for Internet Scale Applications 2001 VLDB 0.00014462307
1,202 VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams 2007 VLDB 0.00013326298
1,737 QuickSel: Quick Selectivity Learning with Mixture Models 2020 SIGMOD 0.00010720294
2,171 Selectivity Estimation For Boolean Queries 2000 PODS 9.3807165e-05
2,193 Cost-Based Variable-Length-Gram Selection for String Collections to Support Approximate Queries Efficiently 2008 SIGMOD 9.3178557e-05
2,232 Effective Phrase Prediction 2007 VLDB 9.2293508e-05
3,226 Extending Q-Grams to Estimate Selectivity of String Matching with Low Edit Distance 2007 VLDB 7.3433307e-05
4,359 Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning 2021 VLDB 6.2569955e-05
4,660 XPathLearner: An On-Line Self-Tuning Markov Histogram for XML Path Selectivity Estimation 2002 VLDB 6.014625e-05
5,813 Space-efficient Substring Occurrence Estimation 2011 PODS 5.3170565e-05
7,186 LPLM: A Neural Language Model for Cardinality Estimation of LIKE-Queries 2024 SIGMOD 4.8063731e-05
7,474 Cardinality Estimation of Approximate Substring Queries using Deep Learning 2022 VLDB 4.7194345e-05
7,742 CXHist : An On-line Classification-Based Histogram for XML String Selectivity Estimation 2005 VLDB 4.6628263e-05
8,496 Dynamic Data Structures for Document Collections and Graphs 2015 PODS 4.4981899e-05
9,726 Cardinality Estimation of LIKE Predicate Queries using Deep Learning 2025 SIGMOD 4.2943379e-05
9,945 SSCard: Substring Cardinality Estimation using Suffix Tree-Guided Learned FM-Index 2026 SIGMOD 4.2432653e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers