Database Paper Browser

Back to papers

Cost-Based Variable-Length-Gram Selection for String Collections to Support Approximate Queries Efficiently

Summary: Cost-based selection of variable-length grams for approximate string queries with VGRAM-style indexing. Dynamic programming yields tight lower bounds on shared grams and enables automatic gram discovery for workloads, linking gram choice to index structure and performance. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4000
Venue
SIGMOD
Year
2008
Pagerank
9.3178557e-05
Overall Rank
2,193 | 84.75%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 17 of 17 citing papers.

Rank Citing Paper Year Venue Pagerank
1,944 WHAM: A High-throughput Sequence Alignment Method 2011 SIGMOD 0.00010004608
3,578 Efficient Approximate Entity Extraction with Edit Distance Constraints 2009 SIGMOD 6.9503858e-05
3,774 Efficient Exact Edit Similarity Query Processing with the Asymmetric Signature Scheme 2011 SIGMOD 6.7757301e-05
4,359 Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning 2021 VLDB 6.2569955e-05
4,988 Incremental Maintenance of Length Normalized Indexes for Approximate String Matching 2009 SIGMOD 5.783959e-05
5,291 Fast Subtrajectory Similarity Search in Road Networks under Weighted Edit Distance Constraints 2020 VLDB 5.5826473e-05
5,812 Reference-Based Alignment in Large Sequence Databases 2009 VLDB 5.3172025e-05
5,887 Efficient Approximate Search on String Collections (Tutorial) 2009 VLDB 5.2879769e-05
6,074 Pigeonring: A Principle for Faster Thresholded Similarity Search 2019 VLDB 5.2242306e-05
6,983 A Generic Framework for Efficient and Effective Subsequence Retrieval 2012 VLDB 4.8732757e-05
9,439 On-the-Fly Token Similarity Joins in Relational Databases 2014 SIGMOD 4.3423824e-05
9,932 Local Filtering: Improving the Performance of Approximate Queries on String Collections 2015 SIGMOD 4.2500258e-05
9,933 Efficient and Effective KNN Sequence Search with Approximate n-grams 2014 VLDB 4.2500258e-05
10,216 The Case For Language Model Approximated LIKE Predicate 2026 SIGMOD 4.1945683e-05
11,724 ZigZag: Supporting Similarity Queries on Vector Space Models 2018 SIGMOD 4.1945683e-05
12,049 LinkIT: Privacy Preserving Record Linkage and Integration via Transformations 2013 SIGMOD 4.1945683e-05
12,247 SimDB: A Similarity-aware Database System 2010 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers