Incremental Maintenance of Length Normalized Indexes for Approximate String Matching
Summary: Lazy-update framework for length-normalized indexes in approximate string matching enables incremental updates rather than periodic rebuilds. Guarantees: no false negatives, bounded false positives; prototype demonstrates practicality on real data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,376 | Bed-Tree: An All-Purpose Index Structure for String Similarity Search Based on Edit Distance | 2010 | SIGMOD | 8.9424361e-05 |
| 2,592 | Pass-Join: A Partition-based Method for Similarity Joins | 2012 | VLDB | 8.4795761e-05 |
| 4,216 | Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints | 2010 | VLDB | 6.3521675e-05 |
| 5,073 | Faerie: Efficient Filtering Algorithms for Approximate Dictionary-based Entity Extraction | 2011 | SIGMOD | 5.7177424e-05 |
| 5,887 | Efficient Approximate Search on String Collections (Tutorial) | 2009 | VLDB | 5.2879769e-05 |
| 6,726 | A Pivotal Prefix Based Filtering Algorithm for String Similarity Search | 2014 | SIGMOD | 4.9484027e-05 |
| 7,109 | Efficient Similarity Join and Search on Multi-Attribute Data | 2015 | SIGMOD | 4.8292998e-05 |
| 9,832 | Balance-Aware Distributed String Similarity-Based Query Processing System | 2019 | VLDB | 4.2751057e-05 |
| 11,724 | ZigZag: Supporting Similarity Queries on Vector Space Models | 2018 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7 | Optimal Aggregation Algorithms for Middleware [Extended Abstract] | 2001 | PODS | 0.0015496097 |
| 125 | Approximate String Joins in a Database (Almost) for Free | 2001 | VLDB | 0.00044847972 |
| 250 | Efficient set joins on similarity predicates | 2004 | SIGMOD | 0.00030661988 |
| 266 | Efficient Exact Set-Similarity Joins | 2006 | VLDB | 0.00029718727 |
| 322 | Record Linkage: Similarity Measures and Algorithms | 2006 | SIGMOD | 0.00027518768 |
| 2,193 | Cost-Based Variable-Length-Gram Selection for String Collections to Support Approximate Queries Efficiently | 2008 | SIGMOD | 9.3178557e-05 |
| 3,267 | Benchmarking Declarative Approximate Selection Predicates | 2007 | SIGMOD | 7.3058429e-05 |
| 4,026 | Flexible String Matching Against Large Databases in Practice | 2004 | VLDB | 6.5169976e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,333 | An Efficient Index Structure for String Databases | 2001 | VLDB | 6.2805237e-05 |
| 7,777 | Indexing Mixed Types for Approximate Retrieval | 2005 | VLDB | 4.653704e-05 |
| 1,184 | On Effective Multi-Dimensional Indexing for Strings | 2000 | SIGMOD | 0.00013455208 |
| 9,724 | Text Indexing for Long Patterns: Anchors are All you Need | 2023 | VLDB | 4.295436e-05 |
| 5,663 | Incremental Maintenance of XML Structural Indexes | 2004 | SIGMOD | 5.3832923e-05 |
| 1,154 | Fast Incremental Indexing for Full-Text Information Retrieval | 1994 | VLDB | 0.00013642184 |
| 7,708 | Efficient Top-k Algorithms for Approximate Substring Matching | 2013 | SIGMOD | 4.6721808e-05 |
| 9,322 | Indexing for Keyword Search with Structured Constraints | 2023 | PODS | 4.3556432e-05 |
| 6,732 | An Incrementally Maintainable Index for Approximate Lookups in Hierarchical Data | 2006 | VLDB | 4.9477058e-05 |
| 1,517 | Incremental Updates of Inverted Lists for Text Document Retrieval | 1994 | SIGMOD | 0.00011578859 |