Database Paper Browser

Back to papers

Flexible String Matching Against Large Databases in Practice

Summary: Extends tf-idf-based fuzzy string matching to multi-attribute queries and known semantic equivalences in large databases. Reports practical performance optimizations, including accuracy-speed trade-offs, demonstrated on real AT&T datasets. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
9126
Venue
VLDB
Year
2004
Pagerank
6.5169976e-05
Overall Rank
4,026 | 72.00%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 1 of 1 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
155 Robust and Efficient Fuzzy Match for Online Data Cleaning 2003 SIGMOD 0.00040637896
Previous Page 1 / 1 Next

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
13,612 Using SPIDER: An Experience Report 2006 SIGMOD -
9,563 Towards a Unified Framework for String Similarity Joins 2019 VLDB 4.3254416e-05
3,529 Merging the Results of Approximate Match Operations 2004 VLDB 7.0059524e-05
125 Approximate String Joins in a Database (Almost) for Free 2001 VLDB 0.00044847972
4,435 Sampling Dirty Data for Matching Attributes 2010 SIGMOD 6.1918164e-05
11,979 Similarity Joins for Uncertain Strings 2014 SIGMOD 4.1945683e-05
4,901 Probabilistic String Similarity Joins 2010 SIGMOD 5.8411648e-05
7,669 Incorporating String Transformations in Record Matching 2008 SIGMOD 4.6833751e-05
2,740 String Similarity Joins: An Experimental Evaluation 2014 VLDB 8.1980628e-05
155 Robust and Efficient Fuzzy Match for Online Data Cleaning 2003 SIGMOD 0.00040637896