Merging the Results of Approximate Match Operations
Summary: Adapts footrule distance to merge multiple ranked approximate-match lists across query attributes into a single top-K result. Presents two novel, declarative algorithms for single-attribute approximate matching with SQL specs and validates practicality via real data and a commercial DB. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Sudipto Guha
- 2. Nick Koudas
- 3. Amit Marathe
- 4. Divesh Srivastava
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 702 | Reasoning about Record Matching Rules | 2009 | VLDB | 0.00017918203 |
| 1,262 | RankSQL: Query Algebra and Optimization for Relational Top-k Queries | 2005 | SIGMOD | 0.00012986539 |
| 1,533 | Example-driven Design of Efficient Record Matching Queries | 2007 | VLDB | 0.00011471971 |
| 7,190 | Database Support for Matching: Limitations and Opportunities | 2006 | SIGMOD | 4.8051876e-05 |
| 7,256 | Effective and Efficient Retrieval of Structured Entities | 2020 | VLDB | 4.7869419e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7 | Optimal Aggregation Algorithms for Middleware [Extended Abstract] | 2001 | PODS | 0.0015496097 |
| 67 | The Merge/Purge Problem for Large Databases | 1995 | SIGMOD | 0.00061348205 |
| 125 | Approximate String Joins in a Database (Almost) for Free | 2001 | VLDB | 0.00044847972 |
| 150 | Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity | 1998 | SIGMOD | 0.00041055843 |
| 155 | Robust and Efficient Fuzzy Match for Online Data Cleaning | 2003 | SIGMOD | 0.00040637896 |
| 637 | Automatic segmentation of text into structured records | 2001 | SIGMOD | 0.00018824614 |
| 709 | Efficient Similarity Search and Classification via Rank Aggregation | 2003 | SIGMOD | 0.00017768547 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,026 | Flexible String Matching Against Large Databases in Practice | 2004 | VLDB | 6.5169976e-05 |
| 10,924 | Improved Approximation Algorithms for Relational Clustering | 2024 | PODS | 4.1945683e-05 |
| 805 | Evaluating Top-k Selection Queries | 1999 | VLDB | 0.00016437265 |
| 12,111 | Optimal Top-k Generation of Attribute Combinations based on Ranked Lists | 2012 | SIGMOD | 4.1945683e-05 |
| 7,777 | Indexing Mixed Types for Approximate Retrieval | 2005 | VLDB | 4.653704e-05 |
| 9,430 | Approximate Joins: Concepts and Techniques | 2005 | VLDB | 4.3441378e-05 |
| 322 | Record Linkage: Similarity Measures and Algorithms | 2006 | SIGMOD | 0.00027518768 |
| 125 | Approximate String Joins in a Database (Almost) for Free | 2001 | VLDB | 0.00044847972 |
| 155 | Robust and Efficient Fuzzy Match for Online Data Cleaning | 2003 | SIGMOD | 0.00040637896 |
| 1,533 | Example-driven Design of Efficient Record Matching Queries | 2007 | VLDB | 0.00011471971 |