String Similarity Measures and Joins with Synonyms
Summary: Expansion-based string similarity with synonyms; NP-hardness tackled by selective-expansion, practical optimality. SI-tree index for joins; signature- and length-filtering with an online estimator enabling low-space, low-error candidate sizing, validated. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Jiaheng Lu
- 2. Chunbin Lin
- 3. Wei Wang
- 4. Chen Li
- 5. Haiyong Wang
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,320 | Schemaless and Structureless Graph Querying | 2014 | VLDB | 7.2249102e-05 |
| 4,684 | Approximate String Joins with Abbreviations | 2018 | VLDB | 6.0006406e-05 |
| 4,808 | On the Complexity of Inner Product Similarity Join | 2016 | PODS | 5.908896e-05 |
| 9,563 | Towards a Unified Framework for String Similarity Joins | 2019 | VLDB | 4.3254416e-05 |
| 10,983 | A Universal Sketch for Estimating Heavy Hitters and Per-Element Frequency Moments in Data Streams with Bounded Deletions | 2024 | SIGMOD | 4.1945683e-05 |
| 11,087 | Dealing with Acronyms, Abbreviations, and Typos in Real-World Entity Matching | 2024 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,708 | Efficient Top-k Algorithms for Approximate Substring Matching | 2013 | SIGMOD | 4.6721808e-05 |
| 3,199 | Similarity Evaluation on Tree-structured Data | 2005 | SIGMOD | 7.3927291e-05 |
| 7,669 | Incorporating String Transformations in Record Matching | 2008 | SIGMOD | 4.6833751e-05 |
| 3,451 | Learning String Transformations From Examples | 2009 | VLDB | 7.0822216e-05 |
| 4,684 | Approximate String Joins with Abbreviations | 2018 | VLDB | 6.0006406e-05 |
| 6,241 | Scaling Similarity Joins over Tree-Structured Data | 2015 | VLDB | 5.1411469e-05 |
| 2,740 | String Similarity Joins: An Experimental Evaluation | 2014 | VLDB | 8.1980628e-05 |
| 11,979 | Similarity Joins for Uncertain Strings | 2014 | SIGMOD | 4.1945683e-05 |
| 4,216 | Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints | 2010 | VLDB | 6.3521675e-05 |
| 9,563 | Towards a Unified Framework for String Similarity Joins | 2019 | VLDB | 4.3254416e-05 |