On Indexing Error-Tolerant Set Containment
Summary: Indexing asymmetric Jaccard containment with error tolerance and synonym-aware string transformations. Proposes an inverted-list index on token-sets and a size-aware lookup for containment queries; first study of Jaccard containment indexing under string transformations. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Parag Agrawal
- 2. Arvind Arasu
- 3. Raghav Kaushik
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,141 | LSH Ensemble: Internet-Scale Domain Search | 2016 | VLDB | 9.4542625e-05 |
| 2,730 | Open Data Integration | 2018 | VLDB | 8.2126735e-05 |
| 4,250 | Local Similarity Search for Unstructured Text | 2016 | SIGMOD | 6.3241139e-05 |
| 5,179 | SilkMoth: An Efficient Method for Finding Related Sets with Maximum Matching Constraints | 2017 | VLDB | 5.6428428e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,774 | Efficient Exact Edit Similarity Query Processing with the Asymmetric Signature Scheme | 2011 | SIGMOD | 6.7757301e-05 |
| 1,345 | Entity Matching: How Similar Is Similar | 2011 | VLDB | 0.00012468408 |
| 11,979 | Similarity Joins for Uncertain Strings | 2014 | SIGMOD | 4.1945683e-05 |
| 3,490 | Leveraging Set Relations in Exact Set Similarity Join | 2017 | VLDB | 7.0465856e-05 |
| 7,708 | Efficient Top-k Algorithms for Approximate Substring Matching | 2013 | SIGMOD | 4.6721808e-05 |
| 4,988 | Incremental Maintenance of Length Normalized Indexes for Approximate String Matching | 2009 | SIGMOD | 5.783959e-05 |
| 5,151 | String Similarity Measures and Joins with Synonyms | 2013 | SIGMOD | 5.6609851e-05 |
| 250 | Efficient set joins on similarity predicates | 2004 | SIGMOD | 0.00030661988 |
| 7,669 | Incorporating String Transformations in Record Matching | 2008 | SIGMOD | 4.6833751e-05 |
| 7,522 | Efficient and Tunable Similar Set Retrieval | 2001 | SIGMOD | 4.7180617e-05 |