Database Paper Browser

Back to papers

Extensible and Robust Evaluation of Similarity Queries

Summary: Fast, a similarity-join system, introduces "reductions" that map complex domains/distance functions into simpler ones and builds a reduction graph to enumerate extensible query plans. Using runtime partitioning and sampling-based plan selection (no cost models), plus hybrid index-on-the-fly and caching, Fast robustly attains near-optimal performance across datasets and distance functions. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
14007
Venue
VLDB
Year
2025
Pagerank
4.1945683e-05
Overall Rank
10,706 | 25.53%
DOI
10.14778/3749466.3749660

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 30 of 30 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
115 Eddies: Continuously Adaptive Query Processing 2000 SIGMOD 0.00046221215
125 Approximate String Joins in a Database (Almost) for Free 2001 VLDB 0.00044847972
266 Efficient Exact Set-Similarity Joins 2006 VLDB 0.00029718727
853 Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask 2018 VLDB 0.00015940507
1,234 Ed-Join: An Efficient Algorithm for Similarity Joins With Edit Distance Constraints 2008 VLDB 0.00013122499
1,396 Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search 2012 SIGMOD 0.00012204748
2,219 SkinnerDB: Regret-Bounded Query Evaluation via Reinforcement Learning 2019 SIGMOD 9.2623533e-05
2,281 Epsilon Grid Order: An Algorithm for the Similarity Join on Massive High-Dimensional Data 2001 SIGMOD 9.1077704e-05
2,460 Combining Quantitative and Logical Data Cleaning 2016 VLDB 8.7617484e-05
2,592 Pass-Join: A Partition-based Method for Similarity Joins 2012 VLDB 8.4795761e-05
2,740 String Similarity Joins: An Experimental Evaluation 2014 VLDB 8.1980628e-05
2,784 Approximate XML Joins 2002 SIGMOD 8.128931e-05
3,199 Similarity Evaluation on Tree-structured Data 2005 SIGMOD 7.3927291e-05
3,459 An Empirical Evaluation of Set Similarity Join Techniques 2016 VLDB 7.072508e-05
3,490 Leveraging Set Relations in Exact Set Similarity Join 2017 VLDB 7.0465856e-05
3,514 Spatio-Textual Similarity Joins 2013 VLDB 7.0226998e-05
3,774 Efficient Exact Edit Similarity Query Processing with the Asymmetric Signature Scheme 2011 SIGMOD 6.7757301e-05
4,050 An Efficient Partition Based Method for Exact Set Similarity Joins 2016 VLDB 6.4953612e-05
4,353 Overlap Set Similarity Joins with Theoretical Guarantees 2018 SIGMOD 6.263585e-05
4,406 Approximate Matching of Hierarchical Data Using pq-Grams 2005 VLDB 6.2141638e-05
5,469 Learned Cardinality Estimation for Similarity Queries 2021 SIGMOD 5.4898192e-05
5,622 Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach 2020 SIGMOD 5.4060403e-05
6,074 Pigeonring: A Principle for Faster Thresholded Similarity Search 2019 VLDB 5.2242306e-05
6,241 Scaling Similarity Joins over Tree-Structured Data 2015 VLDB 5.1411469e-05
6,605 Dima: A Distributed In-Memory Similarity-Based Query Processing System 2017 VLDB 4.9965703e-05
7,215 SyncSignature: A Simple, Efficient, Parallelizable Framework for Tree Similarity Joins 2023 VLDB 4.7985991e-05
8,511 JEDI: These aren't the JSON documents you're looking for... 2022 SIGMOD 4.495029e-05
9,832 Balance-Aware Distributed String Similarity-Based Query Processing System 2019 VLDB 4.2751057e-05
9,833 SIREN: A Similarity Retrieval Engine for Complex Data 2006 VLDB 4.2751057e-05
11,247 A Two-Level Signature Scheme for Stable Set Similarity Joins 2023 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Semantically Similar Papers