Fast Approximate Similarity Join in Vector Databases
Summary: SimJoin exploits join-window reuse to speed up approximate similarity joins in vector databases, beating per-point range queries. Join-window order optimization, k-similarity support, and a proximity-graph index; experiments show large speedups. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Jiadong Xie
- 2. Jeffrey Xu Yu
- 3. Yingfan Liu
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,130 | MorphingDB: A Task-Centric AI-Native DBMS for Model Management and Inference | 2026 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,109 | Efficient Similarity Join and Search on Multi-Attribute Data | 2015 | SIGMOD | 4.8292998e-05 |
| 3,335 | DeepJoin: Joinable Table Discovery with Pre-trained Language Models | 2023 | VLDB | 7.2065006e-05 |
| 3,490 | Leveraging Set Relations in Exact Set Similarity Join | 2017 | VLDB | 7.0465856e-05 |
| 9,321 | Efficient and Accurate SimRank-based Similarity Joins: Experiments, Analysis, and Improvement | 2024 | VLDB | 4.3556432e-05 |
| 250 | Efficient set joins on similarity predicates | 2004 | SIGMOD | 0.00030661988 |
| 10,930 | Similarity Joins of Sparse Features | 2024 | SIGMOD | 4.1945683e-05 |
| 10,068 | DiskJoin: Large-scale Vector Similarity Join with SSD | 2026 | SIGMOD | 4.1945683e-05 |
| 10,706 | Extensible and Robust Evaluation of Similarity Queries | 2025 | VLDB | 4.1945683e-05 |
| 13,473 | Exploiting Database Similarity Joins for Metric Spaces | 2012 | VLDB | - |
| 4,976 | Efficient Top-K SimRank-based Similarity Join | 2015 | VLDB | 5.7882361e-05 |