Database Paper Browser

Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search

Summary: Introduces multi-probe LSH, which reduces the number of hash tables needed for high-dimensional similarity search by probing multiple likely buckets in each table. It achieves query time similar to basic LSH with 5-8x fewer tables, and it improves on entropy-based LSH in both space and time. (summarized by gpt-5-nano on Feb 09 2026)
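
The probing idea in the summary can be illustrated with a minimal Python sketch. This is not the paper's implementation. It assumes E2LSH-style keys of the form floor((a.v + b) / w) and a simplified probing rule: besides the query's home bucket, it visits buckets whose key differs by +1 or -1 in a single coordinate, trying first the coordinates where the query lies closest to a slot boundary. The class name MultiProbeLSHTable and the parameters num_hashes, width, seed, and num_probes are hypothetical names chosen for illustration.

```python
import math
import random
from collections import defaultdict


class MultiProbeLSHTable:
    """One hash table with E2LSH-style keys and single-step probing (illustrative)."""

    def __init__(self, dim, num_hashes=4, width=4.0, seed=0):
        rng = random.Random(seed)
        self.width = width
        # One Gaussian projection vector and one random offset per hash function.
        self.a = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
                  for _ in range(num_hashes)]
        self.b = [rng.uniform(0.0, width) for _ in range(num_hashes)]
        self.buckets = defaultdict(list)

    def _projections(self, v):
        # Real-valued slot positions; their floors form the bucket key.
        return [(sum(x * y for x, y in zip(a, v)) + b) / self.width
                for a, b in zip(self.a, self.b)]

    def key(self, v):
        return tuple(math.floor(p) for p in self._projections(v))

    def insert(self, item_id, v):
        self.buckets[self.key(v)].append(item_id)

    def probe_keys(self, q, num_probes):
        """Return the home bucket key followed by the nearest neighboring keys."""
        proj = self._projections(q)
        home = tuple(math.floor(p) for p in proj)
        # Score each single-coordinate step by the query's distance to that
        # slot boundary; a smaller distance means a more promising bucket.
        candidates = []
        for i, p in enumerate(proj):
            frac = p - home[i]
            candidates.append((frac, i, -1))       # step to the lower slot
            candidates.append((1.0 - frac, i, 1))  # step to the upper slot
        candidates.sort()
        keys = [home]
        for _, i, delta in candidates[:max(0, num_probes - 1)]:
            k = list(home)
            k[i] += delta
            keys.append(tuple(k))
        return keys

    def query(self, q, num_probes=3):
        """Collect candidate ids from the home bucket and its probed neighbors."""
        found = []
        for k in self.probe_keys(q, num_probes):
            found.extend(self.buckets.get(k, []))
        return found
```

In this sketch, raising num_probes plays the role that adding hash tables plays in basic LSH: each table yields more candidates per query, so fewer tables are needed for comparable recall. Candidates returned by query would still need to be ranked by their true distance to the query point.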

Paper ID: 9644
Venue: VLDB
Year: 2007
Pagerank: 0.0002427237
Overall Rank: 400 | 97.22%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 45 of 45 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
605 | Locality-Sensitive Hashing Scheme Based on Dynamic Collision Counting | 2012 | SIGMOD | 0.000193396
682 | Quality and Efficiency in High Dimensional Nearest Neighbor Search | 2009 | SIGMOD | 0.00018201541
754 | Distributed Representations of Tuples for Entity Resolution | 2018 | VLDB | 0.00017117211
867 | SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index | 2015 | VLDB | 0.00015792021
1,010 | HD-Index: Pushing the Scalability-Accuracy Boundary for Approximate kNN Search in High-Dimensional Spaces | 2018 | VLDB | 0.00014652858
1,229 | SK-LSH: An Efficient Index Structure for Approximate Nearest Neighbor Search | 2014 | VLDB | 0.00013157271
1,305 | Bayesian Locality Sensitive Hashing for Fast Similarity Search | 2012 | VLDB | 0.00012687101
1,757 | VHP: Approximate Nearest Neighbor Search via Virtual Hypersphere Partitioning | 2020 | VLDB | 0.00010660932
1,971 | LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index | 2016 | SIGMOD | 9.893198e-05
2,181 | PM-LSH: A Fast and Accurate LSH Framework for High-Dimensional Approximate NN Search | 2020 | VLDB | 9.3451821e-05
2,435 | iDEC: Indexable Distance Estimating Codes for Approximate Nearest Neighbor Search | 2020 | VLDB | 8.8252237e-05
2,641 | Locality-Sensitive Hashing for Earthquake Detection: A Case Study of Scaling Data-Driven Science | 2018 | VLDB | 8.3905374e-05
2,681 | NET-FLi: On-the-fly Compression, Archiving and Indexing of Streaming Network Traffic | 2010 | VLDB | 8.3232427e-05
3,056 | DSH: Data Sensitive Hashing for High-Dimensional k-NN Search | 2014 | SIGMOD | 7.6432146e-05
3,225 | DeltaPQ: Lossless Product Quantization Code Compression for High Dimensional Similarity Search | 2020 | VLDB | 7.3463484e-05
3,510 | Inter-Media Hashing for Large-scale Retrieval from Heterogeneous Data Sources | 2013 | SIGMOD | 7.0258619e-05
3,624 | SeRF: Segment Graph for Range-Filtering Approximate Nearest Neighbor Search | 2024 | SIGMOD | 6.9056e-05
3,938 | Intelligent Probing for Locality Sensitive Hashing: Multi-Probe LSH and Beyond | 2017 | VLDB | 6.6155909e-05
4,243 | Locality-Sensitive Hashing Scheme based on Longest Circular Co-Substring | 2020 | SIGMOD | 6.32976e-05
4,609 | A General and Efficient Querying Method for Learning to Hash | 2018 | SIGMOD | 6.0528541e-05
4,862 | Vexless: A Serverless Vector Data Management System Using Cloud Functions | 2024 | SIGMOD | 5.8707776e-05
5,200 | SetSketch: Filling the Gap between MinHash and HyperLogLog | 2021 | VLDB | 5.6337581e-05
5,224 | Neighbor-Sensitive Hashing | 2016 | VLDB | 5.6197981e-05
5,469 | Learned Cardinality Estimation for Similarity Queries | 2021 | SIGMOD | 5.4898192e-05
5,569 | Revisiting the Index Construction of Proximity Graph-Based Approximate Nearest Neighbor Search | 2025 | VLDB | 5.4290942e-05
5,707 | FARGO: Fast Maximum Inner Product Search via Global Multi-Probing | 2023 | VLDB | 5.3611041e-05
5,996 | A New Sparse Data Clustering Method Based On Frequent Items | 2023 | SIGMOD | 5.2415551e-05
6,074 | Pigeonring: A Principle for Faster Thresholded Similarity Search | 2019 | VLDB | 5.2242306e-05
6,107 | Continuously Adaptive Similarity Search | 2020 | SIGMOD | 5.2066612e-05
6,399 | Similarity Search and Locality Sensitive Hashing using Ternary Content Addressable Memories | 2010 | SIGMOD | 5.0818596e-05
7,204 | ARKGraph: All-Range Approximate K-Nearest-Neighbor Graph | 2023 | VLDB | 4.8015761e-05
7,301 | Randomized Algorithms Accelerated over CPU-GPU for Ultra-High Dimensional Similarity Search | 2018 | SIGMOD | 4.768971e-05
7,700 | Near-Duplicate Text Alignment with One Permutation Hashing | 2024 | SIGMOD | 4.6744372e-05
7,832 | LIDER: An Efficient High-dimensional Learned Index for Large-scale Dense Passage Retrieval | 2023 | VLDB | 4.6387029e-05
8,656 | Dynamic Range-Filtering Approximate Nearest Neighbor Search | 2025 | VLDB | 4.4737647e-05
8,712 | ANN Softmax: Acceleration of Extreme Classification Training | 2022 | VLDB | 4.4626362e-05
8,763 | Smooth Tradeoffs between Insert and Query Complexity in Nearest Neighbor Search | 2015 | PODS | 4.456315e-05
10,201 | RAIRS: Optimizing Redundant Assignment and List Layout for IVF-Based ANN Search | 2026 | SIGMOD | 4.1945683e-05
10,260 | JHQ: Johnson-Lindenstrauss Enhanced Hierarchical Quantization for High-Dimensional Approximate Nearest Neighbor Search | 2026 | VLDB | 4.1945683e-05
10,279 | ConANN: Conformal Approximate Nearest Neighbor Search | 2026 | VLDB | 4.1945683e-05
10,566 | Unleashing Graph Partitioning for Large-Scale Nearest Neighbor Search | 2025 | VLDB | 4.1945683e-05
11,412 | ONe Index for All Kernels (ONIAK): A Zero Re-Indexing LSH Solution to ANNS-ALT (After Linear Transformation) | 2022 | VLDB | 4.1945683e-05
11,535 | MP-RW-LSH: An Efficient Multi-Probe LSH Solution to ANNS-L1 | 2021 | VLDB | 4.1945683e-05
11,655 | Top-k Queries over Digital Traces | 2019 | SIGMOD | 4.1945683e-05
12,176 | Effective Data Co-Reduction for Multimedia Similarity Search | 2011 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.


Semantically Similar Papers