Back to papers
Bayesian Locality Sensitive Hashing for Fast Similarity Search
Summary: BayesLSH prunes false positives and refines similarity after LSH. BayesLSH-Lite computes exact similarities; offers probabilistic accuracy/recall guarantees, tunable output without fixed hash counts, and 2x-20x speedups versus AllPairs/LSH.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 10490
- Venue
- VLDB
- Year
- 2012
- Pagerank
- 0.00012687101
- Overall Rank
- 1,305 | 90.93%
- DOI
-
-
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 27 of 27 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 867 |
SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index |
2015 |
VLDB |
0.00015792021 |
| 1,971 |
LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index |
2016 |
SIGMOD |
9.893198e-05 |
| 2,181 |
PM-LSH: A Fast and Accurate LSH Framework for High-Dimensional Approximate NN Search |
2020 |
VLDB |
9.3451821e-05 |
| 2,740 |
String Similarity Joins: An Experimental Evaluation |
2014 |
VLDB |
8.1980628e-05 |
| 2,811 |
High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations |
2023 |
SIGMOD |
8.0806307e-05 |
| 3,056 |
DSH: Data Sensitive Hashing for High-Dimensional k-NN Search |
2014 |
SIGMOD |
7.6432146e-05 |
| 3,141 |
ClusterJoin: A Similarity Joins Framework using Map-Reduce |
2014 |
VLDB |
7.4829448e-05 |
| 3,459 |
An Empirical Evaluation of Set Similarity Join Techniques |
2016 |
VLDB |
7.072508e-05 |
| 3,490 |
Leveraging Set Relations in Exact Set Similarity Join |
2017 |
VLDB |
7.0465856e-05 |
| 4,050 |
An Efficient Partition Based Method for Exact Set Similarity Joins |
2016 |
VLDB |
6.4953612e-05 |
| 4,250 |
Local Similarity Search for Unstructured Text |
2016 |
SIGMOD |
6.3241139e-05 |
| 4,353 |
Overlap Set Similarity Joins with Theoretical Guarantees |
2018 |
SIGMOD |
6.263585e-05 |
| 4,401 |
LEMP: Fast Retrieval of Large Entries in a Matrix Product |
2015 |
SIGMOD |
6.2211271e-05 |
| 4,808 |
On the Complexity of Inner Product Similarity Join |
2016 |
PODS |
5.908896e-05 |
| 5,179 |
SilkMoth: An Efficient Method for Finding Related Sets with Maximum Matching Constraints |
2017 |
VLDB |
5.6428428e-05 |
| 5,224 |
Neighbor-Sensitive Hashing |
2016 |
VLDB |
5.6197981e-05 |
| 6,074 |
Pigeonring: A Principle for Faster Thresholded Similarity Search |
2019 |
VLDB |
5.2242306e-05 |
| 7,635 |
Allign: Aligning All-Pair Near-Duplicate Passages in Long Texts |
2021 |
SIGMOD |
4.6908858e-05 |
| 7,668 |
Human-in-the-loop Data Integration |
2017 |
VLDB |
4.6834075e-05 |
| 8,755 |
Multivariate Correlations Discovery in Static and Streaming Data |
2022 |
VLDB |
4.456315e-05 |
| 8,997 |
Chasing Similarity: Distribution-aware Aggregation Scheduling |
2019 |
VLDB |
4.4120041e-05 |
| 9,832 |
Balance-Aware Distributed String Similarity-Based Query Processing System |
2019 |
VLDB |
4.2751057e-05 |
| 11,305 |
TokenJoin: Efficient Filtering for Set Similarity Join with Maximum Weighted Bipartite Matching |
2023 |
VLDB |
4.1945683e-05 |
| 11,504 |
LES3: Learning-based Exact Set Similarity Search |
2021 |
VLDB |
4.1945683e-05 |
| 11,535 |
MP-RW-LSH: An Efficient Multi-Probe LSH Solution to ANNS-L1 |
2021 |
VLDB |
4.1945683e-05 |
| 11,655 |
Top-k Queries over Digital Traces |
2019 |
SIGMOD |
4.1945683e-05 |
| 12,075 |
PLASMA-HD: Probing the LAttice Structure and MAkeup of High-dimensional Data |
2013 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers