Randomized Algorithms Accelerated over CPU-GPU for Ultra-High Dimensional Similarity Search

Summary: FLASH is a CPU-GPU accelerated, LSH-style similarity search system for ultra-high dimensional data on a single node. It fuses reservoir sampling, minwise hashing, and count-based estimation with HPC optimizations to reduce computation, delivering a full k-NN computation on the webspam dataset in under 10 seconds. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 5572
Venue: SIGMOD
Year: 2018
Pagerank: 4.7821026e-05
Overall Rank: 7,255 | 49.58%
DOI: 10.1145/3183713.3196925

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Overall Rank	Citing Paper	Year	Venue	Pagerank
4,230	Locality-Sensitive Hashing Scheme based on Longest Circular Co-Substring	2020	SIGMOD	6.3337893e-05
10,103	Query-Aware Path Inference from Spatial Videos	2026	SIGMOD	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Overall Rank	Cited Paper	Year	Venue	Pagerank
78	A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces	1998	VLDB	0.00056385781
399	Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search	2007	VLDB	0.00024359304
579	Locality-Sensitive Hashing Scheme Based on Dynamic Collision Counting	2012	SIGMOD	0.0001982328
2,872	Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing	2013	VLDB	7.9797548e-05
3,689	Database-friendly Random Projections	2001	PODS	6.8363304e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
2,160	PM-LSH: A Fast and Accurate LSH Framework for High-Dimensional Approximate NN Search	2020	VLDB	9.4037759e-05
858	SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index	2015	VLDB	0.00015833075
399	Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search	2007	VLDB	0.00024359304
3,541	Similarity search in the blink of an eye with compressed indices	2023	VLDB	6.9910982e-05
6,388	Similarity Search and Locality Sensitive Hashing using Ternary Content Addressable Memories	2010	SIGMOD	5.079988e-05
3,019	DSH: Data Sensitive Hashing for High-Dimensional k-NN Search	2014	SIGMOD	7.699097e-05
2,872	Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing	2013	VLDB	7.9797548e-05
34	Similarity Search in High Dimensions via Hashing	1999	VLDB	0.00076824554
10,167	FlashANNS: GPU-Driven Asynchronous I/O Pipelining for Eliminating Storage-Compute Bottlenecks in Billion-Scale Similarity Search	2026	SIGMOD	4.1905499e-05
8,430	Accelerating Graph Indexing for ANNS on Modern CPUs	2025	SIGMOD	4.508568e-05