Similarity Query Processing for High-Dimensional Data
Summary: Tutorial surveying high-dimensional similarity query processing, bridging DB and ML with embeddings, auto-encoders, and pre-trained models. Reviews exact and approximate methods (cover trees, LSH, product quantization, proximity graphs), ML-driven selectivity estimation, and DB–ML synergy to spur ML-for-DB and DB-for-ML solutions. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Jianbin Qin
- 2. Wei Wang
- 3. Chuan Xiao
- 4. Ying Zhang
Incoming Citations (Sorted by Pagerank)
Showing 13 of 13 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 22 of 22 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,079 | High-Dimensional Index Structures: Database Support for Next Decade's Applications | 1998 | SIGMOD | 4.5914115e-05 |
| 9,351 | On Efficient Approximate Queries over Machine Learning Models | 2023 | VLDB | 4.3524472e-05 |
| 4,609 | A General and Efficient Querying Method for Learning to Hash | 2018 | SIGMOD | 6.0528541e-05 |
| 6,082 | Query-Sensitive Embeddings | 2005 | SIGMOD | 5.2205711e-05 |
| 7,522 | Efficient and Tunable Similar Set Retrieval | 2001 | SIGMOD | 4.7180617e-05 |
| 6,360 | High-Dimensional Vector Similarity Search: From Time Series to Deep Network Embeddings | 2020 | SIGMOD | 5.0961051e-05 |
| 8,899 | Fast Approximate Similarity Join in Vector Databases | 2025 | SIGMOD | 4.427232e-05 |
| 10,843 | Machine Learning for Graph Data Management and Query Processing | 2025 | VLDB | 4.1945683e-05 |
| 34 | Similarity Search in High Dimensions via Hashing | 1999 | VLDB | 0.00076637636 |
| 4,200 | New Trends in High-D Vector Similarity Search: AI-driven, Progressive, and Distributed | 2021 | VLDB | 6.3651489e-05 |