BigVectorBench: Heterogeneous Data Embedding and Compound Queries are Essential in Evaluating Vector Databases
Summary: Shows that heterogeneous-data embedding and compound (multimodal + constrained) queries are essential, yet overlooked, axes that determine vector DB performance. Presents BigVectorBench, with embedding metrics and compound-query abstractions to reveal mainstream systems' bottlenecks. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Guoxin Kang
- 2. Zhongxin Ge
- 3. Jingpei Hu
- 4. Xueya Zhang
- 5. Lei Wang
- 6. Jianfeng Zhan
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,204 | Reveal Hidden Pitfalls and Navigate Next Generation of Vector Similarity Search from Task-Centric Views: [Experiments & Analysis] | 2026 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 212 | Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph | 2019 | VLDB | 0.00033913475 |
| 495 | Milvus: A Purpose-Built Vector Data Management System | 2021 | SIGMOD | 0.00021767688 |
| 1,010 | HD-Index: Pushing the Scalability-Accuracy Boundary for Approximate kNN Search in High-Dimensional Spaces | 2018 | VLDB | 0.00014652858 |
| 1,914 | Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks | 2020 | SIGMOD | 0.00010109102 |
| 2,262 | Manu: A Cloud Native Vector Database Management System | 2022 | VLDB | 9.1624446e-05 |
| 6,389 | Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs | 2024 | VLDB | 5.0844009e-05 |
Previous
Page 1 / 1
Next