Database Paper Browser

Back to papers

Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data Segment

Summary: Starling: a disk-resident, I/O-efficient HVSS framework for segment-wide vector search. Hybrid layout (in-memory navigation graph + reordered disk graph) and a block I/O strategy reduce disk traffic, enabling 33M 128-D vectors with >0.9 AP and ~1 ms latency, ~44x throughput vs prior methods. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6823
Venue
SIGMOD
Year
2024
Pagerank
8.293714e-05
Overall Rank
2,690 | 81.29%
DOI
10.1145/3639269

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 28 of 28 citing papers.

Rank Citing Paper Year Venue Pagerank
4,598 Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search 2025 SIGMOD 6.0586236e-05
5,569 Revisiting the Index Construction of Proximity Graph-Based Approximate Nearest Neighbor Search 2025 VLDB 5.4290942e-05
7,239 Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search 2025 VLDB 4.792836e-05
7,544 A Topology-Aware Localized Update Strategy for Graph-Based ANN Index 2026 VLDB 4.7149033e-05
7,879 PDX: A Data Layout for Vector Similarity Search 2025 SIGMOD 4.6292417e-05
8,439 Accelerating Graph Indexing for ANNS on Modern CPUs 2025 SIGMOD 4.5128946e-05
9,449 An Interactive Multi-modal Query Answering System with Retrieval-Augmented Large Language Models 2024 VLDB 4.3399593e-05
9,880 CoTra: Towards Efficient and Scalable Distributed Vector Search with RDMA 2026 SIGMOD 4.2643674e-05
10,007 Proximity Graphs for Similarity Search: Fast Construction, Lower Bounds, and Euclidean Separation 2026 PODS 4.1945683e-05
10,035 SWIFT: Enabling Large-Scale Temporal Graph Learning on a Single Machine 2026 SIGMOD 4.1945683e-05
10,042 Accelerating High-Dimensional ANN Search via Skipping Redundant Distance Computations 2026 SIGMOD 4.1945683e-05
10,068 DiskJoin: Large-scale Vector Similarity Join with SSD 2026 SIGMOD 4.1945683e-05
10,071 Dynamically Detect and Fix Hardness for Efficient Approximate Nearest Neighbor Search 2026 SIGMOD 4.1945683e-05
10,086 High-Throughput, Cost-Effective Billion-Scale Vector Search with a Single GPU 2026 SIGMOD 4.1945683e-05
10,111 Scalable Graph Indexing using GPUs for Approximate Nearest Neighbor Search 2026 SIGMOD 4.1945683e-05
10,124 TRIM: Accelerating High-Dimensional Vector Similarity Search with Enhanced Triangle-Inequality-Based Pruning 2026 SIGMOD 4.1945683e-05
10,154 Distribution-Aware Exploration for Adaptive HNSW Search 2026 SIGMOD 4.1945683e-05
10,158 Efficient and Robust Out-Of-Distribution Vector Similarity Search with Cross-Distribution Monotonic Graph 2026 SIGMOD 4.1945683e-05
10,160 Efficient Vector Index Merging in Vector Databases 2026 SIGMOD 4.1945683e-05
10,165 Fast-Convergent Proximity Graphs for Approximate Nearest Neighbor Search 2026 SIGMOD 4.1945683e-05
10,201 RAIRS: Optimizing Redundant Assignment and List Layout for IVF-Based ANN Search 2026 SIGMOD 4.1945683e-05
10,256 I/O Optimizations for Graph-Based Disk-Resident Approximate Nearest Neighbor Search: A Design Space Exploration 2026 VLDB 4.1945683e-05
10,303 Elastic Index Selection for Label-Hybrid AKNN Search 2026 VLDB 4.1945683e-05
10,367 Aster: Enhancing LSM-structures for Scalable Graph Database 2025 SIGMOD 4.1945683e-05
10,654 HAKES: Scalable Vector Database for Embedding Search Service 2025 VLDB 4.1945683e-05
10,703 Fast Graph Vector Search via Hardware Acceleration and Delayed-Synchronization Traversal 2025 VLDB 4.1945683e-05
10,737 Select Edges Wisely: Monotonic Path Aware Graph Layout Optimization for Disk-based ANN Search 2025 VLDB 4.1945683e-05
10,760 Turbocharging Vector Databases using Modern SSDs 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
4 Pregel: A System for Large-Scale Graph Processing 2010 SIGMOD 0.0019005923
212 Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph 2019 VLDB 0.00033913475
495 Milvus: A Purpose-Built Vector Data Management System 2021 SIGMOD 0.00021767688
562 Query-Aware Locality-Sensitive Hashing for Approximate Nearest Neighbor Search 2016 VLDB 0.00020091752
736 AnalyticDB-V: A Hybrid Analytical Engine Towards Query Fusion for Structured and Unstructured Data 2020 VLDB 0.00017447617
770 A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search 2021 VLDB 0.00016917602
1,269 Cache locality is not enough: High-Performance Nearest Neighbor Search with Product Quantization Fast Scan 2016 VLDB 0.00012930432
1,636 PASE: PostgreSQL Ultra-High-Dimensional Approximate Nearest Neighbor Search Extension 2020 SIGMOD 0.00011053863
1,676 Speedup Graph Processing by Graph Ordering 2016 SIGMOD 0.00010946423
1,757 VHP: Approximate Nearest Neighbor Search via Virtual Hypersphere Partitioning 2020 VLDB 0.00010660932
2,262 Manu: A Cloud Native Vector Database Management System 2022 VLDB 9.1624446e-05
2,435 iDEC: Indexable Distance Estimating Codes for Approximate Nearest Neighbor Search 2020 VLDB 8.8252237e-05
2,494 Streaming Graph Partitioning: An Experimental Study 2018 VLDB 8.6508229e-05
2,725 HVS: Hierarchical Graph Structure Based on Voronoi Diagrams for Solving Approximate Nearest Neighbor Search 2022 VLDB 8.2294908e-05
3,839 Experimental Analysis of Streaming Algorithms for Graph Partitioning 2019 SIGMOD 6.7120651e-05
5,551 LANNS: A Web-Scale Approximate Nearest Neighbor Lookup System 2022 VLDB 5.4421769e-05
7,363 An I/O-Efficient Disk-based Graph System for Scalable Second-Order Random Walk of Large Graphs 2022 VLDB 4.7523184e-05
7,832 LIDER: An Efficient High-dimensional Learned Index for Large-scale Dense Passage Retrieval 2023 VLDB 4.6387029e-05
Previous Page 1 / 1 Next

Semantically Similar Papers