Back to papers
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Summary: Chameleon: a disaggregated heterogeneous accelerator architecture pairing FPGA vector-search accelerators with GPU LLM inference and CPU coordinators to independently scale retrieval and inference. Prototype yields up to 2.16× latency reduction and 3.18× throughput speedup vs CPU–GPU baselines.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 14038
- Venue
- VLDB
- Year
- 2025
- Pagerank
- 4.5676289e-05
- Overall Rank
- 8,175 | 43.13%
- DOI
-
10.14778/3696435.3696439
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 24 of 24 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 34 |
Similarity Search in High Dimensions via Hashing |
1999 |
VLDB |
0.00076637636 |
| 212 |
Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph |
2019 |
VLDB |
0.00033913475 |
| 495 |
Milvus: A Purpose-Built Vector Data Management System |
2021 |
SIGMOD |
0.00021767688 |
| 736 |
AnalyticDB-V: A Hybrid Analytical Engine Towards Query Fusion for Structured and Unstructured Data |
2020 |
VLDB |
0.00017447617 |
| 867 |
SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index |
2015 |
VLDB |
0.00015792021 |
| 1,636 |
PASE: PostgreSQL Ultra-High-Dimensional Approximate Nearest Neighbor Search Extension |
2020 |
SIGMOD |
0.00011053863 |
| 1,920 |
Fast and Unified Local Search for Random Walk Based K-Nearest-Neighbor Query in Large Graphs |
2014 |
SIGMOD |
0.00010090791 |
| 1,931 |
Efficient Processing of k Nearest Neighbor Joins using MapReduce |
2012 |
VLDB |
0.00010040427 |
| 1,971 |
LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index |
2016 |
SIGMOD |
9.893198e-05 |
| 2,023 |
Efficient Approximate Nearest Neighbor Search in Multi-dimensional Databases |
2023 |
SIGMOD |
9.7544991e-05 |
| 2,262 |
Manu: A Cloud Native Vector Database Management System |
2022 |
VLDB |
9.1624446e-05 |
| 2,320 |
High-Throughput Vector Similarity Search in Knowledge Graphs |
2023 |
SIGMOD |
9.0366225e-05 |
| 2,324 |
RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search |
2024 |
SIGMOD |
9.0326444e-05 |
| 2,725 |
HVS: Hierarchical Graph Structure Based on Voronoi Diagrams for Solving Approximate Nearest Neighbor Search |
2022 |
VLDB |
8.2294908e-05 |
| 2,811 |
High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations |
2023 |
SIGMOD |
8.0806307e-05 |
| 2,971 |
Towards Efficient Index Construction and Approximate Nearest Neighbor Search in High-Dimensional Spaces |
2023 |
VLDB |
7.7970531e-05 |
| 5,456 |
Point-to-Hyperplane Nearest Neighbor Search Beyond the Unit Hypersphere |
2021 |
SIGMOD |
5.4976692e-05 |
| 5,758 |
Top-k Nearest Neighbor Search In Uncertain Data Series |
2015 |
VLDB |
5.339397e-05 |
| 6,503 |
Progressive Top-K Nearest Neighbors Search in Large Road Networks |
2020 |
SIGMOD |
5.0357715e-05 |
| 7,204 |
ARKGraph: All-Range Approximate K-Nearest-Neighbor Graph |
2023 |
VLDB |
4.8015761e-05 |
| 7,277 |
Exact Top-k Nearest Keyword Search in Large Networks |
2015 |
SIGMOD |
4.7794907e-05 |
| 9,307 |
Range-based Obstructed Nearest Neighbor Queries |
2016 |
SIGMOD |
4.3571035e-05 |
| 9,308 |
Optimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates |
2015 |
SIGMOD |
4.3571035e-05 |
| 9,309 |
Reverse k Nearest Neighbors Query Processing: Experiments and Analysis |
2015 |
VLDB |
4.3571035e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 10,122 |
TranSQL+: Serving Large Language Models with SQL on Low-Resource Hardware |
2026 |
SIGMOD |
4.1945683e-05 |
| 7,020 |
LLM for Data Management |
2024 |
VLDB |
4.8595728e-05 |
| 13,087 |
VecFlow-Chamfer: A GPU-based Data Management System for High-Performance Multi-Vector Search on Superchips |
2026 |
SIGMOD |
- |
| 9,449 |
An Interactive Multi-modal Query Answering System with Retrieval-Augmented Large Language Models |
2024 |
VLDB |
4.3399593e-05 |
| 13,138 |
Database Perspective on LLM Inference Systems |
2025 |
VLDB |
- |
| 10,452 |
ScaleLLM: A Technique for Scalable LLM-augmented Data Systems |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,222 |
RetroInfer: A Vector Storage Engine for Scalable Long-Context LLM Inference |
2026 |
VLDB |
4.1945683e-05 |
| 10,703 |
Fast Graph Vector Search via Hardware Acceleration and Delayed-Synchronization Traversal |
2025 |
VLDB |
4.1945683e-05 |
| 3,565 |
Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation |
2025 |
SIGMOD |
6.9655362e-05 |
| 10,170 |
From Prefix Cache to Fusion RAG Cache: Accelerating LLM Inference in Retrieval-Augmented Generation |
2026 |
SIGMOD |
4.1945683e-05 |