DepCache: A KV Cache Management Framework for GraphRAG with Dependency Attention
Summary: Dependency attention: graph-aware attention that prunes token-pair interactions to structural dependencies and reuses computations along relational paths to reduce inference cost. DepCache: KV-cache reuse aligned across graph-augmented prompts with a locality-aware replacement policy, yielding 1.5–5× throughput and up to 3.2× time-to-first-token reduction without accuracy loss. (summarized by gpt-5-mini on Feb 11 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Hao Yuan
- 2. Xin Ai
- 3. Qiange Wang
- 4. Peizheng Li
- 5. Jiayang Yu
- 6. Chaoyi Chen
- 7. Xinbo Yang
- 8. Yanfeng Zhang
- 9. Zhenbo Fu
- 10. Yingyou Wen
- 11. Ge Yu
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,025 | NeutronStar: Distributed GNN Training with Hybrid Dependency Management | 2022 | SIGMOD | 7.6906935e-05 |
| 3,131 | FINEdex: A Fine-grained Learned Index Scheme for Scalable and Concurrent Memory Systems | 2022 | VLDB | 7.4985793e-05 |
| 3,565 | Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation | 2025 | SIGMOD | 6.9655362e-05 |
| 5,737 | Comprehensive Evaluation of GNN Training Systems: A Data Management Perspective | 2024 | VLDB | 5.3480667e-05 |
| 6,357 | PQCache: Product Quantization-based KVCache for Long Context LLM Inference | 2025 | SIGMOD | 5.0970739e-05 |
| 7,091 | HongTu: Scalable Full-Graph GNN Training on Multiple GPUs | 2023 | SIGMOD | 4.8370645e-05 |
| 7,481 | Buffered Persistence in B+ Trees | 2024 | SIGMOD | 4.7180617e-05 |
| 9,395 | NeutronTP: Load-Balanced Distributed Full-Graph GNN Training with Tensor Parallelism | 2025 | VLDB | 4.3441378e-05 |
| 13,104 | NeutronRAG: Towards Understanding the Effectiveness of RAG from a Data Retrieval Perspective | 2025 | SIGMOD | - |
Previous
Page 1 / 1
Next