Database Paper Browser

ContextCache: Context-Aware Semantic Cache for Multi-Turn Queries in Large Language Models

Summary: ContextCache is a context-aware semantic cache for multi-turn LLM dialogues. It first retrieves vector-based candidates for the current query, then refines the matches with self-attention over the current and historical dialogue representations. It improves precision and recall over per-query caches on real conversations and delivers roughly 10× lower latency for cached responses, reducing LLM compute costs. (summarized by gpt-5-mini on Feb 09 2026)
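
The two-stage lookup described in the summary can be illustrated with a toy sketch. This is not the paper's implementation. It assumes hand-made embedding vectors, a parameter-free scaled dot-product attention in place of the paper's self-attention model, and arbitrary thresholds. The names ToyContextCache, attend, query_threshold, and context_threshold are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0


def attend(turns):
    """Assumption: untrained scaled dot-product self-attention with identity
    projections. Returns the attended vector for the last turn (the current
    query), mixing in earlier turns by their similarity to it."""
    query = turns[-1]
    scale = math.sqrt(len(query))
    scores = [dot(query, turn) / scale for turn in turns]
    peak = max(scores)
    weights = [math.exp(s - peak) for s in scores]  # numerically stable softmax
    total = sum(weights)
    return [
        sum(w * turn[i] for w, turn in zip(weights, turns)) / total
        for i in range(len(query))
    ]


class ToyContextCache:
    """Hypothetical cache: stage 1 filters by query similarity,
    stage 2 re-checks the candidates against the dialogue context."""

    def __init__(self, top_k=3, query_threshold=0.8, context_threshold=0.9):
        self.top_k = top_k
        self.query_threshold = query_threshold      # illustrative value
        self.context_threshold = context_threshold  # illustrative value
        self.entries = []  # (query_vec, context_vec, response)

    def put(self, history, query, response):
        self.entries.append((query, attend(history + [query]), response))

    def get(self, history, query):
        # Stage 1: vector-based candidates for the current query only.
        scored = [(cosine(query, entry[0]), entry) for entry in self.entries]
        scored = [pair for pair in scored if pair[0] >= self.query_threshold]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        candidates = [entry for _, entry in scored[: self.top_k]]
        if not candidates:
            return None
        # Stage 2: compare context-aware representations of the dialogues.
        context = attend(history + [query])
        best = max(candidates, key=lambda entry: cosine(context, entry[1]))
        if cosine(context, best[1]) >= self.context_threshold:
            return best[2]
        return None


# Usage: the same follow-up question asked in two different conversations.
phone_turn = [0.0, 1.0, 0.0]    # hypothetical embedding of a turn about phones
laptop_turn = [0.0, 0.0, 1.0]   # hypothetical embedding of a turn about laptops
follow_up = [1.0, 0.0, 0.0]     # hypothetical embedding of "what about its price?"

cache = ToyContextCache()
cache.put([phone_turn], follow_up, "cached answer about the phone")
hit = cache.get([phone_turn], follow_up)    # same context: served from cache
miss = cache.get([laptop_turn], follow_up)  # same query, other context: None
```

A per-query cache would return the phone answer in both cases, because the follow-up embeddings are identical. In this sketch the second stage rejects the laptop conversation because its context representation differs from the cached one.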

Paper ID: 14164
Venue: VLDB
Year: 2025
Pagerank: -
Overall Rank: 13,135 | 8.63%
DOI: 10.14778/3750601.3750679

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 0 of 0 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers