Database Paper Browser


Database Perspective on LLM Inference Systems

Summary: Reframes LLM inference as a data-management problem, analyzing request processing, model execution/optimization, and memory management to lower cost and handle uncertain request lifecycles. Surveys system techniques—hardware acceleration, batching, caching, and distributed execution—and how they are composed across architectures to meet application and performance goals. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID: 14190
Venue: VLDB
Year: 2025
Pagerank: -
Overall Rank: 13,138 | 8.61%
DOI: 10.14778/3750601.3750703

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.


Outgoing Citations (Sorted by Pagerank)

Showing 1 of 1 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
7,020 | LLM for Data Management | 2024 | VLDB | 4.8595728e-05

Semantically Similar Papers