Back to papers
Database Perspective on LLM Inference Systems
Summary: Reframes LLM inference as a data-management problem, analyzing request processing, model execution/optimization, and memory management to lower cost and handle uncertain request lifecycles. Surveys system techniques—hardware acceleration, batching, caching, and distributed execution—and how they are composed across architectures to meet application and performance goals.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 14190
- Venue
- VLDB
- Year
- 2025
- Pagerank
- -
- Overall Rank
- 13,138 | 8.61%
- DOI
-
10.14778/3750601.3750703
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 5,258 |
One Model to Rule them All: Towards Zero-Shot Learning for Databases |
2022 |
CIDR |
5.5998705e-05 |
| 8,488 |
Can Large Language Models Be Query Optimizer for Relational Databases? |
2026 |
SIGMOD |
4.4998609e-05 |
| 3,995 |
How Large Language Models Will Disrupt Data Management |
2023 |
VLDB |
6.5513237e-05 |
| 10,217 |
This is Going to Sound Crazy, But What If We Used Large Language Models to Boost Automatic Database Tuning Algorithms By Leveraging Prior History? We Will Find Better Configurations More Quickly Than Retraining From Scratch! |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,122 |
TranSQL+: Serving Large Language Models with SQL on Low-Resource Hardware |
2026 |
SIGMOD |
4.1945683e-05 |
| 5,840 |
Logical and Physical Optimizations for SQL Query Execution over Large Language Models |
2025 |
SIGMOD |
5.3042561e-05 |
| 9,391 |
Database as Runtime: Compiling LLMs to SQL for In-database Model Serving |
2025 |
SIGMOD |
4.3441378e-05 |
| 10,452 |
ScaleLLM: A Technique for Scalable LLM-augmented Data Systems |
2025 |
SIGMOD |
4.1945683e-05 |
| 13,171 |
Reimagining Deep Learning Systems Through the Lens of Data Systems |
2024 |
VLDB |
- |
| 7,020 |
LLM for Data Management |
2024 |
VLDB |
4.8595728e-05 |