LLM for Data Management
Summary: A tutorial surveying the use of LLMs to optimize data management. It advocates retrieval-augmented generation (RAG) to curb hallucination, vector databases to cut latency, and multi-round LLM-agent pipelines for complex workflows. It focuses on applications (query rewrite, database diagnosis, natural-language analytics), implementation trade-offs (cost, accuracy, latency), and open research challenges. (Summarized by gpt-5-mini on Feb 09 2026.)
Authors
1. Guoliang Li
2. Xuanhe Zhou
3. Xinyang Zhao
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,035 | R-Bot: An LLM-based Query Rewrite System | 2025 | VLDB | 4.8548467e-05 |
| 8,207 | SQLStorm: Taking Database Benchmarking into the LLM Era | 2025 | VLDB | 4.5583637e-05 |
| 10,415 | SAP HANA Cloud: Data Management for Modern Enterprise Applications | 2025 | SIGMOD | 4.1945683e-05 |
| 10,475 | Cracking SQL Barriers: An LLM-based Dialect Translation System | 2025 | SIGMOD | 4.1945683e-05 |
| 10,658 | LLMLog: Advanced Log Template Generation via LLM-driven Multi-Round Annotation | 2025 | VLDB | 4.1945683e-05 |
| 10,693 | EvoSchema: Towards Text-to-SQL Robustness Against Schema Evolution | 2025 | VLDB | 4.1945683e-05 |
| 13,138 | Database Perspective on LLM Inference Systems | 2025 | VLDB | - |
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,407 | DB-BERT: A Database Tuning Tool that "Reads the Manual" | 2022 | SIGMOD | 0.00012146739 |
| 1,956 | D-Bot: Database Diagnosis System using Large Language Models | 2024 | VLDB | 9.960627e-05 |
| 2,139 | Diagnosing Root Causes of Intermittent Slow Queries in Cloud Databases | 2020 | VLDB | 9.4640037e-05 |
| 2,349 | RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation | 2021 | VLDB | 8.9876423e-05 |
| 2,596 | WeTune: Automatic Discovery and Verification of Query Rewrite Rules | 2022 | SIGMOD | 8.4729982e-05 |
| 3,248 | A Learned Query Rewrite System using Monte Carlo Tree Search | 2022 | VLDB | 7.3258782e-05 |
| 5,023 | GenRewrite: Query Rewriting via Large Language Models | 2026 | SIGMOD | 5.75363e-05 |
| 5,525 | QueryBooster: Improving SQL Performance Using Middleware Services for Human-Centered Query Rewriting | 2023 | VLDB | 5.4600815e-05 |
| 5,861 | Machine Learning for Databases | 2021 | VLDB | 5.298883e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,316 | LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning | 2026 | VLDB | 4.1945683e-05 |
| 10,452 | ScaleLLM: A Technique for Scalable LLM-augmented Data Systems | 2025 | SIGMOD | 4.1945683e-05 |
| 9,219 | Intelligent Agents for Data Exploration | 2024 | VLDB | 4.3702863e-05 |
| 1,116 | Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes | 2024 | VLDB | 0.00013890154 |
| 10,595 | Optimized Batch Prompting for Cost-effective LLMs | 2025 | VLDB | 4.1945683e-05 |
| 6,389 | Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs | 2024 | VLDB | 5.0844009e-05 |
| 1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681 |
| 13,138 | Database Perspective on LLM Inference Systems | 2025 | VLDB | - |
| 8,736 | Unveiling Challenges for LLMs in Enterprise Data Engineering | 2026 | VLDB | 4.456315e-05 |
| 3,995 | How Large Language Models Will Disrupt Data Management | 2023 | VLDB | 6.5513237e-05 |