Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs
Summary: Interactive NL-driven data analysis prototype addressing LLM hallucination, inference cost, and low accuracy on complex tasks. Three-layer design: RAG for domain grounding, vector DB caching to reduce LLM calls, and a pipeline agent for multi-round task decomposition. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Xinyang Zhao
- 2. Xuanhe Zhou
- 3. Guoliang Li
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,956 | D-Bot: Database Diagnosis System using Large Language Models | 2024 | VLDB | 9.960627e-05 |
| 7,035 | R-Bot: An LLM-based Query Rewrite System | 2025 | VLDB | 4.8548467e-05 |
| 8,207 | SQLStorm: Taking Database Benchmarking into the LLM Era | 2025 | VLDB | 4.5583637e-05 |
| 9,394 | BigVectorBench: Heterogeneous Data Embedding and Compound Queries are Essential in Evaluating Vector Databases | 2025 | VLDB | 4.3441378e-05 |
| 10,160 | Efficient Vector Index Merging in Vector Databases | 2026 | SIGMOD | 4.1945683e-05 |
| 10,222 | RetroInfer: A Vector Storage Engine for Scalable Long-Context LLM Inference | 2026 | VLDB | 4.1945683e-05 |
| 10,658 | LLMLog: Advanced Log Template Generation via LLM-driven Multi-Round Annotation | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,956 | D-Bot: Database Diagnosis System using Large Language Models | 2024 | VLDB | 9.960627e-05 |
| 2,349 | RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation | 2021 | VLDB | 8.9876423e-05 |
| 3,727 | Cost-based or Learning-based? A Hybrid Query Optimizer for Query Plan Selection | 2022 | VLDB | 6.8141709e-05 |
| 5,074 | Learned Index: A Comprehensive Experimental Evaluation | 2023 | VLDB | 5.7175726e-05 |
| 8,103 | Grep: A Graph Learning Based Database Partitioning System | 2023 | SIGMOD | 4.5852201e-05 |
Previous
Page 1 / 1
Next