ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines
Summary: ELT-Bench: end-to-end benchmark for AI agents to build ELT pipelines—integrating diverse sources, using data tools, writing code/SQL and orchestrating workflows (100 pipelines, 203 models). Eval of 4 agents with 6 LLMs: best agent succeeds on 11.3% of models (avg $1.41, 72.2 steps), exposing major automation gaps. (summarized by gpt-5-mini on Mar 13 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Tengjun Jin
- 2. Yuxuan Zhu
- 3. Daniel Kang
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 167 | The Snowflake Elastic Data Warehouse | 2016 | SIGMOD | 0.00039180521 |
| 369 | Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation | 2024 | VLDB | 0.0002547515 |
| 998 | CodeS: Towards Building Open-source Language Models for Text-to-SQL | 2024 | SIGMOD | 0.00014729379 |
| 1,963 | DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing | 2025 | VLDB | 9.929429e-05 |
| 3,359 | Text2SQL is Not Enough: Unifying AI and Databases with TAG | 2025 | CIDR | 7.1744146e-05 |
| 4,717 | Cloud Analytics Benchmark | 2023 | VLDB | 5.9751539e-05 |
| 5,114 | TPC-DI: The First Industry Benchmark for Data Integration | 2014 | VLDB | 5.6863051e-05 |
| 5,605 | TPCx-AI - An Industry Standard Benchmark for Artificial Intelligence and Machine Learning Systems | 2023 | VLDB | 5.4142007e-05 |
| 6,217 | Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End System | 2025 | SIGMOD | 5.1534752e-05 |
Previous
Page 1 / 1
Next