AOP: Automated and Interactive LLM Pipeline Orchestration for Answering Complex Queries
Summary: AOP: first system for automated, interactive LLM pipeline orchestration on data lakes; defines reusable semantic operators (retrieval, filter, aggregate, validate) to assemble multi‑step workflows across unstructured, semi‑structured, and structured data. LLM‑driven operator extraction plus pipeline optimizations (prefetching, parallelism, interactive self‑reflection) yields large accuracy gains (≈45%) on complex query benchmarks. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Jiayi Wang
- 2. Guoliang Li
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,035 | R-Bot: An LLM-based Query Rewrite System | 2025 | VLDB | 4.8548467e-05 |
| 9,729 | Semantic Integrity Constraints: Declarative Guardrails for AI-Augmented Data Processing Systems | 2025 | VLDB | 4.2942813e-05 |
| 9,990 | Deep Research is the New Analytics System: Towards Building the Runtime for AI-Driven Analytics | 2026 | CIDR | 4.1945683e-05 |
| 10,064 | Cut Costs, Not Accuracy: LLM-Powered Data Processing with Guarantees | 2026 | SIGMOD | 4.1945683e-05 |
| 10,215 | Task Cascades for Efficient Unstructured Data Processing | 2026 | SIGMOD | 4.1945683e-05 |
| 10,268 | OpenSQL: Data-Efficient Text-to-SQL for Open-Source LLMs via Synthesized Intermediate Supervision | 2026 | VLDB | 4.1945683e-05 |
| 10,800 | Unify: A System For Unstructured Data Analytics | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,082 | CAESURA: Language Models as Multi-Modal Query Planners | 2024 | CIDR | 0.00014214232 |
| 1,116 | Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes | 2024 | VLDB | 0.00013890154 |
| 1,541 | Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes | 2023 | CIDR | 0.00011456579 |
| 1,956 | D-Bot: Database Diagnosis System using Large Language Models | 2024 | VLDB | 9.960627e-05 |
| 3,662 | The Dawn of Natural Language to SQL: Are We Fully Ready? | 2024 | VLDB | 6.8672143e-05 |
| 4,543 | FACE: A Normalizing Flow based Cardinality Estimator | 2022 | VLDB | 6.1011198e-05 |
| 5,658 | Databases Unbound: Querying All of the World's Bytes with AI | 2024 | VLDB | 5.385675e-05 |
Previous
Page 1 / 1
Next