Back to papers
Palimpzest: Optimizing AI-Powered Analytics with Declarative Query Processing
Summary: PALIMPZEST: declarative language + system to express AI-powered analytics over unstructured corpora, automating orchestration of models, prompts, and data operations. A cost-based optimizer searches model/prompt/implementation choices to trade latency, cost, and accuracy, yielding up to 3.3x speedup and 2.9x cost reduction with improved F1.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 538
- Venue
- CIDR
- Year
- 2025
- Pagerank
- 9.5342543e-05
- Overall Rank
- 2,106 | 85.36%
- DOI
-
-
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 24 of 24 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 1,963 |
DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing |
2025 |
VLDB |
9.929429e-05 |
| 5,171 |
Abacus: A Cost-Based Optimizer for Semantic Operator Systems |
2026 |
VLDB |
5.6464993e-05 |
| 6,217 |
Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End System |
2025 |
SIGMOD |
5.1534752e-05 |
| 8,736 |
Unveiling Challenges for LLMs in Enterprise Data Engineering |
2026 |
VLDB |
4.456315e-05 |
| 9,370 |
PalimpChat: Declarative and Interactive AI analytics |
2025 |
SIGMOD |
4.3480692e-05 |
| 9,729 |
Semantic Integrity Constraints: Declarative Guardrails for AI-Augmented Data Processing Systems |
2025 |
VLDB |
4.2942813e-05 |
| 9,871 |
From Logs to Causal Inference: Diagnosing Large Systems |
2025 |
VLDB |
4.2667743e-05 |
| 9,971 |
Data Movement-Aware GPU Sharing for Data-Intensive Systems |
2026 |
CIDR |
4.1945683e-05 |
| 9,972 |
KathDB: Explainable Multimodal Database Management System with Human-AI Collaboration |
2026 |
CIDR |
4.1945683e-05 |
| 9,981 |
Survivorship Bias in Industrial Database Workloads |
2026 |
CIDR |
4.1945683e-05 |
| 9,985 |
Making Prompts First-Class Citizens for Adaptive LLM Pipelines |
2026 |
CIDR |
4.1945683e-05 |
| 9,990 |
Deep Research is the New Analytics System: Towards Building the Runtime for AI-Driven Analytics |
2026 |
CIDR |
4.1945683e-05 |
| 10,117 |
AixelAsk: A Stepwise-Guided Retrieval and Reasoning Framework for Large Table QA |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,144 |
Beyond Relational: Semantic-Aware Multi-Modal Analytics with LLM-Native Query Optimization |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,185 |
MultiVis-Agent: A Multi-Agent Framework with Logic Rules for Reliable and Comprehensive Cross-Modal Data Visualization |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,194 |
PRISM: Navigating Cost–Accuracy Trade-offs for NL2SQL |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,212 |
SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,215 |
Task Cascades for Efficient Unstructured Data Processing |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,456 |
SwellDB: Dynamic Query-Driven Table Generation with Large Language Models |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,711 |
Cracking Vector Search Indexes |
2025 |
VLDB |
4.1945683e-05 |
| 10,752 |
QUEST: Query Optimization in Unstructured Document Analysis |
2025 |
VLDB |
4.1945683e-05 |
| 10,800 |
Unify: A System For Unstructured Data Analytics |
2025 |
VLDB |
4.1945683e-05 |
| 10,827 |
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB |
2025 |
VLDB |
4.1945683e-05 |
| 13,134 |
DocDB: A Database for Unstructured Document Analysis |
2025 |
VLDB |
- |
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 1,855 |
AI Meets AI: Leveraging Query Executions to Improve Index Recommendations |
2019 |
SIGMOD |
0.00010315245 |
| 10,950 |
PLAQUE: Automated Predicate Learning at Query Time |
2024 |
SIGMOD |
4.1945683e-05 |
| 12,847 |
Investigation of Algebraic Query Optimisation for Database Programming Languages |
1994 |
VLDB |
4.1945683e-05 |
| 11,650 |
Query-Driven Learning for Next Generation Predictive Modeling & Analytics |
2019 |
SIGMOD |
4.1945683e-05 |
| 5,658 |
Databases Unbound: Querying All of the World's Bytes with AI |
2024 |
VLDB |
5.385675e-05 |
| 5,840 |
Logical and Physical Optimizations for SQL Query Execution over Large Language Models |
2025 |
SIGMOD |
5.3042561e-05 |
| 119 |
Answering Queries using Humans, Algorithms and Databases |
2011 |
CIDR |
0.0004564788 |
| 9,990 |
Deep Research is the New Analytics System: Towards Building the Runtime for AI-Driven Analytics |
2026 |
CIDR |
4.1945683e-05 |
| 10,752 |
QUEST: Query Optimization in Unstructured Document Analysis |
2025 |
VLDB |
4.1945683e-05 |
| 9,370 |
PalimpChat: Declarative and Interactive AI analytics |
2025 |
SIGMOD |
4.3480692e-05 |