Ryan Marcus, assistant professor at the University of Pennsylvania.
Using machine learning to build the next generation of data systems.
LLMs for Query Optimization

Large language models (LLMs) have fantastic reasoning capabilities in some domains. This project explores how that reasoning can be applied to database query optimization. Core challenges include (1) ensuring that any generated plan is correct, and (2) overcoming high LLM inference overhead. Our work has explored several directions, including embeddings, retrieval-augmented generation (RAG), and fine-tuning.
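To make the two challenges concrete, here is a minimal sketch of a guard that could sit between a model and the database. It is an illustration under stated assumptions, not the method used in the papers below. The names `is_valid_join_order`, `choose_join_order`, and the `propose` callback (a stand-in for an LLM call) are hypothetical. "Correct" is narrowed here to "the proposed join order is a permutation of the query's tables", and the cache is keyed only on the table set, which is far coarser than a real system would need.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple


def is_valid_join_order(query_tables: Sequence[str], proposed: Sequence[str]) -> bool:
    """Return True only if `proposed` is a permutation of the query's tables."""
    return sorted(proposed) == sorted(query_tables)


def choose_join_order(
    query_tables: Sequence[str],
    propose: Callable[[Sequence[str]], Sequence[str]],  # hypothetical LLM wrapper
    cache: Optional[Dict[Tuple[str, ...], List[str]]] = None,
) -> List[str]:
    """Return a validated join order, falling back to the input order."""
    key = tuple(sorted(query_tables))
    if cache is not None and key in cache:
        # Challenge (2): reuse an earlier answer instead of calling the model again.
        return list(cache[key])

    candidate = list(propose(query_tables))
    if not is_valid_join_order(query_tables, candidate):
        # Challenge (1): never pass an invalid proposal on; use a safe default.
        candidate = list(query_tables)

    if cache is not None:
        cache[key] = candidate
    return candidate
```

A caller would wrap its model invocation in `propose` and pass a shared dictionary as `cache`. Proposals that drop, duplicate, or invent tables are replaced by the original table order, so the downstream optimizer always receives a usable hint.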
Papers
- SEFRQO: A Self-Evolving Fine-Tuned RAG-Based Query Optimizer
  - Hanwen Liu
  - Qihan Zhang
  - Ryan Marcus
  - Ibrahim Sabek
  SIGMOD '26 (pdf) (doi)
- SERAG: Self-Evolving RAG System for Query Optimization
  - Hanwen Liu
  - Qihan Zhang
  - Ryan Marcus
  - Ibrahim Sabek
  aiDM @ SIGMOD '25 (pdf) (doi)
- The Unreasonable Effectiveness of LLMs for Query Optimization
  - Peter Akioyamen
  - Zixuan Yi
  - Ryan Marcus
  MLForSystems @ NeurIPS '24 (pdf) (doi)
People