Ryan Marcus, assistant professor at the University of Pennsylvania. Using machine learning to build the next generation of data systems.
____ __ ___
/ __ \__ ______ _____ / |/ /___ _____________ _______
/ /_/ / / / / __ `/ __ \ / /|_/ / __ `/ ___/ ___/ / / / ___/
/ _, _/ /_/ / /_/ / / / / / / / / /_/ / / / /__/ /_/ (__ )
/_/ |_|\__, /\__,_/_/ /_/ /_/ /_/\__,_/_/ \___/\__,_/____/
/____/
___ __ ___
/ _ \__ _____ ____ / |/ /__ ___________ _____
/ , _/ // / _ `/ _ \ / /|_/ / _ `/ __/ __/ // (_-<
/_/|_|\_, /\_,_/_//_/ /_/ /_/\_,_/_/ \__/\_,_/___/
/___/
___ __ ___
/ _ \/ |/ /__ ___________ _____
/ , _/ /|_/ / _ `/ __/ __/ // (_-<
/_/|_/_/ /_/\_,_/_/ \__/\_,_/___/
I'm Ryan Marcus, an assistant professor of computer science at the University of Pennsylvania. I'm using machine learning to build the next generation of data management tools that automatically adapt to new hardware and user workloads, invent novel processing strategies, and understand user intention.
I am especially interested in query optimization, index structures, intelligent clouds, programming language runtimes, program synthesis for data processing, and applications of reinforcement learning to systems problems.
News
- 20 Jul 2024We'll be presenting our π vision for full stack adaptivity via machine learning for blockchain systems at VLDB '24, along with a π οΈ demo of BFTGym, our environment for performance testing BFT protocols under various fault conditions.
- 01 Jun 2024Two fresh takes on query planning presented at SIGMOD '24: first, π Stage, the cache-based multistage query latency predictor used in Redshift, and second, π LimeQO, a workload-level query steering technique using linear methods.
- 20 May 2024I appeared on the ποΈ Disseminate podcast.
- 06 Dec 2023I gave a π£οΈ talk at PrestoCon about learned query optimization and π AutoSteer (abstract).
Previous news items ...
- 16 Aug 2023Our π AutoSteer paper, an extensible learned query optimizer for any SQL database, was published in VLDB '23. We're also presenting a demo of π οΈ QO-Insight, our tool for exploring and understanding learned query optimizers.
- 19 Jun 2023Our π Kepler (robust learned parametric query optimization) and π Auto-WLM (learning enhanced workload management) papers were published at SIGMOD '23.
- 07 Apr 2023Our π AdaChain paper, the first adaptive blockchain that switches architectures in order to optimize throughput for dynamic workloads, was published at VLDB '23.
- 20 Feb 2023Our π paper on robust cardinality estimation under dynamic workloads was published at VLDB '23.
- 15 Sep 2022Our π SageDB paper, the first complete data system built with instance optimization as a foundational design principle, was published at VLDB '22.
- 30 Apr 2022I will be π joining the CIS faculty at the University of Pennsylvania in Fall 2023!
- 15 Jun 2021Our π Bao paper, a practical approach to learned query optimization, π wins the Best Paper Award at SIGMOD '21.
- 18 Mar 2021Our π paper presenting the first π οΈ benchmark of learned indexes has been accepted to VLDB '21.
Blog Posts
-
Ten years of improvements in PostgreSQL's optimizer
Since at least version 8, PostgreSQLβs query optimizer has been improving by around 15% between major versions
(12 Apr 2024).
-
Most influential database papers
We can use PageRank on top of the citation graph to find the influential papers in data management
(25 Jul 2023).
-
Generating bios with large language models
We can use a large language model to write a short bio for any researcher, but the results vary drastically in quality
(20 Dec 2022).
-
Applying Bao to distributed systems
We can apply Bao, a technique for learned query optimization, to a number of distributed cloud databases
(17 Jun 2021).
-
Machine learning for systems
A recent groundswell of research has been pushing machine learning into computer systems
(06 Jun 2019).
Copyright 2024 Ryan Marcus