Scalable Semantic Operators

ScaleLLM flowchart. See https://doi.org/10.1145/3722212.3725130 for details.

Semantic operators, typically powered by LLMs, are taking the database world by storm. Queries that were previously out of reach, like “does this review appear fake?”, are now possible. Unfortunately, naive implementations generally call an expensive LLM for every row of data. How can we scale semantic operators to datasets with billions of rows?
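To make the cost problem concrete, here is a minimal sketch of the naive approach described above: a semantic filter that issues one LLM call per row. It is not ScaleLLM's technique, only the baseline it aims to improve on. The names `naive_semantic_filter` and `CountingStubLLM` are hypothetical, and the stub stands in for a real model so the example runs without any API access.

```python
from typing import Callable, Iterable, List


def naive_semantic_filter(
    rows: Iterable[str],
    predicate_prompt: str,
    llm: Callable[[str], str],
) -> List[str]:
    """Keep the rows for which the LLM answers 'yes'. One LLM call per row."""
    kept = []
    for row in rows:
        answer = llm(f"{predicate_prompt}\n\nRow: {row}\nAnswer yes or no.")
        if answer.strip().lower().startswith("yes"):
            kept.append(row)
    return kept


class CountingStubLLM:
    """Hypothetical stand-in for a real model (an assumption for illustration).

    It answers 'yes' whenever the prompt contains the phrase 'best ever'
    and counts how many times it is called.
    """

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, prompt: str) -> str:
        self.calls += 1
        return "yes" if "best ever" in prompt.lower() else "no"


reviews = [
    "Best ever!!! Buy now!!!",
    "Battery lasts about two days.",
    "Arrived late but works fine.",
]

llm = CountingStubLLM()
fake = naive_semantic_filter(reviews, "Does this review appear fake?", llm)

print(fake)
print(f"LLM calls: {llm.calls} for {len(reviews)} rows")
```

The number of LLM calls equals the number of rows, so a table with billions of rows would need billions of model invocations under this scheme.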

Papers

ScaleLLM: A technique for scalable LLM-augmented data systems (Demo)
1. Ashwin Alaparthi
2. Paul Loh
3. Ryan Marcus
SIGMOD '25
