Back to papers
Efficient Execution of User-Defined Functions in SQL Queries
Summary: Tutorial surveying low-level techniques for physical optimization and compilation of SQL queries with UDFs to close the execution mismatch between SQL engines and procedural code. Compares core approaches, highlights limitations and gaps, and outlines future research directions.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 13217
- Venue
- VLDB
- Year
- 2023
- Pagerank
- 4.4919445e-05
- Overall Rank
- 8,583 | 40.29%
- DOI
-
10.14778/3611540.3611574
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 23 of 23 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 853 |
Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask |
2018 |
VLDB |
0.00015940507 |
| 1,108 |
Froid: Optimization of Imperative Programs in a Relational Database |
2018 |
VLDB |
0.00013984276 |
| 1,750 |
Weld: A Common Runtime for High Performance Data Analytics |
2017 |
CIDR |
0.00010683647 |
| 1,873 |
An Architecture for Compiling UDF-centric Workflows |
2015 |
VLDB |
0.00010253002 |
| 1,882 |
Tuplex: Data Science in Python at Native Code Speed |
2021 |
SIGMOD |
0.0001021625 |
| 2,804 |
Extending Relational Query Processing with ML Inference |
2020 |
CIDR |
8.0935487e-05 |
| 2,896 |
Evaluating End-to-End Optimization for Data Analytics Applications in Weld |
2018 |
VLDB |
7.9452051e-05 |
| 2,934 |
AIDA - Abstraction for Advanced In-Database Analytics |
2018 |
VLDB |
7.8595778e-05 |
| 2,954 |
Magpie: Python at Speed and Scale using Cloud Backends |
2021 |
CIDR |
7.8262582e-05 |
| 3,080 |
Compiling PL/SQL Away |
2020 |
CIDR |
7.603389e-05 |
| 3,296 |
Extracting Equivalent SQL from Imperative Code in Database Applications |
2016 |
SIGMOD |
7.2596583e-05 |
| 3,918 |
On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML |
2018 |
VLDB |
6.6315176e-05 |
| 4,648 |
Aggify: Lifting the Curse of Cursor Loops using Custom Aggregates |
2020 |
SIGMOD |
6.0247446e-05 |
| 4,813 |
Putting Pandas in a Box |
2021 |
CIDR |
5.9049746e-05 |
| 4,924 |
User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases |
2022 |
VLDB |
5.822682e-05 |
| 5,731 |
Babelfish: Efficient Execution of Polyglot Queries |
2022 |
VLDB |
5.3502065e-05 |
| 6,189 |
Accelerating Python UDFs in Vectorized Query Execution |
2022 |
CIDR |
5.1647573e-05 |
| 6,212 |
Snakes on a Plan: Compiling Python Functions into Plain SQL Queries |
2022 |
SIGMOD |
5.1552576e-05 |
| 6,645 |
Functional-Style SQL UDFs With a Capital 'F' |
2020 |
SIGMOD |
4.978205e-05 |
| 6,701 |
YeSQL: “You extend SQL” with Rich and Highly Performant User-Defined Functions in Relational Databases |
2022 |
VLDB |
4.9561066e-05 |
| 6,990 |
Machine Learning, Linear Algebra, and More: Is SQL All You Need? |
2022 |
CIDR |
4.8704904e-05 |
| 8,692 |
Boosting Efficiency of External Pipelines by Blurring Application Boundaries |
2022 |
CIDR |
4.4661967e-05 |
| 13,400 |
BabbleFlow - A Translator for Analytic Data Flow Programs |
2014 |
SIGMOD |
- |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 5,176 |
User-Defined Aggregate Functions: Bridging Theory and Practice |
2006 |
SIGMOD |
5.6439407e-05 |
| 12,316 |
Fast and Dynamic OLAP Exploration Using UDFs |
2009 |
SIGMOD |
4.1945683e-05 |
| 1,873 |
An Architecture for Compiling UDF-centric Workflows |
2015 |
VLDB |
0.00010253002 |
| 6,375 |
Dear User-Defined Functions, Inlining isn't working out so great for us. Let's try batching to make our relationship work. Sincerely, SQL |
2024 |
CIDR |
5.0923872e-05 |
| 4,924 |
User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases |
2022 |
VLDB |
5.822682e-05 |
| 4,014 |
Exploiting Correlations for Expensive Predicate Evaluation |
2015 |
SIGMOD |
6.5273084e-05 |
| 10,459 |
UDFBench: A Tool for Benchmarking UDF Queries on SQL Engines |
2025 |
SIGMOD |
4.1945683e-05 |
| 6,189 |
Accelerating Python UDFs in Vectorized Query Execution |
2022 |
CIDR |
5.1647573e-05 |
| 6,645 |
Functional-Style SQL UDFs With a Capital 'F' |
2020 |
SIGMOD |
4.978205e-05 |
| 9,763 |
The UDFBench Benchmark for General-purpose UDF Queries |
2025 |
VLDB |
4.2856106e-05 |