YeSQL: “You extend SQL” with Rich and Highly Performant User-Defined Functions in Relational Databases
Summary: YeSQL extends SQL with rich Python UDFs through a pluggable architecture that supports scalar, aggregate, and table UDFs. It combines JIT tracing of Python UDFs, parallelism, UDF fusion, and stateful execution to minimize context switches between the engine and the UDF runtime, yielding speedups of up to 68x.
(summarized by gpt-5-nano on Feb 09 2026)
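To make the UDF kinds and the idea of fusion in the summary concrete, here is a minimal sketch using Python's standard `sqlite3` module as a stand-in. It is not YeSQL's API, and the function names (`normalize`, `first_token`, `fused`, `Longest`) and the table `t` are hypothetical. The fusion is done by hand, whereas the summary describes it as something the system does. Table UDFs are left out because `sqlite3` has no built-in hook for them.

```python
import sqlite3

def normalize(s):
    # Scalar UDF: called once per row from the SQL engine.
    return s.strip().lower()

def first_token(s):
    # Scalar UDF: first whitespace-separated token, or "" if none.
    parts = s.split()
    return parts[0] if parts else ""

def fused(s):
    # Hand-fused version of first_token(normalize(s)): the engine crosses
    # into Python once per row instead of twice.
    return first_token(normalize(s))

class Longest:
    # Aggregate UDF: keeps Python state (the best value so far) across rows.
    def __init__(self):
        self.best = ""

    def step(self, value):
        if value is not None and len(value) > len(self.best):
            self.best = value

    def finalize(self):
        return self.best

conn = sqlite3.connect(":memory:")
conn.create_function("normalize", 1, normalize)
conn.create_function("first_token", 1, first_token)
conn.create_function("fused", 1, fused)
conn.create_aggregate("longest", 1, Longest)

conn.execute("CREATE TABLE t(title TEXT)")
conn.executemany(
    "INSERT INTO t VALUES (?)",
    [("  YeSQL Rich UDFs ",), ("Tuplex Data Science",), ("Weld",)],
)

# Two UDF calls per row versus one fused call per row: same result.
nested = conn.execute(
    "SELECT first_token(normalize(title)) FROM t ORDER BY rowid"
).fetchall()
single = conn.execute("SELECT fused(title) FROM t ORDER BY rowid").fetchall()

# Aggregate UDF applied on top of the fused scalar UDF.
longest = conn.execute("SELECT longest(fused(title)) FROM t").fetchone()[0]
```

Both queries return the same rows. The fused form only reduces how often execution crosses between the SQL engine and the Python interpreter, which is the cost the summary refers to as context switches.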
- Paper ID: 12721
- Venue: VLDB
- Year: 2022
- Pagerank: 4.9561066e-05
- Overall Rank: 6,701 | 53.39%
- DOI: 10.14778/3547305.3547328
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,375 | Dear User-Defined Functions, Inlining isn't working out so great for us. Let's try batching to make our relationship work. Sincerely, SQL | 2024 | CIDR | 5.0923872e-05 |
| 6,378 | Mitigating the Impedance Mismatch between Prediction Query Execution and Database Engine | 2025 | SIGMOD | 5.0909804e-05 |
| 6,863 | Declarative Sub-Operators for Universal Data Processing | 2023 | VLDB | 4.905092e-05 |
| 8,583 | Efficient Execution of User-Defined Functions in SQL Queries | 2023 | VLDB | 4.4919445e-05 |
| 8,645 | Predicate Pushdown for Data Science Pipelines | 2023 | SIGMOD | 4.4772518e-05 |
| 9,343 | The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining | 2025 | VLDB | 4.3546206e-05 |
| 9,718 | YeSQL: Rich User-Defined Functions without the Overhead | 2022 | VLDB | 4.2980763e-05 |
| 9,763 | The UDFBench Benchmark for General-purpose UDF Queries | 2025 | VLDB | 4.2856106e-05 |
| 9,884 | SQL Engines Excel at the Execution of Imperative Programs | 2024 | VLDB | 4.2635782e-05 |
| 10,459 | UDFBench: A Tool for Benchmarking UDF Queries on SQL Engines | 2025 | SIGMOD | 4.1945683e-05 |
| 10,969 | Query Compilation Without Regrets | 2024 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 15 of 15 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 853 | Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask | 2018 | VLDB | 0.00015940507 |
| 1,108 | Froid: Optimization of Imperative Programs in a Relational Database | 2018 | VLDB | 0.00013984276 |
| 1,750 | Weld: A Common Runtime for High Performance Data Analytics | 2017 | CIDR | 0.00010683647 |
| 1,882 | Tuplex: Data Science in Python at Native Code Speed | 2021 | SIGMOD | 0.0001021625 |
| 2,237 | Procedural Extensions of SQL: Understanding their usage in the wild | 2021 | VLDB | 9.2212748e-05 |
| 2,934 | AIDA - Abstraction for Advanced In-Database Analytics | 2018 | VLDB | 7.8595778e-05 |
| 2,954 | Magpie: Python at Speed and Scale using Cloud Backends | 2021 | CIDR | 7.8262582e-05 |
| 3,080 | Compiling PL/SQL Away | 2020 | CIDR | 7.603389e-05 |
| 4,648 | Aggify: Lifting the Curse of Cursor Loops using Custom Aggregates | 2020 | SIGMOD | 6.0247446e-05 |
| 4,813 | Putting Pandas in a Box | 2021 | CIDR | 5.9049746e-05 |
| 6,189 | Accelerating Python UDFs in Vectorized Query Execution | 2022 | CIDR | 5.1647573e-05 |
| 6,645 | Functional-Style SQL UDFs With a Capital 'F' | 2020 | SIGMOD | 4.978205e-05 |
| 6,990 | Machine Learning, Linear Algebra, and More: Is SQL All You Need? | 2022 | CIDR | 4.8704904e-05 |
| 8,692 | Boosting Efficiency of External Pipelines by Blurring Application Boundaries | 2022 | CIDR | 4.4661967e-05 |
| 9,297 | Introducing a Query Acceleration Path for Analytics in SQLite3 | 2022 | CIDR | 4.3594338e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,288 | Using SQL to Build New Aggregates and Extenders for Object-Relational Systems | 2000 | VLDB | 4.3622826e-05 |
| 2,237 | Procedural Extensions of SQL: Understanding their usage in the wild | 2021 | VLDB | 9.2212748e-05 |
| 7,760 | G-SQL: Fast Query Processing via Graph Exploration | 2016 | VLDB | 4.6589413e-05 |
| 1,355 | SQL/MapReduce: A practical approach to self-describing, polymorphic, and parallelizable user-defined functions | 2009 | VLDB | 0.00012404572 |
| 4,569 | User-Defined Table Operators: Enhancing Extensibility for ORDBMS | 1999 | VLDB | 6.0782978e-05 |
| 9,763 | The UDFBench Benchmark for General-purpose UDF Queries | 2025 | VLDB | 4.2856106e-05 |
| 10,459 | UDFBench: A Tool for Benchmarking UDF Queries on SQL Engines | 2025 | SIGMOD | 4.1945683e-05 |
| 6,189 | Accelerating Python UDFs in Vectorized Query Execution | 2022 | CIDR | 5.1647573e-05 |
| 8,583 | Efficient Execution of User-Defined Functions in SQL Queries | 2023 | VLDB | 4.4919445e-05 |
| 9,718 | YeSQL: Rich User-Defined Functions without the Overhead | 2022 | VLDB | 4.2980763e-05 |