Udon: Efficient Debugging of User-Defined Functions in Big Data Systems with Line-by-Line Control
Summary: Udon enables line-by-line debugging of UDFs in big data systems, with breakpoints, inspections, and live edits on a single tuple. A debug-aware execution model, cross-UDF state transfer, and optimizations keep overhead low while scaling across diverse UDF workloads. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Yicong Huang
2. Zuozhi Wang
3. Chen Li
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,746 | Texera: A System for Collaborative and Interactive Data Analytics Using Workflows | 2024 | VLDB | 4.456315e-05 |
| 10,842 | ML-Asset Management: Curation, Discovery, and Utilization | 2025 | VLDB | 4.1945683e-05 |
| 10,883 | IcedTea: Efficient and Responsive Time-Travel Debugging in Dataflow Systems | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 1,882 | Tuplex: Data Science in Python at Native Code Speed | 2021 | SIGMOD | 0.0001021625 |
| 2,804 | Extending Relational Query Processing with ML Inference | 2020 | CIDR | 8.0935487e-05 |
| 5,072 | Optimizing Machine Learning Inference Queries with Correlative Proxy Models | 2022 | VLDB | 5.7185674e-05 |
| 8,038 | Amber: A Debuggable Dataflow System Based on the Actor Model | 2020 | VLDB | 4.600124e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,582 | BlackMagic: Automatic Inlining of Scalar UDFs into SQL Queries with Froid | 2019 | VLDB | 6.070187e-05 |
| 11,573 | Towards Scalable UDTFs in Noria | 2020 | SIGMOD | 4.1945683e-05 |
| 9,585 | One-pass Data Mining Algorithms in a DBMS with UDFs | 2011 | SIGMOD | 4.3218691e-05 |
| 9,763 | The UDFBench Benchmark for General-purpose UDF Queries | 2025 | VLDB | 4.2856106e-05 |
| 1,873 | An Architecture for Compiling UDF-centric Workflows | 2015 | VLDB | 0.00010253002 |
| 11,288 | To UDFs and Beyond: Demonstration of a Fully Decomposed Data Processor for General Data Wrangling Tasks | 2023 | VLDB | 4.1945683e-05 |
| 6,189 | Accelerating Python UDFs in Vectorized Query Execution | 2022 | CIDR | 5.1647573e-05 |
| 10,459 | UDFBench: A Tool for Benchmarking UDF Queries on SQL Engines | 2025 | SIGMOD | 4.1945683e-05 |
| 8,583 | Efficient Execution of User-Defined Functions in SQL Queries | 2023 | VLDB | 4.4919445e-05 |
| 4,924 | User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases | 2022 | VLDB | 5.822682e-05 |