Database Paper Browser

Back to papers

Mitigating the Impedance Mismatch between Prediction Query Execution and Database Engine

Summary: Prediction-aware operator with inference-context reuse and batched invocation to reduce ML UDF impedance in DB engines. IMBridge on OceanBase enables one-off inference context setup and batched invocation, gives 71.4x speedup for prediction queries. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
7245
Venue
SIGMOD
Year
2025
Pagerank
5.0909804e-05
Overall Rank
6,378 | 55.64%
DOI
10.1145/3725326

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Rank Citing Paper Year Venue Pagerank
10,095 NeurStore: Efficient In-database Deep Learning Model Management System 2026 SIGMOD 4.1945683e-05
10,177 InferF: Declarative Factorization of AI/ML Inferences over Joins 2026 SIGMOD 4.1945683e-05
10,243 TPCx-AI under the Microscope: A Benchmarking Debt Analysis 2026 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 43 of 43 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
60 Efficiently Compiling Efficient Query Plans for Modern Hardware 2011 VLDB 0.00064439773
66 Spark SQL: Relational Data Processing in Spark 2015 SIGMOD 0.00061639801
140 The MADlib Analytics Library or MAD Skills, the SQL 2012 VLDB 0.00042270404
185 DuckDB: an Embeddable Analytical Database 2019 SIGMOD 0.00036538405
418 Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age 2014 SIGMOD 0.00023729211
853 Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask 2018 VLDB 0.00015940507
903 To Join or Not to Join? Thinking Twice about Joins before Feature Selection 2016 SIGMOD 0.0001547016
1,108 Froid: Optimization of Imperative Programs in a Relational Database 2018 VLDB 0.00013984276
1,750 Weld: A Common Runtime for High Performance Data Analytics 2017 CIDR 0.00010683647
1,864 Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last 2018 VLDB 0.00010280966
1,882 Tuplex: Data Science in Python at Native Code Speed 2021 SIGMOD 0.0001021625
2,528 Velox: Meta’s Unified Execution Engine 2022 VLDB 8.59454e-05
2,642 Vertica-ML: Distributed Machine Learning in Vertica Database 2020 SIGMOD 8.3851878e-05
2,691 Greenplum: A Hybrid Database for Transactional and Analytical Workloads 2021 SIGMOD 8.2909126e-05
2,804 Extending Relational Query Processing with ML Inference 2020 CIDR 8.0935487e-05
2,838 How to Architect a Query Compiler, Revisited 2018 SIGMOD 8.0408472e-05
3,254 Query Processing on Tensor Computation Runtimes 2022 VLDB 7.3161051e-05
3,407 End-to-end Optimization of Machine Learning Prediction Queries 2022 SIGMOD 7.1295646e-05
3,606 EVA: A Symbolic Approach to Accelerating Exploratory Video Analytics with Materialized Views 2022 SIGMOD 6.9260354e-05
3,628 OceanBase: A 707 Million tpmC Distributed Relational Database System 2022 VLDB 6.9031596e-05
3,875 Cloudy with High Chance of DBMS: A 10-year Prediction for Enterprise-Grade ML 2020 CIDR 6.675257e-05
3,958 MLog: Towards Declarative In-Database Machine Learning 2017 VLDB 6.5897636e-05
4,495 ClickHouse - Lightning Fast Analytics for Everyone 2024 VLDB 6.1410277e-05
4,548 Efficient and Portable Einstein Summation in SQL 2023 SIGMOD 6.0953447e-05
4,701 Tensors: An abstraction for general data processing 2021 VLDB 5.9866564e-05
4,924 User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases 2022 VLDB 5.822682e-05
4,948 Designing an Open Framework for Query Optimization and Compilation 2022 VLDB 5.8116879e-05
5,605 TPCx-AI - An Industry Standard Benchmark for Artificial Intelligence and Machine Learning Systems 2023 VLDB 5.4142007e-05
5,731 Babelfish: Efficient Execution of Polyglot Queries 2022 VLDB 5.3502065e-05
6,170 PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba 2023 SIGMOD 5.171601e-05
6,189 Accelerating Python UDFs in Vectorized Query Execution 2022 CIDR 5.1647573e-05
6,192 SQLite: Past, Present, and Future 2022 VLDB 5.1641743e-05
6,212 Snakes on a Plan: Compiling Python Functions into Plain SQL Queries 2022 SIGMOD 5.1552576e-05
6,380 SmartLite: A DBMS-based Serving System for DNN Inference in Resource-constrained Environments 2024 VLDB 5.0893219e-05
6,701 YeSQL: “You extend SQL” with Rich and Highly Performant User-Defined Functions in Relational Databases 2022 VLDB 4.9561066e-05
7,920 JoinBoost: Grow Trees Over Normalized Data Using Only SQL 2023 VLDB 4.6163888e-05
8,257 Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines 2023 SIGMOD 4.5487511e-05
8,583 Efficient Execution of User-Defined Functions in SQL Queries 2023 VLDB 4.4919445e-05
9,354 Interactive Demonstration of EVA 2023 VLDB 4.3517085e-05
9,911 Dias: Dynamic Rewriting of Pandas Code 2024 SIGMOD 4.2565279e-05
9,912 ElasticNotebook: Enabling Live Migration for Computational Notebooks 2024 VLDB 4.2565279e-05
9,913 Chukonu: A Fully-Featured High-Performance Big Data Framework that Integrates a Native Compute Engine into Spark 2022 VLDB 4.2565279e-05
9,958 OceanBase Paetica: A Hybrid Shared-nothing/Shared-everything Database for Supporting Single Machine and Distributed Cluster 2023 VLDB 4.2364477e-05
Previous Page 1 / 1 Next

Semantically Similar Papers