Database Paper Browser

Back to papers

InferF: Declarative Factorization of AI/ML Inferences over Joins

Summary: Declarative factorized inference over multi-way joins: push partial ML subcomputations to join-tree nodes to cut redundant inference and join cost. InferF formalizes plan selection for arbitrary analyzable inference expressions and uses greedy/genetic search; up to 11.3x speedup on Velox. (summarized by gpt-5-mini on Apr 11 2026)

Paper ID
7488
Venue
SIGMOD
Year
2026
Pagerank
4.1945683e-05
Overall Rank
10,177 | 29.21%
DOI
10.1145/3786662

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 37 of 37 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
544 Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources 2018 SIGMOD 0.00020521965
557 SystemML: Declarative Machine Learning on Spark 2016 VLDB 0.00020197988
659 The Making of TPC-DS 2006 VLDB 0.00018500853
834 Learning Linear Regression Models over Factorized Joins 2016 SIGMOD 0.00016135159
903 To Join or Not to Join? Thinking Twice about Joins before Feature Selection 2016 SIGMOD 0.0001547016
1,167 Learning Generalized Linear Models Over Normalized Data 2015 SIGMOD 0.00013547713
1,279 Towards Linear Algebra over Normalized Data 2017 VLDB 0.00012868394
1,284 Amazon Redshift Re-invented 2022 SIGMOD 0.00012837822
2,122 SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle 2020 CIDR 9.4989076e-05
2,194 Enabling and Optimizing Non-linear Feature Interactions in Factorized Linear Algebra 2019 SIGMOD 9.3138337e-05
2,350 An Intermediate Representation for Optimizing Machine Learning Pipelines 2019 VLDB 8.9788641e-05
2,528 Velox: Meta’s Unified Execution Engine 2022 VLDB 8.59454e-05
2,896 Evaluating End-to-End Optimization for Data Analytics Applications in Weld 2018 VLDB 7.9452051e-05
2,915 Brainwash: A Data System for Feature Engineering 2013 CIDR 7.9078385e-05
3,006 On Functional Aggregate Queries with Additive Inequalities 2019 PODS 7.7299363e-05
3,277 A Layered Aggregate Engine for Analytics Workloads 2019 SIGMOD 7.2871625e-05
3,345 QuickFOIL: Scalable Inductive Logic Programming 2015 VLDB 7.1958815e-05
3,407 End-to-end Optimization of Machine Learning Prediction Queries 2022 SIGMOD 7.1295646e-05
3,948 A Comparative Evaluation of Systems for Scalable Linear Algebra-based Analytics 2018 VLDB 6.5959084e-05
4,159 F: Regression Models over Factorized Views 2016 VLDB 6.3993326e-05
4,276 Looking Ahead Makes Query Plans Robust: Making the Initial Case with In-Memory Star Schema Data Warehouse Workloads 2017 VLDB 6.2976602e-05
4,948 Designing an Open Framework for Query Optimization and Compilation 2022 VLDB 5.8116879e-05
5,821 Tensor Relational Algebra for Distributed Machine Learning System Design 2021 VLDB 5.3134851e-05
6,378 Mitigating the Impedance Mismatch between Prediction Query Execution and Database Engine 2025 SIGMOD 5.0909804e-05
6,380 SmartLite: A DBMS-based Serving System for DNN Inference in Resource-constrained Environments 2024 VLDB 5.0893219e-05
6,541 ConnectorX: Accelerating Data Loading From Databases to Dataframes 2022 VLDB 5.0216945e-05
6,863 Declarative Sub-Operators for Universal Data Processing 2023 VLDB 4.905092e-05
7,061 Serving Deep Learning Models with Deduplication from Relational Databases 2022 VLDB 4.8463881e-05
7,476 Lachesis: Automatic Partitioning for UDF-Centric Analytics 2021 VLDB 4.7188928e-05
7,920 JoinBoost: Grow Trees Over Normalized Data Using Only SQL 2023 VLDB 4.6163888e-05
8,002 Pangea: Monolithic Distributed Storage for Data Analytics 2019 VLDB 4.6088289e-05
8,448 PARQO: Penalty-Aware Robust Plan Selection in Query Optimization 2024 VLDB 4.5100508e-05
9,332 PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development 2018 SIGMOD 4.3556432e-05
9,364 FEBench: A Benchmark for Real-Time Relational Data Feature Extraction 2023 VLDB 4.3502487e-05
9,886 Scalable and Usable Relational Learning With Automatic Language Bias 2021 SIGMOD 4.2621158e-05
10,499 Privacy and Accuracy-Aware AI/ML Model Deduplication 2025 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Semantically Similar Papers