InferDB: In-Database Machine Learning Inference Using Indexes
Summary: Uses a discretizing embedding of selected features and an index that maps embedding cells to aggregated model outputs to approximate end-to-end ML inference inside the database. Replaces preprocessing and model execution with a transform followed by an index lookup, cutting latency by orders of magnitude while retaining accuracy similar to the original model. (summarized by gpt-5-mini on Feb 09 2026)
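The transform-and-lookup idea in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: the fixed bin edges, the mean as the aggregate, the in-memory dictionary standing in for a database index, and all names (`embed`, `build_index`, `predict`, `toy_model`) are assumptions made for illustration only.

```python
from bisect import bisect_right
from collections import defaultdict
from statistics import mean


def embed(row, bin_edges):
    """Map a feature vector to a discrete cell: one bin id per selected feature."""
    return tuple(bisect_right(edges, x) for x, edges in zip(row, bin_edges))


def build_index(rows, model, bin_edges):
    """Run the model once per row and aggregate its outputs per cell (here: the mean)."""
    cells = defaultdict(list)
    for row in rows:
        cells[embed(row, bin_edges)].append(model(row))
    return {cell: mean(preds) for cell, preds in cells.items()}


def predict(row, index, bin_edges, default=None):
    """Approximate inference: transform the row, then look up its cell."""
    return index.get(embed(row, bin_edges), default)


def toy_model(row):
    """Hypothetical stand-in for an expensive preprocessing + model pipeline."""
    return 2.0 * row[0] + row[1]


# Assumed bin edges: feature 0 is split at 1.0 and 2.0, feature 1 at 10.0.
bin_edges = [[1.0, 2.0], [10.0]]
training_rows = [(0.5, 5.0), (0.7, 6.0), (1.5, 12.0), (2.5, 3.0)]
index = build_index(training_rows, toy_model, bin_edges)

# A new row is answered from the index without calling the model.
approx = predict((0.6, 5.5), index, bin_edges)
```

In this sketch a row that falls into a cell never seen while building the index returns `default`; how such misses are handled in the actual system is not stated in the summary.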
[Chart: Incoming Non-self Citations Over Time]
Authors
1. Ricardo Salazar-Díaz
2. Boris Glavic
3. Tilmann Rabl
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,688 | NeurDB: On the Design and Implementation of an AI-powered Autonomous Database | 2025 | CIDR | 4.4673127e-05 |
| 9,983 | Does A Fish Need a Bicycle? The Case for On-Chip NPUs in DBMS | 2026 | CIDR | 4.1945683e-05 |
| 10,095 | NeurStore: Efficient In-database Deep Learning Model Management System | 2026 | SIGMOD | 4.1945683e-05 |
| 10,130 | MorphingDB: A Task-Centric AI-Native DBMS for Model Management and Inference | 2026 | SIGMOD | 4.1945683e-05 |
| 10,143 | Beluga: A CXL-Based Memory Architecture for Scalable and Efficient LLM KVCache Management | 2026 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 658 | Towards a Unified Architecture for in-RDBMS Analytics | 2012 | SIGMOD | 0.00018506577 |
| 904 | Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications | 1998 | SIGMOD | 0.00015469655 |
| 2,350 | An Intermediate Representation for Optimizing Machine Learning Pipelines | 2019 | VLDB | 8.9788641e-05 |
| 3,407 | End-to-end Optimization of Machine Learning Prediction Queries | 2022 | SIGMOD | 7.1295646e-05 |
| 4,557 | Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches | 2021 | VLDB | 6.087611e-05 |
| 5,224 | Neighbor-Sensitive Hashing | 2016 | VLDB | 5.6197981e-05 |
| 6,327 | The Tensor Data Platform: Towards an AI-centric Database System | 2023 | CIDR | 5.1083405e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,776 | Structure-Aware Machine Learning over Multi-Relational Databases | 2021 | SIGMOD | 4.2856106e-05 |
| 5,861 | Machine Learning for Databases | 2021 | VLDB | 5.298883e-05 |
| 13,138 | Database Perspective on LLM Inference Systems | 2025 | VLDB | - |
| 2,804 | Extending Relational Query Processing with ML Inference | 2020 | CIDR | 8.0935487e-05 |
| 9,351 | On Efficient Approximate Queries over Machine Learning Models | 2023 | VLDB | 4.3524472e-05 |
| 5,074 | Learned Index: A Comprehensive Experimental Evaluation | 2023 | VLDB | 5.7175726e-05 |
| 329 | Accelerating Machine Learning Inference with Probabilistic Predicates | 2018 | SIGMOD | 0.00027249545 |
| 4,387 | Hybrid In-Database Inference for Declarative Information Extraction | 2011 | SIGMOD | 6.2320072e-05 |
| 6,378 | Mitigating the Impedance Mismatch between Prediction Query Execution and Database Engine | 2025 | SIGMOD | 5.0909804e-05 |
| 5,337 | Learned Index Benefits: Machine Learning Based Index Performance Estimation | 2022 | VLDB | 5.5635208e-05 |