Database Paper Browser

Hardware-Efficient Data Imputation through DBMS Extensibility

Summary: HCP stores and evaluates code as data ("homoiconic expressions") inside the DBMS to enable in-kernel imputation, augmentation, and other data-science tasks. BOSS implements Shape-Wise Microbatching to make HCP hardware-efficient, matching the performance of tuned DBMSs and yielding speedups of 2–5 orders of magnitude over prior homoiconic runtimes and imputation systems. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID: 13559
Venue: VLDB
Year: 2024
Pagerank: 4.1945683e-05
Overall Rank: 11,069 | 23.00%
DOI: 10.14778/3681954.3682016

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
10,332 | Cephalopod - Virtual Data Model Composition through Partial Query Translation | 2025 | CIDR | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 22 of 22 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
44 | The Design Of Postgres | 1986 | SIGMOD | 0.00071838587
60 | Efficiently Compiling Efficient Query Plans for Modern Hardware | 2011 | VLDB | 0.00064439773
123 | A Decomposition Storage Model | 1985 | SIGMOD | 0.00045255007
192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858
299 | Trio: A System for Data, Uncertainty, and Lineage | 2006 | VLDB | 0.00028525071
623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374
853 | Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask | 2018 | VLDB | 0.00015940507
1,108 | Froid: Optimization of Imperative Programs in a Relational Database | 2018 | VLDB | 0.00013984276
1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695
1,546 | KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 2015 | SIGMOD | 0.00011446851
2,014 | Voodoo - A Vector Algebra for Portable Database Performance on Modern Hardware | 2016 | VLDB | 9.7904029e-05
2,276 | Mind the Gap: An Experimental Evaluation of Imputation of Missing Values Techniques in Time Series | 2020 | VLDB | 9.1261944e-05
2,443 | Data Management for Data Science: Towards Embedded Analytics | 2020 | CIDR | 8.8078476e-05
2,573 | Query Optimization for Dynamic Imputation | 2017 | VLDB | 8.518235e-05
3,353 | Managing Expressions as Data in Relational Database Systems | 2003 | CIDR | 7.1843143e-05
3,576 | Closed World Databases Opened Through Null Values | 1988 | VLDB | 6.9506798e-05
4,582 | BlackMagic: Automatic Inlining of Scalar UDFs into SQL Queries with Froid | 2019 | VLDB | 6.070187e-05
4,924 | User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases | 2022 | VLDB | 5.822682e-05
5,153 | Horizon: Scalable Dependency-driven Data Cleaning | 2021 | VLDB | 5.6607963e-05
5,779 | Lenses: An On-Demand Approach to ETL | 2015 | VLDB | 5.3307398e-05
6,212 | Snakes on a Plan: Compiling Python Functions into Plain SQL Queries | 2022 | SIGMOD | 5.1552576e-05
9,240 | ZIP: Lazy Imputation during Query Processing | 2024 | VLDB | 4.3690661e-05

Semantically Similar Papers