Database Paper Browser

Back to papers

Query Optimization for Dynamic Imputation

Summary: ImputeDB fuses imputation with a cost-based optimizer to do on-the-fly cleaning per query. Choosing where to apply imputations yields 10–140x speedups over pre-imputation, with 0–8% result change and 0–21% data loss versus dropping missing values. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
11416
Venue
VLDB
Year
2017
Pagerank
8.518235e-05
Overall Rank
2,573 | 82.11%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 18 of 18 citing papers.

Rank Citing Paper Year Venue Pagerank
2,122 SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle 2020 CIDR 9.4989076e-05
2,276 Mind the Gap: An Experimental Evaluation of Imputation of Missing Values Techniques in Time Series 2020 VLDB 9.1261944e-05
4,273 Cleaning Denial Constraint Violations through Relaxation 2020 SIGMOD 6.3003864e-05
4,332 Missing Value Imputation on Multidimensional Time Series 2021 VLDB 6.2805243e-05
6,727 ORBITS: Online Recovery of Missing Values in Multiple Time Series Streams 2021 VLDB 4.9483604e-05
6,912 CYADB: A Database that Covers Your Ask 2018 VLDB 4.8925595e-05
7,306 DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines 2022 CIDR 4.7678574e-05
7,328 BOSS - An Architecture for Database Kernel Composition 2024 VLDB 4.7610909e-05
7,400 Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation 2024 VLDB 4.7397846e-05
7,704 ExDRa: Exploratory Data Science on Federated Raw Data 2021 SIGMOD 4.6733838e-05
8,092 Saga: A Scalable Framework for Optimizing Data Cleaning Pipelines for Machine Learning Applications 2023 SIGMOD 4.587921e-05
9,049 JENNER: Just-in-time Enrichment in Query Processing 2022 VLDB 4.4039656e-05
9,240 ZIP: Lazy Imputation during Query Processing 2024 VLDB 4.3690661e-05
9,242 ImputeVIS: An Interactive Evaluator to Benchmark Imputation Techniques for Time Series Data 2024 VLDB 4.3690661e-05
9,856 In-Database Data Imputation 2024 SIGMOD 4.269353e-05
10,617 Deduplicated Sampling On-Demand 2025 VLDB 4.1945683e-05
10,744 DIM-SUM: Dynamic IMputation for Smart Utility Management 2025 VLDB 4.1945683e-05
11,069 Hardware-Efficient Data Imputation through DBMS Extensibility 2024 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
2,659 Multi-Objective Parametric Query Optimization 2015 VLDB 8.3604734e-05
3,445 Processing Forecasting Queries 2007 VLDB 7.08644e-05
5,549 Query Processing over Incomplete Autonomous Databases 2007 VLDB 5.4428494e-05
5,779 Lenses: An On-Demand Approach to ETL 2015 VLDB 5.3307398e-05
6,487 Inter-Operator Feedback in Data Stream Management Systems via Punctuation 2009 CIDR 5.0435729e-05
Previous Page 1 / 1 Next

Semantically Similar Papers