In-Database Data Imputation
Summary: The paper performs data imputation inside the database using Multivariate Imputation by Chained Equations (MICE), with computation sharing and a ring abstraction to speed up training. It learns stochastic linear regression in-database for imputing continuous attributes and Gaussian discriminant analysis for imputing categorical attributes. Implementations in PostgreSQL and DuckDB outperform prior MICE and model-based methods by up to two orders of magnitude while preserving relationships in the data. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Massimo Perini
2. Milos Nikolic
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Overall Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,617 | Deduplicated Sampling On-Demand | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 35 of 35 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,050 | Win-Win: On Simultaneous Clustering and Imputing over Incomplete Data | 2024 | VLDB | 4.1945683e-05 |
| 9,240 | ZIP: Lazy Imputation during Query Processing | 2024 | VLDB | 4.3690661e-05 |
| 7,400 | Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation | 2024 | VLDB | 4.7397846e-05 |
| 5,253 | Enriching Data Imputation with Extensive Similarity Neighbors | 2015 | VLDB | 5.6014916e-05 |
| 8,138 | Fast and Reliable Missing Data Contingency Analysis with Predicate-Constraints | 2020 | SIGMOD | 4.5771031e-05 |
| 6,600 | Missing Data Imputation with Uncertainty-Driven Network | 2024 | SIGMOD | 4.9972581e-05 |
| 9,479 | Data Imputation with Limited Data Redundancy Using Data Lakes | 2025 | VLDB | 4.3341665e-05 |
| 3,311 | Efficient and Effective Data Imputation with Influence Functions | 2022 | VLDB | 7.2406486e-05 |
| 10,953 | Certain and Approximately Certain Models for Statistical Learning | 2024 | SIGMOD | 4.1945683e-05 |
| 2,573 | Query Optimization for Dynamic Imputation | 2017 | VLDB | 8.518235e-05 |