Back to papers
DIM-SUM: Dynamic IMputation for Smart Utility Management
Summary: DIM-SUM is a preprocessing framework that learns realistic missing-data distributions via pattern clustering and adaptive masking to train robust imputation models with provable learning guarantees. On >2B infrastructure readings it outperforms large pre-trained models (~2x accuracy) while using less training data and cutting processing and inference time.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 14058
- Venue
- VLDB
- Year
- 2025
- Pagerank
- 4.1945683e-05
- Overall Rank
- 10,744 | 25.26%
- DOI
-
10.14778/3749646.3749705
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 192 |
HoloClean: Holistic Data Repairs with Probabilistic Inference |
2017 |
VLDB |
0.00035728858 |
| 2,276 |
Mind the Gap: An Experimental Evaluation of Imputation of Missing Values Techniques in Time Series |
2020 |
VLDB |
9.1261944e-05 |
| 2,573 |
Query Optimization for Dynamic Imputation |
2017 |
VLDB |
8.518235e-05 |
| 2,644 |
Series2Graph: Graph-based Subsequence Anomaly Detection for Time Series |
2020 |
VLDB |
8.3832357e-05 |
| 3,311 |
Efficient and Effective Data Imputation with Influence Functions |
2022 |
VLDB |
7.2406486e-05 |
| 4,332 |
Missing Value Imputation on Multidimensional Time Series |
2021 |
VLDB |
6.2805243e-05 |
| 5,028 |
Adaptive Data Augmentation for Supervised Learning over Missing Data |
2021 |
VLDB |
5.7506746e-05 |
| 5,629 |
DAMR: Dynamic Adjacency Matrix Representation Learning for Multivariate Time Series Imputation |
2023 |
SIGMOD |
5.4025905e-05 |
| 5,777 |
ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection |
2024 |
VLDB |
5.3308813e-05 |
| 6,600 |
Missing Data Imputation with Uncertainty-Driven Network |
2024 |
SIGMOD |
4.9972581e-05 |
| 6,727 |
ORBITS: Online Recovery of Missing Values in Multiple Time Series Streams |
2021 |
VLDB |
4.9483604e-05 |
| 9,240 |
ZIP: Lazy Imputation during Query Processing |
2024 |
VLDB |
4.3690661e-05 |
| 9,242 |
ImputeVIS: An Interactive Evaluator to Benchmark Imputation Techniques for Time Series Data |
2024 |
VLDB |
4.3690661e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 11,050 |
Win-Win: On Simultaneous Clustering and Imputing over Incomplete Data |
2024 |
VLDB |
4.1945683e-05 |
| 10,953 |
Certain and Approximately Certain Models for Statistical Learning |
2024 |
SIGMOD |
4.1945683e-05 |
| 2,573 |
Query Optimization for Dynamic Imputation |
2017 |
VLDB |
8.518235e-05 |
| 2,276 |
Mind the Gap: An Experimental Evaluation of Imputation of Missing Values Techniques in Time Series |
2020 |
VLDB |
9.1261944e-05 |
| 13,181 |
Spatio-Temporal Denoising Graph Autoencoders with Data Augmentation for Photovoltaic Data Imputation |
2023 |
SIGMOD |
- |
| 7,400 |
Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation |
2024 |
VLDB |
4.7397846e-05 |
| 3,311 |
Efficient and Effective Data Imputation with Influence Functions |
2022 |
VLDB |
7.2406486e-05 |
| 4,332 |
Missing Value Imputation on Multidimensional Time Series |
2021 |
VLDB |
6.2805243e-05 |
| 5,629 |
DAMR: Dynamic Adjacency Matrix Representation Learning for Multivariate Time Series Imputation |
2023 |
SIGMOD |
5.4025905e-05 |
| 9,242 |
ImputeVIS: An Interactive Evaluator to Benchmark Imputation Techniques for Time Series Data |
2024 |
VLDB |
4.3690661e-05 |