SHoTClean: Bridging Soft and Hard Constraints for Multivariate Time Series Cleaning
Summary: SHoTClean reframes multivariate time-series cleaning as a constrained optimization problem that combines hard physical bounds with soft statistical-consistency constraints, targeting subtle dirty points that earlier, largely univariate methods miss. It offers offline and streaming algorithms (pruned and incremental dynamic programming, with near-linear acceleration) and a causal variant that models inter-variable dependencies.
(summarized by gpt-5-mini on Apr 11 2026)
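To make the hard/soft distinction in the summary concrete, here is a minimal Python sketch. It is not SHoTClean's algorithm: the function `clean_series`, its parameters (`radius`, `k`, `min_scale`), and the median-based consistency test are all assumptions chosen for illustration, and the sketch handles a single variable with no dynamic programming or causal modelling.

```python
from statistics import median


def clean_series(values, lower, upper, radius=2, k=3.0, min_scale=1e-6):
    """Toy two-stage repair: hard bounds first, then a soft consistency check.

    Hypothetical illustration only; not the method from the paper.
    """
    # Hard constraint: every value must lie inside the physical range.
    bounded = [min(max(v, lower), upper) for v in values]

    cleaned = list(bounded)
    for i, v in enumerate(bounded):
        # Neighbours within `radius` positions, excluding the point itself.
        start, stop = max(0, i - radius), min(len(bounded), i + radius + 1)
        neighbours = [bounded[j] for j in range(start, stop) if j != i]
        if not neighbours:
            continue
        centre = median(neighbours)
        # Median absolute deviation of the neighbours as a robust scale.
        scale = max(median(abs(n - centre) for n in neighbours), min_scale)
        # Soft constraint: repair only points far from their local context.
        if abs(v - centre) > k * scale:
            cleaned[i] = centre
    return cleaned


readings = [10.0, 10.2, 9.9, 500.0, 10.1, 10.0, 13.0, 10.2, 9.8]
cleaned = clean_series(readings, lower=0.0, upper=50.0)
```

In this example, 500.0 violates the hard bound, while 13.0 sits inside the bounds but is inconsistent with its neighbours. The first is the kind of error a range check alone catches; the second is the kind of subtle dirty point the summary says soft constraints are meant to catch. Both are replaced by the median of their neighbours.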
- Paper ID: 7523
- Venue: SIGMOD
- Year: 2026
- Pagerank: 4.1945683e-05
- Overall Rank: 10,211 | 28.97%
- DOI: 10.1145/3786698
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
Outgoing Citations (Sorted by Pagerank)
Showing 16 of 16 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 161 | LOF: Identifying Density-Based Local Outliers | 2000 | SIGMOD | 0.00039846974 |
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 265 | A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification | 2005 | SIGMOD | 0.00029763412 |
| 2,159 | Sequential Dependencies | 2009 | VLDB | 9.4130956e-05 |
| 2,290 | TranAD: Deep Transformer Networks for Anomaly Detection in Multivariate Time Series Data | 2022 | VLDB | 9.0934125e-05 |
| 2,298 | TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods | 2024 | VLDB | 9.0742746e-05 |
| 3,825 | Cleanits: A Data Cleaning System for Industrial Time Series | 2019 | VLDB | 6.7255837e-05 |
| 5,002 | Sequential Data Cleaning: A Statistical Approach | 2016 | SIGMOD | 5.7671075e-05 |
| 5,152 | Learning-Based Cleansing for Indoor RFID Data | 2016 | SIGMOD | 5.6609383e-05 |
| 5,777 | ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection | 2024 | VLDB | 5.3308813e-05 |
| 6,451 | Multivariate Time Series Cleaning under Speed Constraints | 2024 | SIGMOD | 5.0583324e-05 |
| 6,583 | SCREEN: Stream Data Cleaning under Speed Constraints | 2015 | SIGMOD | 5.0027988e-05 |
| 7,223 | Akane: Perplexity-Guided Time Series Data Cleaning | 2024 | SIGMOD | 4.7965857e-05 |
| 7,564 | PIClean: A Probabilistic and Interactive Data Cleaning System | 2019 | SIGMOD | 4.7093702e-05 |
| 9,558 | Clean4TSDB: A Data Cleaning Tool for Time Series Databases | 2024 | VLDB | 4.3254416e-05 |
| 9,560 | MTSClean: Efficient Constraint-based Cleaning for Multi-Dimensional Time Series Data | 2024 | VLDB | 4.3254416e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 1,627 | Data Cleaning: Overview and Emerging Challenges | 2016 | SIGMOD | 0.00011086905 |
| 7,223 | Akane: Perplexity-Guided Time Series Data Cleaning | 2024 | SIGMOD | 4.7965857e-05 |
| 3,825 | Cleanits: A Data Cleaning System for Industrial Time Series | 2019 | VLDB | 6.7255837e-05 |
| 6,583 | SCREEN: Stream Data Cleaning under Speed Constraints | 2015 | SIGMOD | 5.0027988e-05 |
| 7,449 | OTClean: Data Cleaning for Conditional Independence Violations using Optimal Transport | 2024 | SIGMOD | 4.7269357e-05 |
| 10,511 | The Best of Both Worlds: On Repairing Timestamps and Attribute Values for Multivariate Time Series | 2025 | SIGMOD | 4.1945683e-05 |
| 9,558 | Clean4TSDB: A Data Cleaning Tool for Time Series Databases | 2024 | VLDB | 4.3254416e-05 |
| 10,061 | Cleaning Time Series under Seasonal and Trend Constraints | 2026 | SIGMOD | 4.1945683e-05 |
| 6,451 | Multivariate Time Series Cleaning under Speed Constraints | 2024 | SIGMOD | 5.0583324e-05 |
| 9,560 | MTSClean: Efficient Constraint-based Cleaning for Multi-Dimensional Time Series Data | 2024 | VLDB | 4.3254416e-05 |