Back to papers
MTSClean: Efficient Constraint-based Cleaning for Multi-Dimensional Time Series Data
Summary: Introduces MTSClean and MTSClean-soft, online constraint-based cleaners for multi-dimensional time series that jointly exploit row and column constraints to handle complex attribute interdependencies and persistent errors. Achieves huge speedups (MTSClean from O((NM)^3.5|Σ|)→O(NM^3.5|Σ|); MTSClean-soft O(NM^2)) via key-cell search and a novel repair-cost, empirically outperforming nine baselines on repair quality and runtime.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 13726
- Venue
- VLDB
- Year
- 2024
- Pagerank
- 4.3254416e-05
- Overall Rank
- 9,560 | 33.50%
- DOI
-
10.14778/3704965.3704987
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 17 of 17 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 192 |
HoloClean: Holistic Data Repairs with Probabilistic Inference |
2017 |
VLDB |
0.00035728858 |
| 265 |
A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification |
2005 |
SIGMOD |
0.00029763412 |
| 833 |
Guided Data Repair |
2011 |
VLDB |
0.00016138432 |
| 1,337 |
HoloDetect: Few-Shot Learning for Error Detection |
2019 |
SIGMOD |
0.00012497164 |
| 1,921 |
Apache IoTDB: Time-series Database for Internet of Things |
2020 |
VLDB |
0.00010082827 |
| 3,133 |
Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing |
2017 |
VLDB |
7.4978041e-05 |
| 3,299 |
SCODED: Statistical Constraint Oriented Data Error Detection |
2020 |
SIGMOD |
7.2546659e-05 |
| 3,825 |
Cleanits: A Data Cleaning System for Industrial Time Series |
2019 |
VLDB |
6.7255837e-05 |
| 3,967 |
Apache IoTDB: A Time Series Database for IoT Applications |
2023 |
SIGMOD |
6.5796647e-05 |
| 5,002 |
Sequential Data Cleaning: A Statistical Approach |
2016 |
SIGMOD |
5.7671075e-05 |
| 5,152 |
Learning-Based Cleansing for Indoor RFID Data |
2016 |
SIGMOD |
5.6609383e-05 |
| 6,187 |
Semi-Supervised Data Cleaning with Raha and Baran |
2021 |
CIDR |
5.1656857e-05 |
| 6,440 |
An Experimental Evaluation of Anomaly Detection in Time Series |
2024 |
VLDB |
5.0603878e-05 |
| 6,583 |
SCREEN: Stream Data Cleaning under Speed Constraints |
2015 |
SIGMOD |
5.0027988e-05 |
| 7,202 |
Conformance Constraint Discovery: Measuring Trust in Data-Driven Systems |
2021 |
SIGMOD |
4.8023314e-05 |
| 7,391 |
Time Series Data Validity |
2023 |
SIGMOD |
4.7429293e-05 |
| 9,558 |
Clean4TSDB: A Data Cleaning Tool for Time Series Databases |
2024 |
VLDB |
4.3254416e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 7,449 |
OTClean: Data Cleaning for Conditional Independence Violations using Optimal Transport |
2024 |
SIGMOD |
4.7269357e-05 |
| 10,026 |
Minimum Change ≠ Best Cleaning: Parallel and Incremental Error Detection under Integrity Constraints |
2026 |
SIGMOD |
4.1945683e-05 |
| 11,881 |
Cleaning Timestamps with Temporal Constraints |
2016 |
VLDB |
4.1945683e-05 |
| 3,133 |
Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing |
2017 |
VLDB |
7.4978041e-05 |
| 3,825 |
Cleanits: A Data Cleaning System for Industrial Time Series |
2019 |
VLDB |
6.7255837e-05 |
| 10,511 |
The Best of Both Worlds: On Repairing Timestamps and Attribute Values for Multivariate Time Series |
2025 |
SIGMOD |
4.1945683e-05 |
| 9,558 |
Clean4TSDB: A Data Cleaning Tool for Time Series Databases |
2024 |
VLDB |
4.3254416e-05 |
| 10,061 |
Cleaning Time Series under Seasonal and Trend Constraints |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,211 |
SHoTClean: Bridging Soft and Hard Constraints for Multivariate Time Series Cleaning |
2026 |
SIGMOD |
4.1945683e-05 |
| 6,451 |
Multivariate Time Series Cleaning under Speed Constraints |
2024 |
SIGMOD |
5.0583324e-05 |