Back to papers
Sequential Data Cleaning: A Statistical Approach
Summary: Statistical approach to cleaning sequential data by modeling speed-change distributions rather than hard speed constraints. It models a likelihood-maximizing repair problem; NP-hard; heuristic algorithms; experiments show superiority to constraint-based cleaning.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 5260
- Venue
- SIGMOD
- Year
- 2016
- Pagerank
- 5.7671075e-05
- Overall Rank
- 5,002 | 65.21%
- DOI
-
10.1145/2882903.2915233
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 13 of 13 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 3,825 |
Cleanits: A Data Cleaning System for Industrial Time Series |
2019 |
VLDB |
6.7255837e-05 |
| 3,967 |
Apache IoTDB: A Time Series Database for IoT Applications |
2023 |
SIGMOD |
6.5796647e-05 |
| 6,451 |
Multivariate Time Series Cleaning under Speed Constraints |
2024 |
SIGMOD |
5.0583324e-05 |
| 7,223 |
Akane: Perplexity-Guided Time Series Data Cleaning |
2024 |
SIGMOD |
4.7965857e-05 |
| 7,391 |
Time Series Data Validity |
2023 |
SIGMOD |
4.7429293e-05 |
| 8,005 |
Online Topic-Aware Entity Resolution Over Incomplete Data Streams |
2021 |
SIGMOD |
4.6081461e-05 |
| 9,494 |
Spatial Data Quality in the IoT Era: Management and Exploitation |
2022 |
SIGMOD |
4.3341665e-05 |
| 9,560 |
MTSClean: Efficient Constraint-based Cleaning for Multi-Dimensional Time Series Data |
2024 |
VLDB |
4.3254416e-05 |
| 10,061 |
Cleaning Time Series under Seasonal and Trend Constraints |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,081 |
From Suspicious Errors to Valid Data: On Repairing Spatio-Temporal Data via Spatial and Temporal Dependencies |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,211 |
SHoTClean: Bridging Soft and Hard Constraints for Multivariate Time Series Cleaning |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,511 |
The Best of Both Worlds: On Repairing Timestamps and Attribute Values for Multivariate Time Series |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,965 |
High Precision ≠ High Cost: Temporal Data Fusion for Multiple Low-Precision Sensors |
2024 |
SIGMOD |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 2,823 |
Interaction between Record Matching and Data Repairing |
2011 |
SIGMOD |
8.0593894e-05 |
| 10,511 |
The Best of Both Worlds: On Repairing Timestamps and Attribute Values for Multivariate Time Series |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,026 |
Minimum Change ≠ Best Cleaning: Parallel and Incremental Error Detection under Integrity Constraints |
2026 |
SIGMOD |
4.1945683e-05 |
| 9,560 |
MTSClean: Efficient Constraint-based Cleaning for Multi-Dimensional Time Series Data |
2024 |
VLDB |
4.3254416e-05 |
| 6,583 |
SCREEN: Stream Data Cleaning under Speed Constraints |
2015 |
SIGMOD |
5.0027988e-05 |
| 10,081 |
From Suspicious Errors to Valid Data: On Repairing Spatio-Temporal Data via Spatial and Temporal Dependencies |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,061 |
Cleaning Time Series under Seasonal and Trend Constraints |
2026 |
SIGMOD |
4.1945683e-05 |
| 6,451 |
Multivariate Time Series Cleaning under Speed Constraints |
2024 |
SIGMOD |
5.0583324e-05 |
| 10,855 |
bNDCRepair: Cleaning both Data Errors and Inaccurate Constraints on Numerical Sequential Data |
2025 |
VLDB |
4.1945683e-05 |
| 3,133 |
Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing |
2017 |
VLDB |
7.4978041e-05 |