Database Paper Browser

Minimum Change ≠ Best Cleaning: Parallel and Incremental Error Detection under Integrity Constraints

Summary: Shows that minimum-change repairs fail when errors co-occur; instead of making minimal edits, uses Bayesian inference to score conflicting cells and pinpoint erroneous attribute values. Introduces provably scalable parallel conflict detection and incremental, parallel error-detection algorithms. (summarized by gpt-5-mini on Feb 11 2026)

Paper ID: 7331
Venue: SIGMOD
Year: 2026
Pagerank: 4.1945683e-05
Overall Rank: 10,026 | 30.26%
DOI: 10.1145/3749174

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 22 of 22 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858
555 | Discovering Denial Constraints | 2013 | VLDB | 0.00020254908
656 | ERACER: A Database Approach for Statistical Inference and Data Cleaning | 2010 | SIGMOD | 0.00018588729
791 | ActiveClean: Interactive Data Cleaning For Statistical Modeling | 2016 | VLDB | 0.00016629664
881 | Don’t be SCAREd: Use SCalable Automatic REpairing with Maximal Likelihood and Bounded Changes | 2013 | SIGMOD | 0.00015661103
1,337 | HoloDetect: Few-Shot Learning for Error Detection | 2019 | SIGMOD | 0.00012497164
1,546 | KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 2015 | SIGMOD | 0.00011446851
1,612 | Detecting Data Errors: Where are we and what needs to be done? | 2016 | VLDB | 0.00011142794
1,624 | Sampling the Repairs of Functional Dependency Violations under Hard Constraints | 2010 | VLDB | 0.00011099222
2,506 | Auto-Detect: Data-Driven Error Detection in Tables | 2018 | SIGMOD | 8.6335464e-05
2,638 | Messing Up with BART: Error Generation for Evaluating Data-Cleaning Algorithms | 2016 | VLDB | 8.399764e-05
2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05
2,968 | Raha: A Configuration-Free Error Detection System | 2019 | SIGMOD | 7.7985097e-05
3,299 | SCODED: Statistical Constraint Oriented Data Error Detection | 2020 | SIGMOD | 7.2546659e-05
3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05
5,153 | Horizon: Scalable Dependency-driven Data Cleaning | 2021 | VLDB | 5.6607963e-05
5,618 | Explaining Repaired Data with CFDs | 2018 | VLDB | 5.4079415e-05
6,690 | Parallel Discrepancy Detection and Incremental Detection | 2021 | VLDB | 4.9621556e-05
7,703 | Uniform Operational Consistent Query Answering | 2022 | PODS | 4.673644e-05
8,836 | Fast Approximate Denial Constraint Discovery | 2023 | VLDB | 4.4393184e-05
9,748 | Combined Approximations for Uniform Operational Consistent Query Answering | 2024 | PODS | 4.2897489e-05
9,749 | Efficient Differential Dependency Discovery | 2024 | VLDB | 4.2897489e-05
Semantically Similar Papers