Constraint-Variance Tolerant Data Repairing
Summary: Introduces theta-tolerant data repair to handle imprecise constraints by allowing limited predicate insertions/deletions (variation ≤ theta). Finds a minimum repair that satisfies at least one constraint variant, via a single-round, sharing-enabled algorithm, showing improved accuracy on real datasets. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Shaoxu Song
- 2. Han Zhu
- 3. Jianmin Wang
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05 |
| 9,348 | GIDCL: A Graph-Enhanced Interpretable Data Cleaning Framework with Large Language Models | 2024 | SIGMOD | 4.3526427e-05 |
| 10,855 | bNDCRepair: Cleaning both Data Errors and Inaccurate Constraints on Numerical Sequential Data | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 265 | A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification | 2005 | SIGMOD | 0.00029763412 |
| 555 | Discovering Denial Constraints | 2013 | VLDB | 0.00020254908 |
| 560 | Dependencies Revisited for Improving Data Quality | 2008 | PODS | 0.00020141923 |
| 1,188 | On Generating Near-Optimal Tableaux for Conditional Functional Dependencies | 2008 | VLDB | 0.00013441729 |
| 2,159 | Sequential Dependencies | 2009 | VLDB | 9.4130956e-05 |
| 2,567 | Resolving Conflicts in Heterogeneous Data by Truth Discovery and Source Reliability Estimation | 2014 | SIGMOD | 8.5239306e-05 |
| 6,583 | SCREEN: Stream Data Cleaning under Speed Constraints | 2015 | SIGMOD | 5.0027988e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,840 | The Cost of Representation by Subset Repairs | 2025 | VLDB | 4.4388652e-05 |
| 2,460 | Combining Quantitative and Logical Data Cleaning | 2016 | VLDB | 8.7617484e-05 |
| 10,026 | Minimum Change ≠ Best Cleaning: Parallel and Incremental Error Detection under Integrity Constraints | 2026 | SIGMOD | 4.1945683e-05 |
| 265 | A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification | 2005 | SIGMOD | 0.00029763412 |
| 2,823 | Interaction between Record Matching and Data Repairing | 2011 | SIGMOD | 8.0593894e-05 |
| 3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05 |
| 1,159 | Towards Certain Fixes with Editing Rules and Master Data | 2010 | VLDB | 0.00013592813 |
| 1,624 | Sampling the Repairs of Functional Dependency Violations under Hard Constraints | 2010 | VLDB | 0.00011099222 |
| 3,192 | Towards Dependable Data Repairing with Fixing Rules | 2014 | SIGMOD | 7.4095761e-05 |
| 623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374 |