Combining Quantitative and Logical Data Cleaning
Summary: Synergizes quantitative data cleaning (statistical distortion via EMD) with logical data cleaning using metric functional dependencies to detect and repair quality problems. Key results: linear-time MFD inference; NP-hardness of distortion-minimal repairs; efficient set-minimal repair with axiomatization; empirical distortion gains. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 19 of 19 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 49 | Consistent Query Answers in Inconsistent Databases | 1999 | PODS | 0.00067660624 |
| 199 | Declarative Data Cleaning: Language, Model, and Algorithms | 2001 | VLDB | 0.00035041015 |
| 265 | A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification | 2005 | SIGMOD | 0.00029763412 |
| 702 | Reasoning about Record Matching Rules | 2009 | VLDB | 0.00017918203 |
| 881 | Don’t be SCAREd: Use SCalable Automatic REpairing with Maximal Likelihood and Bounded Changes | 2013 | SIGMOD | 0.00015661103 |
| 1,624 | Sampling the Repairs of Functional Dependency Violations under Hard Constraints | 2010 | VLDB | 0.00011099222 |
| 1,664 | On Multi-Column Foreign Key Discovery | 2010 | VLDB | 0.00010976887 |
| 2,018 | Statistical Distortion: Consequences of Data Cleaning | 2012 | VLDB | 9.7764643e-05 |
| 2,159 | Sequential Dependencies | 2009 | VLDB | 9.4130956e-05 |
| 3,105 | Data X-Ray: A Diagnostic Tool for Data Errors | 2015 | SIGMOD | 7.5568954e-05 |
| 5,326 | Earth Mover's Distance based Similarity Search at Scale | 2014 | VLDB | 5.5680074e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,660 | Descriptive and Prescriptive Data Cleaning | 2014 | SIGMOD | 5.3847321e-05 |
| 1,159 | Towards Certain Fixes with Editing Rules and Master Data | 2010 | VLDB | 0.00013592813 |
| 9,369 | Constraint-Variance Tolerant Data Repairing | 2016 | SIGMOD | 4.3481081e-05 |
| 3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05 |
| 7,702 | Counting and Enumerating (Preferred) Database Repairs | 2017 | PODS | 4.6736471e-05 |
| 10,081 | From Suspicious Errors to Valid Data: On Repairing Spatio-Temporal Data via Spatial and Temporal Dependencies | 2026 | SIGMOD | 4.1945683e-05 |
| 1,624 | Sampling the Repairs of Functional Dependency Violations under Hard Constraints | 2010 | VLDB | 0.00011099222 |
| 3,192 | Towards Dependable Data Repairing with Fixing Rules | 2014 | SIGMOD | 7.4095761e-05 |
| 2,823 | Interaction between Record Matching and Data Repairing | 2011 | SIGMOD | 8.0593894e-05 |
| 623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374 |