ERACER: A Database Approach for Statistical Inference and Data Cleaning
Summary: ERACER is a framework that infers missing values and corrects errors using belief propagation on relational dependency networks, and it can be implemented in SQL with user-defined functions. It applies shrinkage to infer correct values from dirty data, handles cyclic dependencies, and, using approximate inference, reaches accuracy comparable to a Bayesian baseline on synthetic data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 32 of 32 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 168 | MAD Skills: New Analysis Practices for Big Data | 2009 | VLDB | 0.00038946305 |
| 265 | A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification | 2005 | SIGMOD | 0.00029763412 |
| 623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374 |
| 1,569 | Querying Continuous Functions in a Database System | 2008 | SIGMOD | 0.0001132337 |
| 2,379 | A Revival of Integrity Constraints for Data Cleaning | 2008 | VLDB | 8.9392633e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 3,105 | Data X-Ray: A Diagnostic Tool for Data Errors | 2015 | SIGMOD | 7.5568954e-05 |
| 881 | Don’t be SCAREd: Use SCalable Automatic REpairing with Maximal Likelihood and Bounded Changes | 2013 | SIGMOD | 0.00015661103 |
| 5,952 | Eraser: Eliminating Performance Regression on Learned Query Optimizer | 2024 | VLDB | 5.2591691e-05 |
| 7,867 | Learning Over Dirty Data Without Cleaning | 2020 | SIGMOD | 4.6320452e-05 |
| 5,445 | QFix: Diagnosing Errors through Query Histories | 2017 | SIGMOD | 5.5020909e-05 |
| 14,327 | ERQ: Controlled Inference and Instruction Techniques for DBMS Query Languages | 1982 | SIGMOD | - |
| 623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374 |
| 10,026 | Minimum Change ≠ Best Cleaning: Parallel and Incremental Error Detection under Integrity Constraints | 2026 | SIGMOD | 4.1945683e-05 |
| 12,800 | Enhancing Database Correctness: A Statistical Approach | 1995 | SIGMOD | 4.1945683e-05 |