NADEEF: A Commodity Data Cleaning System
Summary: NADEEF is a commodity, end-to-end data cleaning platform with a programmable rule interface and a core that handles error detection and repair. It goes beyond conditional functional dependencies (CFDs), matching dependencies (MDs), and ETL rules; the core cleans data holistically and ships with two repair-algorithm implementations. The system is validated on real data. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Michele Dallachiesa
2. Amr Ebaid
3. Ahmed Eldawy
4. Ahmed Elmagarmid
5. Ihab F. Ilyas
6. Mourad Ouzzani
7. Nan Tang
Incoming Citations (Sorted by Pagerank)
Showing 3 of 53 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,216 | Demystifying the QoS and QoE of Edge-hosted Video Streaming Applications in the Wild with SNESet | 2023 | SIGMOD | 4.1945683e-05 |
| 11,682 | IHCS: An Integrated Hybrid Cleaning System | 2019 | VLDB | 4.1945683e-05 |
| 11,841 | BART in Action: Error Generation and Empirical Evaluations of Data-Cleaning Systems | 2016 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 112 | Potter's Wheel: An Interactive Data Cleaning System | 2001 | VLDB | 0.00047045036 |
| 199 | Declarative Data Cleaning: Language, Model, and Algorithms | 2001 | VLDB | 0.00035041015 |
| 265 | A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification | 2005 | SIGMOD | 0.00029763412 |
| 560 | Dependencies Revisited for Improving Data Quality | 2008 | PODS | 0.00020141923 |
| 623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374 |
| 656 | ERACER: A Database Approach for Statistical Inference and Data Cleaning | 2010 | SIGMOD | 0.00018588729 |
| 702 | Reasoning about Record Matching Rules | 2009 | VLDB | 0.00017918203 |
| 833 | Guided Data Repair | 2011 | VLDB | 0.00016138432 |
| 1,159 | Towards Certain Fixes with Editing Rules and Master Data | 2010 | VLDB | 0.00013592813 |
| 2,823 | Interaction between Record Matching and Data Repairing | 2011 | SIGMOD | 8.0593894e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 199 | Declarative Data Cleaning: Language, Model, and Algorithms | 2001 | VLDB | 0.00035041015 |
| 2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05 |
| 2,460 | Combining Quantitative and Logical Data Cleaning | 2016 | VLDB | 8.7617484e-05 |
| 9,278 | Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples | 2016 | SIGMOD | 4.3639892e-05 |
| 623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374 |
| 5,803 | Semandaq: A Data Quality System Based on Conditional Functional Dependencies | 2008 | VLDB | 5.3205861e-05 |
| 5,660 | Descriptive and Prescriptive Data Cleaning | 2014 | SIGMOD | 5.3847321e-05 |
| 732 | Discovering Data Quality Rules | 2008 | VLDB | 0.00017465093 |
| 3,582 | NADEEF/ER: Generic and Interactive Entity Resolution | 2014 | SIGMOD | 6.9479263e-05 |
| 6,350 | NADEEF: A Generalized Data Cleaning System | 2013 | VLDB | 5.101815e-05 |