Database Paper Browser

Declarative Data Cleaning: Language, Model, and Algorithms

Summary: Declarative data cleaning language, execution model, and algorithms for cleaning workflows. Key novelty: separation of logical specification from physical execution, explainable results, and interactive tuning for data integration. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 8761
Venue: VLDB
Year: 2001
Pagerank: 0.00035041015
Overall Rank: 199 | 98.62%
DOI: -

Authors

Incoming Citations (Sorted by Pagerank)

Showing 35 of 35 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
149 | Trio: A System for Integrated Management of Data, Accuracy, and Lineage | 2005 | CIDR | 0.00041101118
229 | Reference Reconciliation in Complex Information Spaces | 2005 | SIGMOD | 0.00032242633
265 | A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification | 2005 | SIGMOD | 0.00029763412
280 | Eliminating Fuzzy Duplicates in Data Warehouses | 2002 | VLDB | 0.00029113044
355 | Hippocratic Databases | 2002 | VLDB | 0.00026087195
518 | Data Integration for the Relational Web | 2009 | VLDB | 0.00021158934
623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374
627 | Management of Probabilistic Data: Foundations and Challenges | 2007 | PODS | 0.00018959005
702 | Reasoning about Record Matching Rules | 2009 | VLDB | 0.00017918203
732 | Discovering Data Quality Rules | 2008 | VLDB | 0.00017465093
893 | Data Integration: The Teenage Years | 2006 | VLDB | 0.00015558352
1,012 | NADEEF: A Commodity Data Cleaning System | 2013 | SIGMOD | 0.0001464733
1,345 | Entity Matching: How Similar Is Similar | 2011 | VLDB | 0.00012468408
1,482 | Automating Large-Scale Data Quality Verification | 2018 | VLDB | 0.00011725533
1,533 | Example-driven Design of Efficient Record Matching Queries | 2007 | VLDB | 0.00011471971
2,175 | Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services | 2017 | SIGMOD | 9.3644117e-05
2,460 | Combining Quantitative and Logical Data Cleaning | 2016 | VLDB | 8.7617484e-05
2,514 | Comparative Analysis of Approximate Blocking Techniques for Entity Resolution | 2016 | VLDB | 8.6139012e-05
2,589 | DogmatiX Tracks down Duplicates in XML | 2005 | SIGMOD | 8.4847146e-05
3,267 | Benchmarking Declarative Approximate Selection Predicates | 2007 | SIGMOD | 7.3058429e-05
3,360 | Modeling and Querying Possible Repairs in Duplicate Detection | 2009 | VLDB | 7.1742067e-05
3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05
3,830 | ++Spicy: an Open-Source Tool for Second-Generation Schema Mapping and Data Exchange | 2011 | VLDB | 6.7193951e-05
4,185 | Arnold: Declarative Crowd-Machine Data Integration | 2013 | CIDR | 6.3776356e-05
4,607 | Data Integration and Machine Learning: A Natural Synergy | 2018 | SIGMOD | 6.0538827e-05
4,929 | Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux | 2010 | VLDB | 5.8217296e-05
5,618 | Explaining Repaired Data with CFDs | 2018 | VLDB | 5.4079415e-05
5,803 | Semandaq: A Data Quality System Based on Conditional Functional Dependencies | 2008 | VLDB | 5.3205861e-05
7,066 | On Multiple Semantics for Declarative Database Repairs | 2020 | SIGMOD | 4.8445108e-05
7,243 | Data Integration and Machine Learning: A Natural Synergy | 2018 | VLDB | 4.7913666e-05
7,867 | Learning Over Dirty Data Without Cleaning | 2020 | SIGMOD | 4.6320452e-05
9,278 | Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples | 2016 | SIGMOD | 4.3639892e-05
10,676 | Meaningful Data Erasure in the Presence of Dependencies | 2025 | VLDB | 4.1945683e-05
12,002 | Tutorial: Uncertain Entity Resolution — Re-evaluating Entity Resolution in the Big Data Era | 2014 | VLDB | 4.1945683e-05
12,425 | XClean in Action: A Demonstration of Declarative XML Data Cleaning | 2007 | CIDR | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 4 of 4 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers