Database Paper Browser

Back to papers

Reasoning about Record Matching Rules

Summary: Introduces MDs with similarity-based dynamic semantics for unreliable data in record matching. Proposes relative candidate keys (RCKs) and an O(n^2) MD-inference method with RCK deduction; experiments show improved match quality and faster blocking. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
9926
Venue
VLDB
Year
2009
Pagerank
0.00017918203
Overall Rank
702 | 95.12%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 27 of 27 citing papers.

Rank Citing Paper Year Venue Pagerank
192 HoloClean: Holistic Data Repairs with Probabilistic Inference 2017 VLDB 0.00035728858
300 Deep Learning for Entity Matching: A Design Space Exploration 2018 SIGMOD 0.00028441466
833 Guided Data Repair 2011 VLDB 0.00016138432
1,012 NADEEF: A Commodity Data Cleaning System 2013 SIGMOD 0.0001464733
1,159 Towards Certain Fixes with Editing Rules and Master Data 2010 VLDB 0.00013592813
1,345 Entity Matching: How Similar Is Similar 2011 VLDB 0.00012468408
2,158 Uni-Detect: A Unified Approach to Automated Error Detection in Tables 2019 SIGMOD 9.4141354e-05
2,460 Combining Quantitative and Logical Data Cleaning 2016 VLDB 8.7617484e-05
2,823 Interaction between Record Matching and Data Repairing 2011 SIGMOD 8.0593894e-05
3,192 Towards Dependable Data Repairing with Fixing Rules 2014 SIGMOD 7.4095761e-05
3,396 Automatic Data Repair: Are We Ready to Deploy? 2024 VLDB 7.1455126e-05
4,837 Entity Resolution with Hierarchical Graph Attention Networks 2022 SIGMOD 5.8892326e-05
5,253 Enriching Data Imputation with Extensive Similarity Neighbors 2015 VLDB 5.6014916e-05
5,958 Fine-grained Concept Linking using Neural Networks in Healthcare 2018 SIGMOD 5.2563968e-05
6,175 Query-Driven Approach to Entity Resolution 2013 VLDB 5.169496e-05
6,569 Domain Adaptation for Deep Entity Resolution 2022 SIGMOD 5.0065379e-05
6,621 We Challenge You to Certify Your Updates 2011 SIGMOD 4.9905794e-05
6,690 Parallel Discrepancy Detection and Incremental Detection 2021 VLDB 4.9621556e-05
6,810 Record Linkage with Uniqueness Constraints and Erroneous Values 2010 VLDB 4.9203397e-05
7,867 Learning Over Dirty Data Without Cleaning 2020 SIGMOD 4.6320452e-05
8,153 Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation 2022 VLDB 4.574554e-05
8,875 CerFix: A System for Cleaning Data with Certain Fixes 2011 VLDB 4.430475e-05
9,725 On Concise Set of Relative Candidate Keys 2014 VLDB 4.2945121e-05
9,896 Towards Interpretable and Learnable Risk Analysis for Entity Resolution 2020 SIGMOD 4.2600049e-05
10,723 UniClean: A Scalable Data Cleaning Solution for Mixed Errors based on Unified Cleaners and Optimized Cleaning Workflow 2025 VLDB 4.1945683e-05
11,333 LACE: A Logical Approach to Collective Entity Resolution 2022 PODS 4.1945683e-05
11,742 Learning Efficiently Over Heterogeneous Databases 2018 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 8 of 8 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
67 The Merge/Purge Problem for Large Databases 1995 SIGMOD 0.00061348205
199 Declarative Data Cleaning: Language, Model, and Algorithms 2001 VLDB 0.00035041015
280 Eliminating Fuzzy Duplicates in Data Warehouses 2002 VLDB 0.00029113044
560 Dependencies Revisited for Improving Data Quality 2008 PODS 0.00020141923
1,533 Example-driven Design of Efficient Record Matching Queries 2007 VLDB 0.00011471971
2,386 Leveraging Aggregate Constraints For Deduplication 2007 SIGMOD 8.9231895e-05
3,529 Merging the Results of Approximate Match Operations 2004 VLDB 7.0059524e-05
5,235 Industry-Scale Duplicate Detection 2008 VLDB 5.6115647e-05
Previous Page 1 / 1 Next

Semantically Similar Papers