Database Paper Browser

Back to papers

Making It Tractable to Catch Duplicates and Conflicts in Graphs

Summary: Introduces Graph Cleaning Rules (GCRs) for scalable entity and conflict resolution in large graphs, using dual-pattern and star-shaped templates with ML predicates to cope with schemaless structures. PTIME for satisfiability, implication, and validation of GCRs; presents parallel rule mining and deep ER/CR algorithms with speedups and higher accuracy than prior graph dependencies. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6589
Venue
SIGMOD
Year
2023
Pagerank
4.3341665e-05
Overall Rank
9,487 | 34.01%
DOI
10.1145/3588940

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 6 of 6 citing papers.

Rank Citing Paper Year Venue Pagerank
9,400 Explaining GNN-based Recommendations in Logic 2025 VLDB 4.3441378e-05
10,235 Repairing Property Graphs under PG-Constraints 2026 VLDB 4.1945683e-05
10,486 Rule-Based Graph Cleaning with GPUs on a Single Machine 2025 SIGMOD 4.1945683e-05
11,001 Capturing More Associations by Referencing External Graphs 2024 VLDB 4.1945683e-05
11,098 Graph Association Analyses for Early Drug Discovery 2024 VLDB 4.1945683e-05
11,209 Enriching Recommendation Models with Logic Conditions 2023 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 34 of 34 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
49 Consistent Query Answers in Inconsistent Databases 1999 PODS 0.00067660624
192 HoloClean: Holistic Data Repairs with Probabilistic Inference 2017 VLDB 0.00035728858
300 Deep Learning for Entity Matching: A Design Space Exploration 2018 SIGMOD 0.00028441466
319 Evaluation of entity resolution approaches on real-world match problems 2010 VLDB 0.00027781866
509 On Active Learning of Record Matching Packages 2010 SIGMOD 0.00021409518
555 Discovering Denial Constraints 2013 VLDB 0.00020254908
623 Improving Data Quality: Consistency and Accuracy 2007 VLDB 0.00018996374
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
894 A Hybrid Approach to Functional Dependency Discovery 2016 SIGMOD 0.00015556428
1,089 GRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph 2014 VLDB 0.00014157922
1,159 Towards Certain Fixes with Editing Rules and Master Data 2010 VLDB 0.00013592813
1,188 On Generating Near-Optimal Tableaux for Conditional Functional Dependencies 2008 VLDB 0.00013441729
1,337 HoloDetect: Few-Shot Learning for Error Detection 2019 SIGMOD 0.00012497164
1,831 Synthesizing Entity Matching Rules by Examples 2018 VLDB 0.00010384082
2,175 Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services 2017 SIGMOD 9.3644117e-05
2,231 Dedoop: Efficient Deduplication with Hadoop 2012 VLDB 9.2304499e-05
2,253 Efficient Denial Constraint Discovery with Hydra 2018 VLDB 9.1937209e-05
2,450 Functional Dependencies for Graphs 2016 SIGMOD 8.7882979e-05
2,483 Discovery of Approximate (and Exact) Denial Constraints 2020 VLDB 8.6864916e-05
2,527 Dependencies for Graphs 2017 PODS 8.5954406e-05
2,968 Raha: A Configuration-Free Error Detection System 2019 SIGMOD 7.7985097e-05
3,528 Distributed Data Deduplication 2016 VLDB 7.0066139e-05
3,694 Keys for Graphs 2015 VLDB 6.8345712e-05
3,773 Cleaning Crowdsourced Labels Using Oracles for Statistical Classification 2019 VLDB 6.7758649e-05
4,127 A Statistical Perspective on Discovering Functional Dependencies in Noisy Data 2020 SIGMOD 6.4310458e-05
6,690 Parallel Discrepancy Detection and Incremental Detection 2021 VLDB 4.9621556e-05
6,703 Discovering Graph Functional Dependencies 2018 SIGMOD 4.9555163e-05
6,810 Record Linkage with Uniqueness Constraints and Erroneous Values 2010 VLDB 4.9203397e-05
7,287 Discovering Association Rules from Big Graphs 2022 VLDB 4.7762276e-05
8,133 Towards Event Prediction in Temporal Graphs 2022 VLDB 4.5784634e-05
8,211 Capturing Associations in Graphs 2020 VLDB 4.5581054e-05
8,422 Deducing Certain Fixes to Graphs 2019 VLDB 4.5167705e-05
8,436 A Critical Re-evaluation of Neural Methods for Entity Alignment 2022 VLDB 4.5138915e-05
9,564 Catching Numeric Inconsistencies in Graphs 2018 SIGMOD 4.3254416e-05
Previous Page 1 / 1 Next

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
3,532 Entity Resolution with Evolving Rules 2010 VLDB 7.0020216e-05
11,223 Splitting Tuples of Mismatched Entities 2023 SIGMOD 4.1945683e-05
9,434 Rock: Cleaning Data by Embedding ML in Logic Rules 2024 SIGMOD 4.3430376e-05
3,143 Extracting and Analyzing Hidden Graphs from Relational Databases 2017 SIGMOD 7.4804326e-05
7,287 Discovering Association Rules from Big Graphs 2022 VLDB 4.7762276e-05
8,211 Capturing Associations in Graphs 2020 VLDB 4.5581054e-05
11,016 Extending Graph Rules with Oracles 2024 VLDB 4.1945683e-05
8,585 Robust Entity Resolution using Random Graphs 2018 SIGMOD 4.4905755e-05
9,963 Parallel Rule Discovery from Large Datasets by Sampling 2022 SIGMOD 4.2294678e-05
6,690 Parallel Discrepancy Detection and Incremental Detection 2021 VLDB 4.9621556e-05