Database Paper Browser

Synthesizing Entity Matching Rules by Examples

Summary: Synthesizes entity matching (EM) rules from positive and negative examples using program synthesis over a General Boolean Formula (GBF) grammar. GBF rules combine attribute-matching predicates with AND, OR, and NOT; the resulting rules are concise and interpretable, handle missing values, and are competitive with deep models in effectiveness. (summarized by gpt-5-nano on Feb 09 2026)
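
To make the rule shape in the summary concrete, here is a minimal Python sketch of a boolean matching rule built from AND, OR, and NOT over similarity predicates. It is an illustration only, not the paper's synthesis algorithm or grammar: the class names (`Sim`, `Missing`, `And`, `Or`, `Not`), the `jaccard` and `exact` similarity functions, the thresholds, and the sample records are all hypothetical, and treating a missing value as "predicate not satisfied" is an assumption made for this sketch.

```python
from typing import Callable, Dict, Optional

Record = Dict[str, Optional[str]]  # attribute name -> value, None if missing


def jaccard(a: str, b: str) -> float:
    """Token-level Jaccard similarity between two strings."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def exact(a: str, b: str) -> float:
    """1.0 when the strings are identical, else 0.0."""
    return 1.0 if a == b else 0.0


class Sim:
    """Atomic predicate: fn(r1[attr], r2[attr]) >= threshold."""

    def __init__(self, attr: str, fn: Callable[[str, str], float], threshold: float):
        self.attr, self.fn, self.threshold = attr, fn, threshold

    def holds(self, r1: Record, r2: Record) -> bool:
        v1, v2 = r1.get(self.attr), r2.get(self.attr)
        if v1 is None or v2 is None:
            return False  # assumption: a missing value never satisfies a similarity test
        return self.fn(v1, v2) >= self.threshold


class Missing:
    """Hypothetical predicate: true when either record lacks the attribute."""

    def __init__(self, attr: str):
        self.attr = attr

    def holds(self, r1: Record, r2: Record) -> bool:
        return r1.get(self.attr) is None or r2.get(self.attr) is None


class And:
    def __init__(self, *parts):
        self.parts = parts

    def holds(self, r1: Record, r2: Record) -> bool:
        return all(p.holds(r1, r2) for p in self.parts)


class Or:
    def __init__(self, *parts):
        self.parts = parts

    def holds(self, r1: Record, r2: Record) -> bool:
        return any(p.holds(r1, r2) for p in self.parts)


class Not:
    def __init__(self, part):
        self.part = part

    def holds(self, r1: Record, r2: Record) -> bool:
        return not self.part.holds(r1, r2)


# Hand-written example rule: names are similar AND (zip is missing OR zips are equal).
rule = And(
    Sim("name", jaccard, 0.5),
    Or(Missing("zip"), Sim("zip", exact, 1.0)),
)

a = {"name": "Acme Corp", "zip": "10001"}
b = {"name": "ACME Corp Inc", "zip": None}
c = {"name": "Acme Corp", "zip": "94105"}

match_ab = rule.holds(a, b)  # similar names, zip missing in b
match_ac = rule.holds(a, c)  # identical names, but zips differ
```

A rule in this form stays readable as a single boolean expression, which is the interpretability property the summary refers to. How such rules are found from labeled examples is the subject of the paper itself and is not shown here.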

Paper ID: 11669
Venue: VLDB
Year: 2018
Pagerank: 0.00010384082
Overall Rank: 1,831 | 87.27%
DOI: 10.14778/3149193.3149199

Authors

Incoming Citations (Sorted by Pagerank)

Showing 31 of 31 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
221 | Deep Entity Matching with Pre-Trained Language Models | 2021 | VLDB | 0.00033121824
754 | Distributed Representations of Tuples for Entity Resolution | 2018 | VLDB | 0.00017117211
2,767 | A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching | 2020 | SIGMOD | 8.1513883e-05
4,018 | Through the Fairness Lens: Experimental Analysis and Evaluation of Entity Matching | 2023 | VLDB | 6.5244015e-05
4,837 | Entity Resolution with Hierarchical Graph Attention Networks | 2022 | SIGMOD | 5.8892326e-05
5,192 | Pattern Functional Dependencies for Data Cleaning | 2020 | VLDB | 5.6375087e-05
6,042 | MDedup: Duplicate Detection with Matching Dependencies | 2020 | VLDB | 5.2405269e-05
6,553 | How do Categorical Duplicates Affect ML? A New Benchmark and Empirical Analyses | 2024 | VLDB | 5.0157344e-05
6,569 | Domain Adaptation for Deep Entity Resolution | 2022 | SIGMOD | 5.0065379e-05
6,758 | Data Migration using Datalog Program Synthesis | 2020 | VLDB | 4.937199e-05
7,185 | Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs) | 2019 | VLDB | 4.8066159e-05
7,287 | Discovering Association Rules from Big Graphs | 2022 | VLDB | 4.7762276e-05
8,153 | Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation | 2022 | VLDB | 4.574554e-05
8,344 | Exploring the Data Wilderness through Examples | 2019 | SIGMOD | 4.5428111e-05
8,406 | DADER: Hands-Off Entity Resolution with Domain Adaptation | 2022 | VLDB | 4.5220083e-05
8,911 | PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching | 2023 | VLDB | 4.427232e-05
8,958 | FlexER: Flexible Entity Resolution for Multiple Intents | 2023 | SIGMOD | 4.4210635e-05
9,035 | Data-Driven Insight Synthesis for Multi-Dimensional Data | 2024 | VLDB | 4.4039656e-05
9,355 | Discovering Top-k Rules using Subjective and Objective Criteria | 2023 | SIGMOD | 4.3514328e-05
9,460 | The Battleship Approach to the Low Resource Entity Matching Problem | 2023 | SIGMOD | 4.3366491e-05
9,487 | Making It Tractable to Catch Duplicates and Conflicts in Graphs | 2023 | SIGMOD | 4.3341665e-05
9,709 | Outlier Summarization via Human Interpretable Rules | 2024 | VLDB | 4.299267e-05
9,847 | Discovering Top-k Relevant and Diversified Rules | 2024 | SIGMOD | 4.2721228e-05
9,963 | Parallel Rule Discovery from Large Datasets by Sampling | 2022 | SIGMOD | 4.2294678e-05
10,040 | 3dSAGER: Geospatial Entity Resolution over 3D Objects | 2026 | SIGMOD | 4.1945683e-05
10,489 | Incremental Rule Discovery in Response to Parameter Updates | 2025 | SIGMOD | 4.1945683e-05
10,610 | Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation | 2025 | VLDB | 4.1945683e-05
10,617 | Deduplicated Sampling On-Demand | 2025 | VLDB | 4.1945683e-05
11,206 | When Automatic Filtering Comes to the Rescue: Pre-Computing Company Competitor Pairs in Owler | 2023 | SIGMOD | 4.1945683e-05
11,223 | Splitting Tuples of Mismatched Entities | 2023 | SIGMOD | 4.1945683e-05
11,343 | SPINE: Scaling up Programming-by-Negative-Example for String Filtering and Transformation | 2022 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 12 of 12 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.


Semantically Similar Papers