Generating Concise Entity Matching Rules
Summary: Generates concise EM rules as GBFs via program synthesis from pos/neg examples, beating DNFs in conciseness. Demo: web-based customization, fast GBF synthesis, and open-source release for benchmarking against ML baselines. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Rohit Singh
- 2. Vamsi Meduri
- 3. Ahmed Elmagarmid
- 4. Samuel Madden
- 5. Paolo Papotti
- 6. Jorge-Arnulfo Quiané-Ruiz
- 7. Armando Solar-Lezama
- 8. Nan Tang
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 300 | Deep Learning for Entity Matching: A Design Space Exploration | 2018 | SIGMOD | 0.00028441466 |
| 1,831 | Synthesizing Entity Matching Rules by Examples | 2018 | VLDB | 0.00010384082 |
| 3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05 |
| 3,711 | Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale | 2022 | SIGMOD | 6.823609e-05 |
| 4,837 | Entity Resolution with Hierarchical Graph Attention Networks | 2022 | SIGMOD | 5.8892326e-05 |
| 6,553 | How do Categorical Duplicates Affect ML? A New Benchmark and Empirical Analyses | 2024 | VLDB | 5.0157344e-05 |
| 6,569 | Domain Adaptation for Deep Entity Resolution | 2022 | SIGMOD | 5.0065379e-05 |
| 9,709 | Outlier Summarization via Human Interpretable Rules | 2024 | VLDB | 4.299267e-05 |
| 9,896 | Towards Interpretable and Learnable Risk Analysis for Entity Resolution | 2020 | SIGMOD | 4.2600049e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,345 | Entity Matching: How Similar Is Similar | 2011 | VLDB | 0.00012468408 |
| 1,527 | Generic Schema Matching, Ten Years Later | 2011 | VLDB | 0.00011499442 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,762 | Natural Language Querying of Complex Business Intelligence Queries | 2019 | SIGMOD | 4.9345125e-05 |
| 9,487 | Making It Tractable to Catch Duplicates and Conflicts in Graphs | 2023 | SIGMOD | 4.3341665e-05 |
| 9,409 | Ground Truth Inference for Weakly Supervised Entity Matching | 2023 | SIGMOD | 4.3441378e-05 |
| 5,205 | ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies | 2019 | SIGMOD | 5.630869e-05 |
| 3,582 | NADEEF/ER: Generic and Interactive Entity Resolution | 2014 | SIGMOD | 6.9479263e-05 |
| 8,475 | DataProf: Semantic Profiling for Iterative Data Cleansing and Business Rule Acquisition | 2018 | SIGMOD | 4.5028904e-05 |
| 3,744 | Learning Expressive Linkage Rules using Genetic Programming | 2012 | VLDB | 6.7932071e-05 |
| 1,345 | Entity Matching: How Similar Is Similar | 2011 | VLDB | 0.00012468408 |
| 8,007 | A Grammar-based Entity Representation Framework for Data Cleaning | 2009 | SIGMOD | 4.6068018e-05 |
| 1,831 | Synthesizing Entity Matching Rules by Examples | 2018 | VLDB | 0.00010384082 |