Database Paper Browser

Back to papers

Deep Learning for Entity Matching: A Design Space Exploration

Summary: Explores deep learning for entity matching, defines a DL design space (SIF, RNN, Attention, Hybrid), and maps NLP-style methods to EM. Empirically, DL lags on structured EM vs Magellan but excels on textual and dirty EM, guiding DL use for noisy data and outlining directions. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5572
Venue
SIGMOD
Year
2018
Pagerank
0.00028441466
Overall Rank
300 | 97.92%
DOI
10.1145/3183713.3196926

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 85 citing papers.

Rank Citing Paper Year Venue Pagerank
221 Deep Entity Matching with Pre-Trained Language Models 2021 VLDB 0.00033121824
333 Neo: A Learned Query Optimizer 2019 VLDB 0.00027206884
517 Can Foundation Models Wrangle Your Data? 2023 VLDB 0.00021169035
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
884 Plan-Structured Deep Neural Network Models for Query Performance Prediction 2019 VLDB 0.00015654004
1,914 Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks 2020 SIGMOD 0.00010109102
2,057 From Natural Language Processing to Neural Databases 2021 VLDB 9.6624862e-05
2,349 RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation 2021 VLDB 8.9876423e-05
2,587 Table-GPT: Table Fine-tuned GPT for Diverse Table Tasks 2024 SIGMOD 8.4924618e-05
2,767 A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching 2020 SIGMOD 8.1513883e-05
3,140 ZeroER: Entity Resolution using Zero Labeled Examples 2020 SIGMOD 7.4841763e-05
3,396 Automatic Data Repair: Are We Ready to Deploy? 2024 VLDB 7.1455126e-05
3,640 Deep Learning for Blocking in Entity Matching: A Design Space Exploration 2021 VLDB 6.8891671e-05
3,658 Towards a Hands-Free Query Optimizer through Deep Learning 2019 CIDR 6.8704209e-05
3,711 Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale 2022 SIGMOD 6.823609e-05
3,915 A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs 2020 VLDB 6.6332294e-05
3,942 Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins 2022 VLDB 6.6114622e-05
4,018 Through the Fairness Lens: Experimental Analysis and Evaluation of Entity Matching 2023 VLDB 6.5244015e-05
4,212 Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration 2023 SIGMOD 6.3555142e-05
4,278 Similarity Query Processing for High-Dimensional Data 2020 VLDB 6.2953764e-05
4,355 LargeEA: Aligning Entities for Large-scale Knowledge Graphs 2022 VLDB 6.259483e-05
4,593 Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift 2023 SIGMOD 6.0606891e-05
4,703 Medical Entity Disambiguation Using Graph Neural Networks 2021 SIGMOD 5.9855056e-05
4,837 Entity Resolution with Hierarchical Graph Attention Networks 2022 SIGMOD 5.8892326e-05
5,024 Towards Distribution-aware Query Answering in Data Markets 2022 VLDB 5.7535043e-05
5,088 TCUDB: Accelerating Database with Tensor Processors 2022 SIGMOD 5.7072189e-05
5,282 Deep Indexed Active Learning for Matching Heterogeneous Entity Representations 2022 VLDB 5.5864206e-05
5,371 LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning 2022 SIGMOD 5.5428776e-05
5,434 Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples 2021 SIGMOD 5.5045402e-05
5,533 Dual-Objective Fine-Tuning of BERT for Entity Matching 2021 VLDB 5.4544359e-05
5,869 Demonstration of Panda: A Weakly Supervised Entity Matching System 2021 VLDB 5.2959029e-05
5,978 Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond 2021 SIGMOD 5.2453012e-05
6,042 MDedup: Duplicate Detection with Matching Dependencies 2020 VLDB 5.2405269e-05
6,228 Managing ML Pipelines: Feature Stores and the Coming Wave of Embedding Ecosystems 2021 VLDB 5.1470042e-05
6,408 Explaining Link Prediction Systems based on Knowledge Graph Embeddings 2022 SIGMOD 5.0763482e-05
6,553 How do Categorical Duplicates Affect ML? A New Benchmark and Empirical Analyses 2024 VLDB 5.0157344e-05
6,569 Domain Adaptation for Deep Entity Resolution 2022 SIGMOD 5.0065379e-05
6,690 Parallel Discrepancy Detection and Incremental Detection 2021 VLDB 4.9621556e-05
6,711 Analyzing How BERT Performs Entity Matching 2022 VLDB 4.9517546e-05
6,747 Entity Matching Meets Data Science: A Progress Report from the Magellan Project 2019 SIGMOD 4.9408824e-05
7,052 Pre-trained Embeddings for Entity Resolution: An Experimental Analysis 2023 VLDB 4.8497453e-05
7,185 Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs) 2019 VLDB 4.8066159e-05
7,243 Data Integration and Machine Learning: A Natural Synergy 2018 VLDB 4.7913666e-05
7,613 ADnEV: Cross-Domain Schema Matching using Deep Similarity Matrix Adjustment and Evaluation 2020 VLDB 4.6961059e-05
8,008 Entity Resolution On-Demand 2022 VLDB 4.6067684e-05
8,099 Sparkly: A Simple yet Surprisingly Strong TF/IDF Blocker for Entity Matching 2023 VLDB 4.5859317e-05
8,153 Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation 2022 VLDB 4.574554e-05
8,346 Deep Learning: Systems and Responsibility 2021 SIGMOD 4.5420668e-05
8,406 DADER: Hands-Off Entity Resolution with Domain Adaptation 2022 VLDB 4.5220083e-05
8,436 A Critical Re-evaluation of Neural Methods for Entity Alignment 2022 VLDB 4.5138915e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 11 of 11 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers