Database Paper Browser

Back to papers

Entity Resolution: Theory, Practice & Open Challenges

Summary: A cross-disciplinary survey of Entity Resolution (ER) across databases, ML, NLP, and IR, unifying practical solutions with theoretical underpinnings. It outlines current challenges and open problems, highlighting gaps between practice and theory and guiding future data-management work. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10462
Venue
VLDB
Year
2012
Pagerank
0.00016370594
Overall Rank
814 | 94.34%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 32 of 32 citing papers.

Rank Citing Paper Year Venue Pagerank
192 HoloClean: Holistic Data Repairs with Probabilistic Inference 2017 VLDB 0.00035728858
300 Deep Learning for Entity Matching: A Design Space Exploration 2018 SIGMOD 0.00028441466
2,514 Comparative Analysis of Approximate Blocking Techniques for Entity Resolution 2016 VLDB 8.6139012e-05
3,140 ZeroER: Entity Resolution using Zero Labeled Examples 2020 SIGMOD 7.4841763e-05
3,528 Distributed Data Deduplication 2016 VLDB 7.0066139e-05
3,694 Keys for Graphs 2015 VLDB 6.8345712e-05
3,711 Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale 2022 SIGMOD 6.823609e-05
4,104 Online Entity Resolution Using an Oracle 2016 VLDB 6.4493809e-05
4,383 Incremental Record Linkage 2014 VLDB 6.2383094e-05
4,607 Data Integration and Machine Learning: A Natural Synergy 2018 SIGMOD 6.0538827e-05
4,668 PrivateClean: Data Cleaning and Differential Privacy 2016 SIGMOD 6.0115918e-05
5,228 Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data 2016 VLDB 5.6158315e-05
5,978 Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond 2021 SIGMOD 5.2453012e-05
6,295 Your notebook is not crumby enough, REPLace it 2020 CIDR 5.1249204e-05
7,052 Pre-trained Embeddings for Entity Resolution: An Experimental Analysis 2023 VLDB 4.8497453e-05
7,243 Data Integration and Machine Learning: A Natural Synergy 2018 VLDB 4.7913666e-05
7,345 Linking Temporal Records for Profiling Entities 2015 SIGMOD 4.756212e-05
8,153 Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation 2022 VLDB 4.574554e-05
8,585 Robust Entity Resolution using Random Graphs 2018 SIGMOD 4.4905755e-05
8,958 FlexER: Flexible Entity Resolution for Multiple Intents 2023 SIGMOD 4.4210635e-05
9,028 Enabling Rich Queries Over Heterogeneous Data From Diverse Sources In HealthCare 2020 CIDR 4.4043898e-05
9,056 A Data Quality Metric (DQM): How to Estimate the Number of Undetected Errors in Data Sets 2017 VLDB 4.4039656e-05
9,235 ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries 2025 VLDB 4.3690661e-05
9,251 Joint Open Knowledge Base Canonicalization and Linking 2021 SIGMOD 4.3690661e-05
9,409 Ground Truth Inference for Weakly Supervised Entity Matching 2023 SIGMOD 4.3441378e-05
9,683 Hierarchical Entity Resolution using an Oracle 2022 SIGMOD 4.3047774e-05
9,855 Progressive Entity Matching: A Design Space Exploration 2025 SIGMOD 4.269353e-05
9,896 Towards Interpretable and Learnable Risk Analysis for Entity Resolution 2020 SIGMOD 4.2600049e-05
10,022 In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration 2026 SIGMOD 4.1945683e-05
10,040 3dSAGER: Geospatial Entity Resolution over 3D Objects 2026 SIGMOD 4.1945683e-05
11,906 Knowledge Curation and Knowledge Fusion: Challenges, Models, and Applications 2015 SIGMOD 4.1945683e-05
12,006 YZStack: Provisioning Customizable Solution for Big Data 2014 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
67 The Merge/Purge Problem for Large Databases 1995 SIGMOD 0.00061348205
155 Robust and Efficient Fuzzy Match for Online Data Cleaning 2003 SIGMOD 0.00040637896
229 Reference Reconciliation in Complex Information Spaces 2005 SIGMOD 0.00032242633
319 Evaluation of entity resolution approaches on real-world match problems 2010 VLDB 0.00027781866
322 Record Linkage: Similarity Measures and Algorithms 2006 SIGMOD 0.00027518768
509 On Active Learning of Record Matching Packages 2010 SIGMOD 0.00021409518
1,410 Entity Resolution with Iterative Blocking 2009 SIGMOD 0.00012127555
3,177 Evaluating Entity Resolution Results 2010 VLDB 7.4367331e-05
3,645 Large-Scale Collective Entity Matching 2011 VLDB 6.8853274e-05
Previous Page 1 / 1 Next

Semantically Similar Papers