Database Paper Browser

Back to papers

Entity Matching: How Similar Is Similar

Summary: Addresses 'how similar is similar' in entity matching by pruning the space of similarity functions and thresholds. Introduces optimization to remove redundancy and efficient algorithms to pick the best functions, with experiments on real and synthetic data showing improved accuracy over baselines. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10290
Venue
VLDB
Year
2011
Pagerank
0.00012468408
Overall Rank
1,345 | 90.65%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 16 of 16 citing papers.

Rank Citing Paper Year Venue Pagerank
221 Deep Entity Matching with Pre-Trained Language Models 2021 VLDB 0.00033121824
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
1,831 Synthesizing Entity Matching Rules by Examples 2018 VLDB 0.00010384082
3,322 iCrowd: An Adaptive Crowdsourcing Framework 2015 SIGMOD 7.2230626e-05
3,711 Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale 2022 SIGMOD 6.823609e-05
3,861 Generating Concise Entity Matching Rules 2017 SIGMOD 6.6878164e-05
4,018 Through the Fairness Lens: Experimental Analysis and Evaluation of Entity Matching 2023 VLDB 6.5244015e-05
4,837 Entity Resolution with Hierarchical Graph Attention Networks 2022 SIGMOD 5.8892326e-05
5,656 HYDRA: Large-scale Social Identity Linkage via Heterogeneous Behavior Modeling 2014 SIGMOD 5.3866501e-05
6,042 MDedup: Duplicate Detection with Matching Dependencies 2020 VLDB 5.2405269e-05
7,185 Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs) 2019 VLDB 4.8066159e-05
7,668 Human-in-the-loop Data Integration 2017 VLDB 4.6834075e-05
8,911 PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching 2023 VLDB 4.427232e-05
8,958 FlexER: Flexible Entity Resolution for Multiple Intents 2023 SIGMOD 4.4210635e-05
9,725 On Concise Set of Relative Candidate Keys 2014 VLDB 4.2945121e-05
11,206 When Automatic Filtering Comes to the Rescue: Pre-Computing Company Competitor Pairs in Owler 2023 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 6 of 6 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
67 The Merge/Purge Problem for Large Databases 1995 SIGMOD 0.00061348205
199 Declarative Data Cleaning: Language, Model, and Algorithms 2001 VLDB 0.00035041015
266 Efficient Exact Set-Similarity Joins 2006 VLDB 0.00029718727
322 Record Linkage: Similarity Measures and Algorithms 2006 SIGMOD 0.00027518768
702 Reasoning about Record Matching Rules 2009 VLDB 0.00017918203
1,533 Example-driven Design of Efficient Record Matching Queries 2007 VLDB 0.00011471971
Previous Page 1 / 1 Next

Semantically Similar Papers