Comparative Analysis of Approximate Blocking Techniques for Entity Resolution
Summary: Systematic empirical survey of 17 blocking methods for entity resolution, assessing robustness of configurations and effectiveness–latency trade-offs on 6 real datasets. Explores scalability with 7 synthetic corpora from 10k to 2M entities. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. George Papadakis
- 2. Jonathan Svirsky
- 3. Avigdor Gal
- 4. Themis Palpanas
Incoming Citations (Sorted by Pagerank)
Showing 19 of 19 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 67 | The Merge/Purge Problem for Large Databases | 1995 | SIGMOD | 0.00061348205 |
| 125 | Approximate String Joins in a Database (Almost) for Free | 2001 | VLDB | 0.00044847972 |
| 199 | Declarative Data Cleaning: Language, Model, and Algorithms | 2001 | VLDB | 0.00035041015 |
| 814 | Entity Resolution: Theory, Practice & Open Challenges | 2012 | VLDB | 0.00016370594 |
| 1,410 | Entity Resolution with Iterative Blocking | 2009 | SIGMOD | 0.00012127555 |
| 5,228 | Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data | 2016 | VLDB | 5.6158315e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,052 | Pre-trained Embeddings for Entity Resolution: An Experimental Analysis | 2023 | VLDB | 4.8497453e-05 |
| 11,047 | Blocker and Matcher Can Mutually Benefit: A Co-Learning Framework for Low-Resource Entity Resolution | 2024 | VLDB | 4.1945683e-05 |
| 1,345 | Entity Matching: How Similar Is Similar | 2011 | VLDB | 0.00012468408 |
| 3,640 | Deep Learning for Blocking in Entity Matching: A Design Space Exploration | 2021 | VLDB | 6.8891671e-05 |
| 319 | Evaluation of entity resolution approaches on real-world match problems | 2010 | VLDB | 0.00027781866 |
| 11,373 | Generalized Supervised Meta-blocking | 2022 | VLDB | 4.1945683e-05 |
| 5,228 | Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data | 2016 | VLDB | 5.6158315e-05 |
| 3,977 | BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution | 2016 | VLDB | 6.5736268e-05 |
| 4,974 | Supervised Meta-blocking | 2014 | VLDB | 5.7903293e-05 |
| 1,410 | Entity Resolution with Iterative Blocking | 2009 | SIGMOD | 0.00012127555 |