Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data
Summary: Comparative study of schema-agnostic blocking vs. schema-based blocking for entity resolution. Schema-agnostic blocking yields higher recall with higher compute, enabling schema-free operation across heterogeneous data; validated on 9 methods and 11 benchmarks. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,038 | The return of JedAI: End-to-End Entity Resolution for Structured and Semi-Structured Data | 2018 | VLDB | 9.7098952e-05 |
| 2,514 | Comparative Analysis of Approximate Blocking Techniques for Entity Resolution | 2016 | VLDB | 8.6139012e-05 |
| 3,977 | BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution | 2016 | VLDB | 6.5736268e-05 |
| 4,989 | BEER: Blocking for Effective Entity Resolution | 2021 | SIGMOD | 5.7827362e-05 |
| 5,282 | Deep Indexed Active Learning for Matching Heterogeneous Entity Representations | 2022 | VLDB | 5.5864206e-05 |
| 7,052 | Pre-trained Embeddings for Entity Resolution: An Experimental Analysis | 2023 | VLDB | 4.8497453e-05 |
| 10,040 | 3dSAGER: Geospatial Entity Resolution over 3D Objects | 2026 | SIGMOD | 4.1945683e-05 |
| 11,373 | Generalized Supervised Meta-blocking | 2022 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 67 | The Merge/Purge Problem for Large Databases | 1995 | SIGMOD | 0.00061348205 |
| 125 | Approximate String Joins in a Database (Almost) for Free | 2001 | VLDB | 0.00044847972 |
| 322 | Record Linkage: Similarity Measures and Algorithms | 2006 | SIGMOD | 0.00027518768 |
| 814 | Entity Resolution: Theory, Practice & Open Challenges | 2012 | VLDB | 0.00016370594 |
| 1,147 | Web-scale Data Integration: You can only afford to Pay As You Go | 2007 | CIDR | 0.00013677658 |
| 1,410 | Entity Resolution with Iterative Blocking | 2009 | SIGMOD | 0.00012127555 |
| 2,740 | String Similarity Joins: An Experimental Evaluation | 2014 | VLDB | 8.1980628e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 902 | Statistical Schema Matching across Web Query Interfaces | 2003 | SIGMOD | 0.00015486247 |
| 319 | Evaluation of entity resolution approaches on real-world match problems | 2010 | VLDB | 0.00027781866 |
| 11,047 | Blocker and Matcher Can Mutually Benefit: A Co-Learning Framework for Low-Resource Entity Resolution | 2024 | VLDB | 4.1945683e-05 |
| 12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05 |
| 4,974 | Supervised Meta-blocking | 2014 | VLDB | 5.7903293e-05 |
| 3,640 | Deep Learning for Blocking in Entity Matching: A Design Space Exploration | 2021 | VLDB | 6.8891671e-05 |
| 1,410 | Entity Resolution with Iterative Blocking | 2009 | SIGMOD | 0.00012127555 |
| 11,373 | Generalized Supervised Meta-blocking | 2022 | VLDB | 4.1945683e-05 |
| 3,977 | BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution | 2016 | VLDB | 6.5736268e-05 |
| 2,514 | Comparative Analysis of Approximate Blocking Techniques for Entity Resolution | 2016 | VLDB | 8.6139012e-05 |