BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution
Summary: BLAST is a loosely schema-aware meta-blocking method for entity resolution using data-derived stats to improve blocks beyond schema-agnostic methods. LSH-based extraction scales to large data, delivering better coverage and beating unsupervised meta-blocking. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,038 | The return of JedAI: End-to-End Entity Resolution for Structured and Semi-Structured Data | 2018 | VLDB | 9.7098952e-05 |
| 3,640 | Deep Learning for Blocking in Entity Matching: A Design Space Exploration | 2021 | VLDB | 6.8891671e-05 |
| 4,989 | BEER: Blocking for Effective Entity Resolution | 2021 | SIGMOD | 5.7827362e-05 |
| 5,282 | Deep Indexed Active Learning for Matching Heterogeneous Entity Representations | 2022 | VLDB | 5.5864206e-05 |
| 6,690 | Parallel Discrepancy Detection and Incremental Detection | 2021 | VLDB | 4.9621556e-05 |
| 7,668 | Human-in-the-loop Data Integration | 2017 | VLDB | 4.6834075e-05 |
| 9,846 | HyperBlocker: Accelerating Rule-based Blocking in Entity Resolution using GPUs | 2025 | VLDB | 4.2721228e-05 |
| 11,373 | Generalized Supervised Meta-blocking | 2022 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 125 | Approximate String Joins in a Database (Almost) for Free | 2001 | VLDB | 0.00044847972 |
| 398 | Big Data Integration | 2013 | VLDB | 0.00024372588 |
| 1,147 | Web-scale Data Integration: You can only afford to Pay As You Go | 2007 | CIDR | 0.00013677658 |
| 4,974 | Supervised Meta-blocking | 2014 | VLDB | 5.7903293e-05 |
| 5,228 | Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data | 2016 | VLDB | 5.6158315e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,052 | Pre-trained Embeddings for Entity Resolution: An Experimental Analysis | 2023 | VLDB | 4.8497453e-05 |
| 8,099 | Sparkly: A Simple yet Surprisingly Strong TF/IDF Blocker for Entity Matching | 2023 | VLDB | 4.5859317e-05 |
| 11,047 | Blocker and Matcher Can Mutually Benefit: A Co-Learning Framework for Low-Resource Entity Resolution | 2024 | VLDB | 4.1945683e-05 |
| 4,989 | BEER: Blocking for Effective Entity Resolution | 2021 | SIGMOD | 5.7827362e-05 |
| 3,640 | Deep Learning for Blocking in Entity Matching: A Design Space Exploration | 2021 | VLDB | 6.8891671e-05 |
| 1,410 | Entity Resolution with Iterative Blocking | 2009 | SIGMOD | 0.00012127555 |
| 4,974 | Supervised Meta-blocking | 2014 | VLDB | 5.7903293e-05 |
| 5,228 | Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data | 2016 | VLDB | 5.6158315e-05 |
| 11,373 | Generalized Supervised Meta-blocking | 2022 | VLDB | 4.1945683e-05 |
| 2,514 | Comparative Analysis of Approximate Blocking Techniques for Entity Resolution | 2016 | VLDB | 8.6139012e-05 |