Database Paper Browser

Back to papers

Deep Indexed Active Learning for Matching Heterogeneous Entity Representations

Summary: Presents DIAL, an ER method with active learning that jointly learns embeddings to boost blocking recall and matching accuracy. Index-By-Committee with PLM-based blockers and matchers; blocking vs. matching have distinct training aims, improving precision, recall, and speed on five benchmarks. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12791
Venue
VLDB
Year
2022
Pagerank
5.5864206e-05
Overall Rank
5,282 | 63.26%
DOI
10.14778/3485450.3485455

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 17 of 17 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
67 The Merge/Purge Problem for Large Databases 1995 SIGMOD 0.00061348205
221 Deep Entity Matching with Pre-Trained Language Models 2021 VLDB 0.00033121824
263 CrowdER: Crowdsourcing Entity Resolution 2012 VLDB 0.00029862413
300 Deep Learning for Entity Matching: A Design Space Exploration 2018 SIGMOD 0.00028441466
319 Evaluation of entity resolution approaches on real-world match problems 2010 VLDB 0.00027781866
509 On Active Learning of Record Matching Packages 2010 SIGMOD 0.00021409518
712 Magellan: Toward Building Entity Matching Management Systems 2016 VLDB 0.00017732426
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
2,038 The return of JedAI: End-to-End Entity Resolution for Structured and Semi-Structured Data 2018 VLDB 9.7098952e-05
2,767 A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching 2020 SIGMOD 8.1513883e-05
3,118 Scaling Up Crowd-Sourcing to Very Large Datasets: A Case for Active Learning 2015 VLDB 7.5379338e-05
3,977 BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution 2016 VLDB 6.5736268e-05
4,126 Waldo: An Adaptive Human Interface for Crowd Entity Resolution 2017 SIGMOD 6.4314729e-05
4,974 Supervised Meta-blocking 2014 VLDB 5.7903293e-05
5,228 Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data 2016 VLDB 5.6158315e-05
7,450 SystemER: A Human-in-the-loop System for Explainable Entity Resolution 2019 VLDB 4.7265276e-05
8,585 Robust Entity Resolution using Random Graphs 2018 SIGMOD 4.4905755e-05
Previous Page 1 / 1 Next

Semantically Similar Papers