Database Paper Browser

On Active Learning of Record Matching Packages

Summary: The paper studies active learning for record matching, in which a classifier is trained from a selectively chosen set of labeled examples. Its new algorithms exploit the structure of the record-matching problem to guarantee quality and scalability, and experiments on real-world data validate their effectiveness. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 4293
Venue: SIGMOD
Year: 2010
Pagerank: 0.00021409518
Overall Rank: 509 | 96.47%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 31 of 31 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
263 | CrowdER: Crowdsourcing Entity Resolution | 2012 | VLDB | 0.00029862413
643 | Corleone: Hands-Off Crowdsourcing for Entity Matching | 2014 | SIGMOD | 0.00018754451
712 | Magellan: Toward Building Entity Matching Management Systems | 2016 | VLDB | 0.00017732426
814 | Entity Resolution: Theory, Practice & Open Challenges | 2012 | VLDB | 0.00016370594
1,242 | Question Selection for Crowd Entity Resolution | 2013 | VLDB | 0.00013096655
2,767 | A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching | 2020 | SIGMOD | 8.1513883e-05
3,118 | Scaling Up Crowd-Sourcing to Very Large Datasets: A Case for Active Learning | 2015 | VLDB | 7.5379338e-05
3,140 | ZeroER: Entity Resolution using Zero Labeled Examples | 2020 | SIGMOD | 7.4841763e-05
3,142 | Active Learning for ML Enhanced Database Systems | 2020 | SIGMOD | 7.4815444e-05
3,744 | Learning Expressive Linkage Rules using Genetic Programming | 2012 | VLDB | 6.7932071e-05
3,996 | MaskIt: Privately Releasing User Context Streams for Personalized Mobile Applications | 2012 | SIGMOD | 6.5502886e-05
4,126 | Waldo: An Adaptive Human Interface for Crowd Entity Resolution | 2017 | SIGMOD | 6.4314729e-05
4,619 | Crowd-Based Deduplication: An Adaptive Approach | 2015 | SIGMOD | 6.0444854e-05
4,665 | Argonaut: Macrotask Crowdsourcing for Complex Data Processing | 2015 | VLDB | 6.0125329e-05
5,032 | Actively Soliciting Feedback for Query Answers in Keyword Search-Based Data Integration | 2013 | VLDB | 5.748807e-05
5,282 | Deep Indexed Active Learning for Matching Heterogeneous Entity Representations | 2022 | VLDB | 5.5864206e-05
5,941 | Big Graphs: Challenges and Opportunities | 2022 | VLDB | 5.2635446e-05
5,978 | Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond | 2021 | SIGMOD | 5.2453012e-05
6,519 | Expand your Training Limits! Generating Training Data for ML-based Data Management | 2021 | SIGMOD | 5.0316686e-05
6,690 | Parallel Discrepancy Detection and Incremental Detection | 2021 | VLDB | 4.9621556e-05
7,345 | Linking Temporal Records for Profiling Entities | 2015 | SIGMOD | 4.756212e-05
7,648 | User Guidance for Efficient Fact Checking | 2019 | VLDB | 4.6889787e-05
8,362 | Minimizing Efforts in Validating Crowd Answers | 2015 | SIGMOD | 4.5366717e-05
9,409 | Ground Truth Inference for Weakly Supervised Entity Matching | 2023 | SIGMOD | 4.3441378e-05
9,434 | Rock: Cleaning Data by Embedding ML in Logic Rules | 2024 | SIGMOD | 4.3430376e-05
9,487 | Making It Tractable to Catch Duplicates and Conflicts in Graphs | 2023 | SIGMOD | 4.3341665e-05
9,896 | Towards Interpretable and Learnable Risk Analysis for Entity Resolution | 2020 | SIGMOD | 4.2600049e-05
11,054 | Enriching Relations with Additional Attributes for ER | 2024 | VLDB | 4.1945683e-05
11,223 | Splitting Tuples of Mismatched Entities | 2023 | SIGMOD | 4.1945683e-05
11,438 | New Algorithms for Monotone Classification | 2021 | PODS | 4.1945683e-05
12,194 | Web Scale Taxonomy Cleansing | 2011 | VLDB | 4.1945683e-05
Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers