Database Paper Browser

Crowdsourcing Algorithms for Entity Resolution

Summary: Studies hybrid human–machine entity resolution on a graph whose edges carry match probabilities, and designs query strategies that minimize the expected number of human verifications by exploiting transitivity. Shows that a strategy claimed to be optimal can be arbitrarily bad, offers alternatives with provable guarantees, and validates them on public datasets and Facebook’s production system. (summarized by gpt-5-nano on Feb 09 2026)
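
The role of transitivity in the summary can be illustrated with a small sketch. It is not the paper's algorithm: the descending-probability query order is only an illustrative heuristic, the oracle is assumed to answer perfectly, and the names `resolve`, `prob`, and `oracle` are hypothetical. A pair is sent to the human only when its answer does not already follow from earlier answers (A = B and B = C imply A = C; A = B and B ≠ C imply A ≠ C).

```python
from itertools import combinations


def resolve(records, prob, oracle):
    """Query pairs in descending match probability (illustrative heuristic),
    skipping pairs whose answer already follows by transitivity."""
    parent = {r: r for r in records}
    # known_diff[root] holds the cluster roots known to differ from root.
    known_diff = {r: set() for r in records}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    asked = 0
    pairs = sorted(combinations(records, 2), key=lambda p: prob(*p), reverse=True)
    for a, b in pairs:
        ra, rb = find(a), find(b)
        if ra == rb:
            continue  # match already inferred
        if rb in known_diff[ra]:
            continue  # non-match already inferred
        asked += 1
        if oracle(a, b):
            # Merge rb's cluster into ra's and carry over its non-match edges.
            parent[rb] = ra
            for other in known_diff.pop(rb):
                known_diff[other].discard(rb)
                known_diff[other].add(ra)
                known_diff[ra].add(other)
        else:
            known_diff[ra].add(rb)
            known_diff[rb].add(ra)

    clusters = {}
    for r in records:
        clusters.setdefault(find(r), []).append(r)
    return sorted(clusters.values()), asked


# Hypothetical toy data: the first letter of each record is its true entity.
records = ["a1", "a2", "a3", "b1", "b2"]
same = lambda x, y: x[0] == y[0]
clusters, asked = resolve(records, lambda x, y: 0.9 if same(x, y) else 0.1, same)
print(clusters, asked)  # [['a1', 'a2', 'a3'], ['b1', 'b2']] 4
```

In this toy run only 4 of the 10 possible pairs reach the oracle; the other 6 are inferred. The order in which pairs are asked determines how many answers can be inferred, which is the quantity the query strategies in the summary aim to minimize.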

Paper ID: 10767
Venue: VLDB
Year: 2014
Pagerank: 0.00010348858
Overall Rank: 1,841 | 87.20%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 25 of 25 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
2,175 | Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services | 2017 | SIGMOD | 9.3644117e-05
2,767 | A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching | 2020 | SIGMOD | 8.1513883e-05
4,104 | Online Entity Resolution Using an Oracle | 2016 | VLDB | 6.4493809e-05
4,126 | Waldo: An Adaptive Human Interface for Crowd Entity Resolution | 2017 | SIGMOD | 6.4314729e-05
4,619 | Crowd-Based Deduplication: An Adaptive Approach | 2015 | SIGMOD | 6.0444854e-05
4,989 | BEER: Blocking for Effective Entity Resolution | 2021 | SIGMOD | 5.7827362e-05
5,362 | Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach | 2016 | SIGMOD | 5.5473503e-05
5,734 | Efficient Algorithms for Crowd-Aided Categorization | 2020 | VLDB | 5.3482904e-05
6,584 | Budget Constrained Interactive Search for Multiple Targets | 2021 | VLDB | 5.0027686e-05
6,868 | Cost-Effective Data Annotation using Game-Based Crowdsourcing | 2019 | VLDB | 4.9010083e-05
7,117 | Crowdsourced Data Management: Overview and Challenges | 2017 | SIGMOD | 4.826509e-05
7,178 | Towards Globally Optimal Crowdsourcing Quality Management: The Uniform Worker Setting | 2016 | SIGMOD | 4.8085946e-05
7,292 | Subjective Knowledge Base Construction Powered By Crowdsourcing and Knowledge Base | 2018 | SIGMOD | 4.7740174e-05
8,585 | Robust Entity Resolution using Random Graphs | 2018 | SIGMOD | 4.4905755e-05
9,683 | Hierarchical Entity Resolution using an Oracle | 2022 | SIGMOD | 4.3047774e-05
9,684 | How to Design Robust Algorithms using Noisy Comparison Oracle | 2021 | VLDB | 4.3047774e-05
9,855 | Progressive Entity Matching: A Design Space Exploration | 2025 | SIGMOD | 4.269353e-05
9,896 | Towards Interpretable and Learnable Risk Analysis for Entity Resolution | 2020 | SIGMOD | 4.2600049e-05
10,022 | In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration | 2026 | SIGMOD | 4.1945683e-05
10,091 | LLM-Powered Interactive Graph Search: A Scalable and Practical Approach | 2026 | SIGMOD | 4.1945683e-05
10,624 | Evaluating Methods for Efficient Entity Count Estimation | 2025 | VLDB | 4.1945683e-05
11,443 | Approximation Algorithms for Large Scale Data Analysis | 2021 | PODS | 4.1945683e-05
11,731 | A Demonstration of PERC: Probabilistic Entity Resolution With Crowd Errors | 2018 | VLDB | 4.1945683e-05
11,788 | CDB: Optimizing Queries with Crowd-Based Selections and Joins | 2017 | SIGMOD | 4.1945683e-05
11,791 | CrowdDQS: Dynamic Question Selection in Crowdsourcing Systems | 2017 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
263 | CrowdER: Crowdsourcing Entity Resolution | 2012 | VLDB | 0.00029862413
267 | Human-powered Sorts and Joins | 2012 | VLDB | 0.00029690405
866 | Leveraging Transitive Relations for Crowdsourced Joins | 2013 | SIGMOD | 0.00015801196
1,242 | Question Selection for Crowd Entity Resolution | 2013 | VLDB | 0.00013096655
4,185 | Arnold: Declarative Crowd-Machine Data Integration | 2013 | CIDR | 6.3776356e-05

Semantically Similar Papers