Database Paper Browser

Back to papers

CrowdER: Crowdsourcing Entity Resolution

Summary: CrowdER uses a machine-driven coarse pass to prune candidate pairs, saving human effort. Minimizing verification batches is NP-hard; a two-tiered batched-heuristic yields efficient, accurate results, demonstrated on real datasets via crowdsourcing. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10384
Venue
VLDB
Year
2012
Pagerank
0.00029862413
Overall Rank
263 | 98.18%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 21 of 71 citing papers.

Rank Citing Paper Year Venue Pagerank
7,292 Subjective Knowledge Base Construction Powered By Crowdsourcing and Knowledge Base 2018 SIGMOD 4.7740174e-05
7,575 Human-in-the-loop Outlier Detection 2020 SIGMOD 4.7068909e-05
7,668 Human-in-the-loop Data Integration 2017 VLDB 4.6834075e-05
8,056 Where To: Crowd-Aided Path Selection 2014 VLDB 4.5946189e-05
8,585 Robust Entity Resolution using Random Graphs 2018 SIGMOD 4.4905755e-05
8,911 PromptEM: Prompt-tuning for Low-resource Generalized Entity Matching 2023 VLDB 4.427232e-05
9,056 A Data Quality Metric (DQM): How to Estimate the Number of Undetected Errors in Data Sets 2017 VLDB 4.4039656e-05
9,196 QOCO: A Query Oriented Data Cleaning System with Oracles 2015 VLDB 4.3749064e-05
9,683 Hierarchical Entity Resolution using an Oracle 2022 SIGMOD 4.3047774e-05
9,684 How to Design Robust Algorithms using Noisy Comparison Oracle 2021 VLDB 4.3047774e-05
9,851 Adaptive Schema Databases 2017 CIDR 4.2721228e-05
9,896 Towards Interpretable and Learnable Risk Analysis for Entity Resolution 2020 SIGMOD 4.2600049e-05
10,022 In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration 2026 SIGMOD 4.1945683e-05
10,091 LLM-Powered Interactive Graph Search: A Scalable and Practical Approach 2026 SIGMOD 4.1945683e-05
11,454 Contextual Data Cleaning with Ontology FDs 2021 SIGMOD 4.1945683e-05
11,707 A Rating-Ranking Method for Crowdsourced Top-k Computation 2018 SIGMOD 4.1945683e-05
11,739 CloudMatcher: A Hands-Off Cloud/Crowd Service for Entity Matching 2018 VLDB 4.1945683e-05
11,788 CDB: Optimizing Queries with Crowd-Based Selections and Joins 2017 SIGMOD 4.1945683e-05
11,791 CrowdDQS: Dynamic Question Selection in Crowdsourcing Systems 2017 SIGMOD 4.1945683e-05
11,816 DOCS: Domain-Aware Crowdsourcing System 2017 VLDB 4.1945683e-05
12,044 Knowledge Harvesting in the Big-Data Era 2013 SIGMOD 4.1945683e-05
Previous Page 2 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
94 CrowdDB: Answering Queries with Crowdsourcing 2011 SIGMOD 0.00051013264
249 Crowdsourced Databases: Query Processing with People 2011 CIDR 0.00030740523
267 Human-powered Sorts and Joins 2012 VLDB 0.00029690405
319 Evaluation of entity resolution approaches on real-world match problems 2010 VLDB 0.00027781866
509 On Active Learning of Record Matching Packages 2010 SIGMOD 0.00021409518
692 Pay-as-you-go User Feedback for Dataspace Systems 2008 SIGMOD 0.00018083948
1,885 CrowdDB: Query Processing with the VLDB Crowd 2011 VLDB 0.0001021098
Previous Page 1 / 1 Next

Semantically Similar Papers