Database Paper Browser

Back to papers

Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach

Summary: Partial-order-based crowdsourced entity resolution; defines a pairwise order, queries a small set, and propagates to infer all answers. Grouping for cost reduction, error-tolerant handling of order and crowd errors, and efficient query selection; experiments show 1.25% of baselines (80x cheaper) with preserved quality. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5279
Venue
SIGMOD
Year
2016
Pagerank
5.5473503e-05
Overall Rank
5,362 | 62.70%
DOI
10.1145/2882903.2915252

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 25 of 25 citing papers.

Rank Citing Paper Year Venue Pagerank
2,175 Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services 2017 SIGMOD 9.3644117e-05
2,767 A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching 2020 SIGMOD 8.1513883e-05
2,937 Truth Inference in Crowdsourcing: Is the Problem Solved? 2017 VLDB 7.853108e-05
4,102 GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data 2023 SIGMOD 6.4522929e-05
5,279 CDB: A Crowd-Powered Database System 2018 VLDB 5.5902418e-05
5,381 Selective Data Acquisition in the Wild for Model Charging 2022 VLDB 5.5399508e-05
5,963 Automatic Data Acquisition for Deep Learning 2021 VLDB 5.2526794e-05
5,976 Responsible Data Integration: Next-generation Challenges 2022 SIGMOD 5.245976e-05
6,569 Domain Adaptation for Deep Entity Resolution 2022 SIGMOD 5.0065379e-05
6,868 Cost-Effective Data Annotation using Game-Based Crowdsourcing 2019 VLDB 4.9010083e-05
6,985 CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression 2023 SIGMOD 4.8729387e-05
7,117 Crowdsourced Data Management: Overview and Challenges 2017 SIGMOD 4.826509e-05
7,179 Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning 2023 VLDB 4.8078895e-05
7,292 Subjective Knowledge Base Construction Powered By Crowdsourcing and Knowledge Base 2018 SIGMOD 4.7740174e-05
7,575 Human-in-the-loop Outlier Detection 2020 SIGMOD 4.7068909e-05
7,668 Human-in-the-loop Data Integration 2017 VLDB 4.6834075e-05
8,268 Learned Data-aware Image Representations of Line Charts for Similarity Search 2023 SIGMOD 4.5456668e-05
8,585 Robust Entity Resolution using Random Graphs 2018 SIGMOD 4.4905755e-05
9,896 Towards Interpretable and Learnable Risk Analysis for Entity Resolution 2020 SIGMOD 4.2600049e-05
10,022 In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration 2026 SIGMOD 4.1945683e-05
10,161 Enabling Efficient Direct Update on Rule-Based Compressed Graph 2026 SIGMOD 4.1945683e-05
11,000 MisDetect: Iterative Mislabel Detection using Early Loss 2024 VLDB 4.1945683e-05
11,707 A Rating-Ranking Method for Crowdsourced Top-k Computation 2018 SIGMOD 4.1945683e-05
11,788 CDB: Optimizing Queries with Crowd-Based Selections and Joins 2017 SIGMOD 4.1945683e-05
11,816 DOCS: Domain-Aware Crowdsourcing System 2017 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 15 of 15 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers