CloudMatcher: A Hands-Off Cloud/Crowd Service for Entity Matching
Summary: CloudMatcher is a hands-off cloud/crowd service for entity matching (EM). Lay users upload two tables and either label record pairs as match/no-match themselves or crowdsource the labels. The service runs EM end to end and is built to scale to large datasets and concurrent use, so non-developers can carry out data integration workflows. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Yash Govind
2. Erik Paulson
3. Palaniappan Nagarajan
4. Paul Suganthan G.C.
5. AnHai Doan
6. Youngchoon Park
7. Glenn M. Fung
8. Devin Conathan
9. Marshall Carter
10. Mingju Sun
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Overall Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,640 | Deep Learning for Blocking in Entity Matching: A Design Space Exploration | 2021 | VLDB | 6.8891671e-05 |
| 4,402 | Smurf: Self-Service String Matching Using Random Forests | 2019 | VLDB | 6.2195162e-05 |
| 6,747 | Entity Matching Meets Data Science: A Progress Report from the Magellan Project | 2019 | SIGMOD | 4.9408824e-05 |
| 8,099 | Sparkly: A Simple yet Surprisingly Strong TF/IDF Blocker for Entity Matching | 2023 | VLDB | 4.5859317e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Overall Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 263 | CrowdER: Crowdsourcing Entity Resolution | 2012 | VLDB | 0.00029862413 |
| 643 | Corleone: Hands-Off Crowdsourcing for Entity Matching | 2014 | SIGMOD | 0.00018754451 |
| 712 | Magellan: Toward Building Entity Matching Management Systems | 2016 | VLDB | 0.00017732426 |
| 2,175 | Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services | 2017 | SIGMOD | 9.3644117e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,528 | Valentine in Action: Matching Tabular Data at Scale | 2021 | VLDB | 4.1945683e-05 |
| 4,464 | Magellan: Toward Building Entity Matching Management Systems over Data Science Stacks | 2016 | VLDB | 6.1606042e-05 |
| 11,583 | InCognitoMatch: Cognitive-aware Matching via Crowdsourcing | 2020 | SIGMOD | 4.1945683e-05 |
| 8,343 | CrowdGame: A Game-Based Crowdsourcing System for Cost-Effective Data Labeling | 2019 | SIGMOD | 4.5429217e-05 |
| 7,668 | Human-in-the-loop Data Integration | 2017 | VLDB | 4.6834075e-05 |
| 643 | Corleone: Hands-Off Crowdsourcing for Entity Matching | 2014 | SIGMOD | 0.00018754451 |
| 263 | CrowdER: Crowdsourcing Entity Resolution | 2012 | VLDB | 0.00029862413 |
| 2,175 | Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services | 2017 | SIGMOD | 9.3644117e-05 |
| 4,416 | CrowdMatcher: Crowd-Assisted Schema Matching | 2014 | SIGMOD | 6.2039225e-05 |
| 6,747 | Entity Matching Meets Data Science: A Progress Report from the Magellan Project | 2019 | SIGMOD | 4.9408824e-05 |