CLAMShell: Speeding up Crowds for Low-latency Data Labeling
Summary: Presents CLAMShell to accelerate crowds for fast labeling; builds latency taxonomy and deployment profiles. Mitigates stragglers, pool maintenance, retainer pools, and active learning yield large speedups and lower variance; validated on MTurk. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Daniel Haas
- 2. Jiannan Wang
- 3. Eugene Wu
- 4. Michael J. Franklin
Incoming Citations (Sorted by Pagerank)
Showing 12 of 12 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 16 of 16 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,491 | CDAS: A Crowdsourcing Data Analytics System | 2012 | VLDB | 0.00011694982 |
| 2,334 | Counting with the Crowd | 2013 | VLDB | 9.0161817e-05 |
| 37 | Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud | 2012 | VLDB | 0.0007522744 |
| 9,867 | tDP: An Optimal-Latency Budget Allocation Strategy for Crowdsourced MAXIMUM Operations | 2015 | SIGMOD | 4.2675549e-05 |
| 866 | Leveraging Transitive Relations for Crowdsourced Joins | 2013 | SIGMOD | 0.00015801196 |
| 3,067 | CrowdFill: Collecting Structured Data from the Crowd | 2014 | SIGMOD | 7.6180371e-05 |
| 8,967 | Planting Trees for scalable and efficient Canonical Hub Labeling | 2020 | VLDB | 4.4190656e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |
| 3,118 | Scaling Up Crowd-Sourcing to Very Large Datasets: A Case for Active Learning | 2015 | VLDB | 7.5379338e-05 |
| 8,343 | CrowdGame: A Game-Based Crowdsourcing System for Cost-Effective Data Labeling | 2019 | SIGMOD | 4.5429217e-05 |