Counting with the Crowd
Summary: Crowd workers estimate the selectivity of a predicate, that is, the fraction of items in a dataset that satisfy some property, without every item being examined. Count-based estimation, where workers see a batch of items and estimate how many satisfy the predicate, works best on image datasets, while sampled labeling wins on text; a spammer/collusion detector boosts accuracy by up to 100×. (summarized by gpt-5-nano on Feb 09 2026)
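To make the two estimation styles in the summary concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the paper's implementation. The function names, the sample data, and the `tolerance` parameter are all hypothetical, and the median-based filter is a simple stand-in rather than the spammer/collusion detector the paper describes.

```python
from statistics import mean, median


def estimate_from_labels(labels):
    """Sampled labeling: fraction of sampled items that workers labeled True."""
    return sum(labels) / len(labels)


def estimate_from_counts(reports, batch_size):
    """Count-based: average the per-worker fractions (reported count / batch size)."""
    return mean(r / batch_size for r in reports)


def drop_outlier_reports(reports, batch_size, tolerance=0.2):
    """Stand-in filter (NOT the paper's detector): keep only reports whose
    fraction lies within `tolerance` of the median reported fraction."""
    fractions = [r / batch_size for r in reports]
    mid = median(fractions)
    return [r for r, f in zip(reports, fractions) if abs(f - mid) <= tolerance]


# Hypothetical data: 10 individually labeled items, and 6 workers who each
# looked at a batch of 10 items and reported how many satisfy the predicate.
labels = [True, False, False, True, False, False, False, True, False, False]
reports = [3, 4, 2, 3, 10, 3]  # the report of 10 looks like a spammer
batch_size = 10

label_estimate = estimate_from_labels(labels)
raw_count_estimate = estimate_from_counts(reports, batch_size)
kept_reports = drop_outlier_reports(reports, batch_size)
filtered_count_estimate = estimate_from_counts(kept_reports, batch_size)
```

In this toy data a single extreme report pulls the raw count-based estimate well above the label-based one, and removing it brings the two estimates back into agreement. That is the motivation for filtering unreliable workers, though the size of the effect here says nothing about the paper's measured results.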
Authors
1. Adam Marcus
2. David Karger
3. Samuel Madden
4. Robert Miller
5. Sewoong Oh
Incoming Citations (Sorted by Pagerank)
Showing 18 of 18 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 94 | CrowdDB: Answering Queries with Crowdsourcing | 2011 | SIGMOD | 0.00051013264 |
| 119 | Answering Queries using Humans, Algorithms and Databases | 2011 | CIDR | 0.0004564788 |
| 249 | Crowdsourced Databases: Query Processing with People | 2011 | CIDR | 0.00030740523 |
| 267 | Human-powered Sorts and Joins | 2012 | VLDB | 0.00029690405 |
| 859 | So Who Won? Dynamic Max Discovery with the Crowd | 2012 | SIGMOD | 0.00015870894 |
| 1,164 | CrowdScreen: Algorithms for Filtering Data with Humans | 2012 | SIGMOD | 0.00013564823 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,788 | CDB: Optimizing Queries with Crowd-Based Selections and Joins | 2017 | SIGMOD | 4.1945683e-05 |
| 39 | Statistical Estimators for Relational Algebra Expressions | 1988 | PODS | 0.00074745564 |
| 3,118 | Scaling Up Crowd-Sourcing to Very Large Datasets: A Case for Active Learning | 2015 | VLDB | 7.5379338e-05 |
| 94 | CrowdDB: Answering Queries with Crowdsourcing | 2011 | SIGMOD | 0.00051013264 |
| 3,702 | Every Row Counts: Combining Sketches and Sampling for Accurate Group-By Result Estimates | 2019 | CIDR | 6.8295759e-05 |
| 267 | Human-powered Sorts and Joins | 2012 | VLDB | 0.00029690405 |
| 4,579 | Crowdsourced Top-k Algorithms: An Experimental Evaluation | 2016 | VLDB | 6.070469e-05 |
| 1,164 | CrowdScreen: Algorithms for Filtering Data with Humans | 2012 | SIGMOD | 0.00013564823 |
| 249 | Crowdsourced Databases: Query Processing with People | 2011 | CIDR | 0.00030740523 |
| 7,251 | Learning to Sample: Counting with Complex Queries | 2020 | VLDB | 4.7890519e-05 |