CDB: A Crowd-Powered Database System
Summary: CDB: crowd-powered DB with a graph-based, fine-grained tuple-level optimization model. Simultaneous cost, latency, and quality optimization; outperforms tree-based plans and enables real-world deployments for web table integration and entity collection. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Guoliang Li
- 2. Chengliang Chai
- 3. Ju Fan
- 4. Xueping Weng
- 5. Jian Li
- 6. Yudian Zheng
- 7. Yuanbing Li
- 8. Xiang Yu
- 9. Xiaohang Zhang
- 10. Haitao Yuan
Incoming Citations (Sorted by Pagerank)
Showing 10 of 10 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,963 | DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing | 2025 | VLDB | 9.929429e-05 |
| 4,102 | GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data | 2023 | SIGMOD | 6.4522929e-05 |
| 5,381 | Selective Data Acquisition in the Wild for Model Charging | 2022 | VLDB | 5.5399508e-05 |
| 5,734 | Efficient Algorithms for Crowd-Aided Categorization | 2020 | VLDB | 5.3482904e-05 |
| 6,426 | Fluid: A Blockchain based Framework for Crowdsourcing | 2019 | SIGMOD | 5.0670573e-05 |
| 6,569 | Domain Adaptation for Deep Entity Resolution | 2022 | SIGMOD | 5.0065379e-05 |
| 7,575 | Human-in-the-loop Outlier Detection | 2020 | SIGMOD | 4.7068909e-05 |
| 9,043 | Query-Guided Resolution in Uncertain Databases | 2023 | SIGMOD | 4.4039656e-05 |
| 11,000 | MisDetect: Iterative Mislabel Detection using Early Loss | 2024 | VLDB | 4.1945683e-05 |
| 11,399 | ActivePDB: Active Probabilistic Databases | 2022 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 94 | CrowdDB: Answering Queries with Crowdsourcing | 2011 | SIGMOD | 0.00051013264 |
| 249 | Crowdsourced Databases: Query Processing with People | 2011 | CIDR | 0.00030740523 |
| 866 | Leveraging Transitive Relations for Crowdsourced Joins | 2013 | SIGMOD | 0.00015801196 |
| 2,937 | Truth Inference in Crowdsourcing: Is the Problem Solved? | 2017 | VLDB | 7.853108e-05 |
| 3,263 | QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications | 2015 | SIGMOD | 7.3097573e-05 |
| 4,579 | Crowdsourced Top-k Algorithms: An Experimental Evaluation | 2016 | VLDB | 6.070469e-05 |
| 5,362 | Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach | 2016 | SIGMOD | 5.5473503e-05 |
| 11,788 | CDB: Optimizing Queries with Crowd-Based Selections and Joins | 2017 | SIGMOD | 4.1945683e-05 |
| 11,816 | DOCS: Domain-Aware Crowdsourcing System | 2017 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,806 | Query Optimization over Crowdsourced Data | 2013 | VLDB | 4.9218336e-05 |
| 514 | An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning | 2019 | SIGMOD | 0.0002124895 |
| 7,668 | Human-in-the-loop Data Integration | 2017 | VLDB | 4.6834075e-05 |
| 7,117 | Crowdsourced Data Management: Overview and Challenges | 2017 | SIGMOD | 4.826509e-05 |
| 3,935 | CrowdQ: Crowdsourced Query Understanding | 2013 | CIDR | 6.6163464e-05 |
| 5,450 | Crowdsourcing Applications and Platforms: A Data Management Perspective | 2011 | VLDB | 5.5003491e-05 |
| 7,572 | Pushing the Boundaries of Crowd-enabled Databases with Query-driven Schema Expansion | 2012 | VLDB | 4.7075553e-05 |
| 1,885 | CrowdDB: Query Processing with the VLDB Crowd | 2011 | VLDB | 0.0001021098 |
| 94 | CrowdDB: Answering Queries with Crowdsourcing | 2011 | SIGMOD | 0.00051013264 |
| 11,788 | CDB: Optimizing Queries with Crowd-Based Selections and Joins | 2017 | SIGMOD | 4.1945683e-05 |