The Role of Massively Multi-Task and Weak Supervision in Software 2.0
Summary: Vision: program Software 2.0 by labeling—declarative weak supervision aggregated via unsupervised label models to cheaply generate training data. Introduce massively multitask central models to amortize labeling across many tasks and validate via Snorkel deployments (ad fraud, diagnostics). (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,206 | Panorama: A Data System for Unbounded Vocabulary Querying over Video | 2020 | VLDB | 7.3826363e-05 |
| 4,196 | Overton: A Data System for Monitoring and Improving Machine-Learned Products | 2020 | CIDR | 6.3686231e-05 |
| 4,749 | Slice Tuner: A Selective Data Acquisition Framework for Accurate and Fair Machine Learning Models | 2021 | SIGMOD | 5.9503689e-05 |
| 9,434 | Rock: Cleaning Data by Embedding ML in Logic Rules | 2024 | SIGMOD | 4.3430376e-05 |
| 11,543 | Migrating a Privacy-Safe Information Extraction System to a Software 2.0 Design | 2020 | CIDR | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 254 | Snorkel: Rapid Training Data Creation with Weak Supervision | 2018 | VLDB | 0.00030540555 |
| 1,215 | Snuba: Automating Weak Supervision to Label Training Data | 2019 | VLDB | 0.0001323375 |
| 5,251 | Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale | 2019 | SIGMOD | 5.6029615e-05 |
Previous
Page 1 / 1
Next