LANCET: Labeling Complex Data at Scale
Summary: Unifies auto-labeling tasks: what, how, when. Guided by Covariate-shift and Continuity, LANCET maps data to semantic space, keeps labeled neighbors, and uses a distribution-matching network to decide when labeling is safe; outperforms Snuba/GOGGLES by 30pp. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Huayi Zhang
- 2. Lei Cao
- 3. Samuel Madden
- 4. Elke Rundensteiner
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,769 | VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building | 2023 | VLDB | 4.2856106e-05 |
| 10,365 | Agree to Disagree: Robust Anomaly Detection with Noisy Labels | 2025 | SIGMOD | 4.1945683e-05 |
| 11,008 | MetaStore: Analyzing Deep Learning Meta-Data at Scale | 2024 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 254 | Snorkel: Rapid Training Data Creation with Weak Supervision | 2018 | VLDB | 0.00030540555 |
| 1,215 | Snuba: Automating Weak Supervision to Label Training Data | 2019 | VLDB | 0.0001323375 |
| 2,825 | Smile: A System to Support Machine Learning on EEG Data at Scale | 2019 | VLDB | 8.0563426e-05 |
| 4,471 | GOGGLES: Automatic Image Labeling with Affinity Coding | 2020 | SIGMOD | 6.1555681e-05 |
Previous
Page 1 / 1
Next