Witan: Unsupervised Labelling Function Generation for Assisted Data Programming
Summary: Witan generates labelling functions without supervision, enabling unsupervised data programming. It supports interactive modes from exploration to class definition, delivering accurate binary and multiclass labelling with efficient weak supervision. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Benjamin Denham
2. Edmund M-K. Lai
3. Roopak Sinha
4. M. Asif Naeem
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,533 | WeShap: Weak Supervision Source Evaluation with Shapley Values | 2025 | VLDB | 4.1945683e-05 |
| 11,205 | Steered Training Data Generation for Learned Semantic Type Detection | 2023 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 254 | Snorkel: Rapid Training Data Creation with Weak Supervision | 2018 | VLDB | 0.00030540555 |
| 1,215 | Snuba: Automating Weak Supervision to Label Training Data | 2019 | VLDB | 0.0001323375 |
| 5,347 | Adaptive Rule Discovery for Labeling Text Data | 2021 | SIGMOD | 5.5560452e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,963 | Automatic Data Acquisition for Deep Learning | 2021 | VLDB | 5.2526794e-05 |
| 8,714 | LANCET: Labeling Complex Data at Scale | 2021 | VLDB | 4.4619818e-05 |
| 8,590 | Exploratory Training: When Annotators Learn About Data | 2023 | SIGMOD | 4.4896282e-05 |
| 2,958 | The Role of Massively Multi-Task and Weak Supervision in Software 2.0 | 2019 | CIDR | 7.8173975e-05 |
| 5,347 | Adaptive Rule Discovery for Labeling Text Data | 2021 | SIGMOD | 5.5560452e-05 |
| 6,955 | Inspector Gadget: A Data Programming-based Labeling System for Industrial Images | 2021 | VLDB | 4.8864297e-05 |
| 254 | Snorkel: Rapid Training Data Creation with Weak Supervision | 2018 | VLDB | 0.00030540555 |
| 10,533 | WeShap: Weak Supervision Source Evaluation with Shapley Values | 2025 | VLDB | 4.1945683e-05 |
| 8,292 | Nemo: Guiding and Contextualizing Weak Supervision for Interactive Data Programming | 2022 | VLDB | 4.5435639e-05 |
| 1,215 | Snuba: Automating Weak Supervision to Label Training Data | 2019 | VLDB | 0.0001323375 |