Fault Lines: Benchmarking the Impact of Label Data Quality on ML Robustness and Fairness
Summary: FAULT LINES: a model-agnostic benchmark (15 datasets, systematic diverse label corruptions) plus an evaluation suite to measure robustness and fairness across 22 SOTA classifiers. Key result: many models resist random noise but <10% biased noise causes large accuracy and fairness losses; transformers often handle biased noise better than GBDTs but with higher tuning-dependent variance. (summarized by gpt-5-mini on Mar 13 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. David Jackson
- 2. Paul Groth
- 3. Hazar Harmouch
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,404 | Responsible Data Management | 2020 | VLDB | 0.00012174977 |
| 3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05 |
| 6,553 | How do Categorical Duplicates Affect ML? A New Benchmark and Empirical Analyses | 2024 | VLDB | 5.0157344e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,365 | Falcon: Fair Active Learning using Multi-armed Bandits | 2024 | VLDB | 4.3502315e-05 |
| 1,867 | Interpretable Data-Based Explanations for Fairness Debugging | 2022 | SIGMOD | 0.00010272055 |
| 9,684 | How to Design Robust Algorithms using Noisy Comparison Oracle | 2021 | VLDB | 4.3047774e-05 |
| 3,773 | Cleaning Crowdsourced Labels Using Oracles for Statistical Classification | 2019 | VLDB | 6.7758649e-05 |
| 4,769 | Automated Feature Engineering for Algorithmic Fairness | 2021 | VLDB | 5.934329e-05 |
| 1,041 | Interventional Fairness : Causal Database Repair for Algorithmic Fairness | 2019 | SIGMOD | 0.00014482047 |
| 3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05 |
| 7,602 | Causal Feature Selection for Algorithmic Fairness | 2022 | SIGMOD | 4.6988081e-05 |
| 7,046 | Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification | 2022 | SIGMOD | 4.8525913e-05 |
| 10,758 | Stress-Testing ML Pipelines with Adversarial Data Corruption | 2025 | VLDB | 4.1945683e-05 |