mlwhatif: What If You Could Stop Re-Implementing Your Machine Learning Pipeline Analyses Over and Over?
Summary: Declarative framework for data-centric ML what-if analyses that lets users specify pipeline perturbations and automatically generates, optimizes, and executes the required pipeline variants instead of reimplementing analyses. Demonstrates planner-level optimizations for robustness, data-cleaning and preprocessing/fairness studies across diverse pipelines; open-source mlwhatif library available. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Stefan Grafberger
- 2. Shubha Guha
- 3. Paul Groth
- 4. Sebastian Schelter
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,365 | Falcon: Fair Active Learning using Multi-armed Bandits | 2024 | VLDB | 4.3502315e-05 |
| 10,392 | Shapley Value Estimation Based on Differential Matrix | 2025 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,298 | Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms | 2019 | VLDB | 0.00012758104 |
| 1,404 | Responsible Data Management | 2020 | VLDB | 0.00012174977 |
| 4,734 | MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines | 2021 | SIGMOD | 5.9615384e-05 |
| 8,257 | Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines | 2023 | SIGMOD | 4.5487511e-05 |
| 11,310 | Screening Native ML Pipelines with “ArgusEyes” | 2022 | CIDR | 4.1945683e-05 |
Previous
Page 1 / 1
Next