Back to papers
Reconstructing and Querying ML Pipeline Intermediates
Summary: Reconstructs and exposes ML pipeline intermediates as queryable, lineage-aware datasets so existing debugging and fairness tools can operate without manual pipeline rewrites. Reduces developer effort and risk of introducing analysis bugs while revealing input–output dependencies.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 477
- Venue
- CIDR
- Year
- 2023
- Pagerank
- 4.1945683e-05
- Overall Rank
- 11,147 | 22.46%
- DOI
-
-
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 9,306 |
Debugging Large-Scale Data Science Pipelines using Dagger |
2020 |
VLDB |
4.3572942e-05 |
| 1,940 |
SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging |
2021 |
SIGMOD |
0.00010020173 |
| 2,456 |
Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities |
2021 |
SIGMOD |
8.7733773e-05 |
| 1,867 |
Interpretable Data-Based Explanations for Fairness Debugging |
2022 |
SIGMOD |
0.00010272055 |
| 9,118 |
Towards Observability for Production Machine Learning Pipelines |
2022 |
VLDB |
4.3928288e-05 |
| 8,257 |
Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines |
2023 |
SIGMOD |
4.5487511e-05 |
| 8,114 |
mlwhatif: What If You Could Stop Re-Implementing Your Machine Learning Pipeline Analyses Over and Over? |
2023 |
VLDB |
4.5823351e-05 |
| 7,000 |
Generating Interpretable Data-Based Explanations for Fairness Debugging using Gopher |
2022 |
SIGMOD |
4.8676312e-05 |
| 6,291 |
Lightweight Inspection of Data Preprocessing in Native Machine Learning Pipelines |
2021 |
CIDR |
5.1269764e-05 |
| 4,734 |
MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines |
2021 |
SIGMOD |
5.9615384e-05 |