BugDoc: A System for Debugging Computational Pipelines
Summary: Provenance-driven automatic root-cause inference for complex pipelines, with iterative, succinct failure explanations. BugDoc demonstrates debugging from few configurations, enabling automatic triage and actionable insights for data-intensive workflows. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Raoni Lourenço
- 2. Juliana Freire
- 3. Dennis Shasha
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,779 | Explaining Inference Queries with Bayesian Optimization | 2021 | VLDB | 4.9280116e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,099 | Interpretable and Informative Explanations of Outcomes | 2015 | VLDB | 0.00014096312 |
| 2,892 | Data Provenance at Internet Scale: Architecture, Experiences, and the Road Ahead | 2017 | CIDR | 7.9480559e-05 |
| 3,105 | Data X-Ray: A Diagnostic Tool for Data Errors | 2015 | SIGMOD | 7.5568954e-05 |
| 8,341 | BugDoc: Algorithms to Debug Computational Processes | 2020 | SIGMOD | 4.5433282e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,147 | Reconstructing and Querying ML Pipeline Intermediates | 2023 | CIDR | 4.1945683e-05 |
| 9,871 | From Logs to Causal Inference: Diagnosing Large Systems | 2025 | VLDB | 4.2667743e-05 |
| 10,820 | APEX-DAG: Library and Language independent Pipeline EXtraction | 2025 | VLDB | 4.1945683e-05 |
| 7,857 | Fixed It For You: Protocol Repair Using Lineage Graphs | 2019 | CIDR | 4.6345517e-05 |
| 4,734 | MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines | 2021 | SIGMOD | 5.9615384e-05 |
| 9,118 | Towards Observability for Production Machine Learning Pipelines | 2022 | VLDB | 4.3928288e-05 |
| 8,586 | A Demonstration of DLBD: Database Logic Bug Detection System | 2023 | VLDB | 4.4902778e-05 |
| 5,086 | Improving Reproducibility of Data Science Pipelines through Transparent Provenance Capture | 2020 | VLDB | 5.7078462e-05 |
| 9,306 | Debugging Large-Scale Data Science Pipelines using Dagger | 2020 | VLDB | 4.3572942e-05 |
| 8,341 | BugDoc: Algorithms to Debug Computational Processes | 2020 | SIGMOD | 4.5433282e-05 |