The Machine Learning Bazaar: Harnessing the ML Ecosystem for Effective System Development
Summary: Unifies ML libraries through an ML primitives API and pipeline composition to automate end-to-end ML systems. Supports AutoML via Bayesian optimization and bandit strategies on a multi-task, cross-modal platform, evaluated on 456 tasks and 2.5M pipelines. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,957 | Doing More with Less: Characterizing Dataset Downsampling for AutoML | 2021 | VLDB | 5.8035715e-05 |
| 6,448 | Sintel: A Machine Learning Framework to Extract Insights from Signals | 2022 | SIGMOD | 5.0587973e-05 |
| 11,429 | Leam: An Interactive System for In-situ Visual Text Analysis | 2021 | CIDR | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 921 | Democratizing Data Science through Interactive Curation of ML Pipelines | 2019 | SIGMOD | 0.00015337438 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 1,281 | DataHub: Collaborative Data Science & Dataset Version Management at Scale | 2015 | CIDR | 0.00012854744 |
| 2,251 | Vizdom: Interactive Analytics through Pen and Touch | 2015 | VLDB | 9.1986441e-05 |
| 2,896 | Evaluating End-to-End Optimization for Data Analytics Applications in Weld | 2018 | VLDB | 7.9452051e-05 |
| 4,576 | The Missing Piece in Complex Analytics: Low Latency, Scalable Model Management and Serving with Velox | 2015 | CIDR | 6.0721464e-05 |
| 4,748 | Rafiki: Machine Learning as an Analytics Service System | 2019 | VLDB | 5.9526539e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,313 | Towards Observability for Machine Learning Pipelines | 2022 | CIDR | 4.1945683e-05 |
| 2,456 | Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities | 2021 | SIGMOD | 8.7733773e-05 |
| 13,267 | Towards Scalable Online Machine Learning Collaborations with OpenML | 2021 | VLDB | - |
| 11,310 | Screening Native ML Pipelines with “ArgusEyes” | 2022 | CIDR | 4.1945683e-05 |
| 13,098 | Demonstrating CatDB: LLM-based Generation of Data-centric ML Pipelines | 2025 | SIGMOD | - |
| 9,118 | Towards Observability for Production Machine Learning Pipelines | 2022 | VLDB | 4.3928288e-05 |
| 5,304 | A Scalable AutoML Approach Based on Graph Neural Networks | 2022 | VLDB | 5.5779335e-05 |
| 2,122 | SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle | 2020 | CIDR | 9.4989076e-05 |
| 921 | Democratizing Data Science through Interactive Curation of ML Pipelines | 2019 | SIGMOD | 0.00015337438 |
| 543 | MLbase: A Distributed Machine-learning System | 2013 | CIDR | 0.00020526854 |