MLbase: A Distributed Machine-learning System
Summary: MLbase offers a declarative interface and high-level operators so ML researchers can implement scalable algorithms without low-level distributed-systems expertise. Novelty: an optimizer that selects and dynamically adapts learning algorithms and an operator-aware runtime tuned to their data-access patterns. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Tim Kraska
- 2. Ameet Talwalkar
- 3. John Duchi
- 4. Rean Griffith
- 5. Michael J. Franklin
- 6. Michael Jordan
Incoming Citations (Sorted by Pagerank)
Showing 34 of 34 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 140 | The MADlib Analytics Library or MAD Skills, the SQL | 2012 | VLDB | 0.00042270404 |
| 658 | Towards a Unified Architecture for in-RDBMS Analytics | 2012 | SIGMOD | 0.00018506577 |
| 2,084 | The Case for Predictive Database Systems: Opportunities and Challenges | 2011 | CIDR | 9.5820534e-05 |
| 4,387 | Hybrid In-Database Inference for Declarative Information Extraction | 2011 | SIGMOD | 6.2320072e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,821 | Tensor Relational Algebra for Distributed Machine Learning System Design | 2021 | VLDB | 5.3134851e-05 |
| 9,222 | Towards an Optimized GROUP BY Abstraction for Large-Scale Machine Learning | 2021 | VLDB | 4.3698672e-05 |
| 8,847 | Towards Foundation Database Models | 2025 | CIDR | 4.4371897e-05 |
| 2,122 | SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle | 2020 | CIDR | 9.4989076e-05 |
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 6,191 | Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra | 2021 | SIGMOD | 5.1642282e-05 |
| 1,402 | Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML | 2014 | VLDB | 0.00012180605 |
| 4,906 | Machine Learning for Big Data | 2013 | SIGMOD | 5.8389053e-05 |
| 7,311 | The Machine Learning Bazaar: Harnessing the ML Ecosystem for Effective System Development | 2020 | SIGMOD | 4.7656884e-05 |
| 4,003 | Data Platform for Machine Learning | 2019 | SIGMOD | 6.54347e-05 |