Database Paper Browser

MLbase: A Distributed Machine-learning System

Summary: MLbase offers a declarative interface and high-level operators so ML researchers can implement scalable algorithms without low-level distributed-systems expertise. Novelty: an optimizer that selects and dynamically adapts learning algorithms and an operator-aware runtime tuned to their data-access patterns. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID: 185
Venue: CIDR
Year: 2013
Pagerank: 0.00020526854
Overall Rank: 543 | 96.23%
DOI: -

Authors

Incoming Citations (Sorted by Pagerank)

Showing 34 of 34 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
316 | NoScope: Optimizing Neural Network Queries over Video at Scale | 2017 | VLDB | 0.00027988668
557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988
761 | Materialization Optimizations for Feature Selection Workloads | 2014 | SIGMOD | 0.00017053783
903 | To Join or Not to Join? Thinking Twice about Joins before Feature Selection | 2016 | SIGMOD | 0.0001547016
921 | Democratizing Data Science through Interactive Curation of ML Pipelines | 2019 | SIGMOD | 0.00015337438
1,167 | Learning Generalized Linear Models Over Normalized Data | 2015 | SIGMOD | 0.00013547713
1,279 | Towards Linear Algebra over Normalized Data | 2017 | VLDB | 0.00012868394
1,350 | Northstar: An Interactive Data Science System | 2018 | VLDB | 0.00012431059
1,402 | Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML | 2014 | VLDB | 0.00012180605
1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681
2,132 | Towards Sustainable Insights or why polygamy is bad for you | 2017 | CIDR | 9.4770432e-05
2,251 | Vizdom: Interactive Analytics through Pen and Touch | 2015 | VLDB | 9.1986441e-05
2,255 | LINVIEW: Incremental View Maintenance for Complex Analytical Queries | 2014 | SIGMOD | 9.1884983e-05
2,350 | An Intermediate Representation for Optimizing Machine Learning Pipelines | 2019 | VLDB | 8.9788641e-05
2,753 | Complaint-driven Training Data Debugging for Query 2.0 | 2020 | SIGMOD | 8.1724339e-05
2,791 | Towards Demystifying Serverless Machine Learning Training | 2021 | SIGMOD | 8.1206618e-05
3,023 | Helix: Accelerating Human-in-the-loop Machine Learning | 2018 | VLDB | 7.6929986e-05
3,455 | A Comparison of Platforms for Implementing and Running Very Large Scale Machine Learning Algorithms | 2014 | SIGMOD | 7.0771839e-05
3,473 | AI Meets Database: AI4DB and DB4AI | 2021 | SIGMOD | 7.062864e-05
3,617 | Ava: From Data to Insights Through Conversation | 2017 | CIDR | 6.9091789e-05
3,948 | A Comparative Evaluation of Systems for Scalable Linear Algebra-based Analytics | 2018 | VLDB | 6.5959084e-05
4,003 | Data Platform for Machine Learning | 2019 | SIGMOD | 6.54347e-05
4,129 | Are Key-Foreign Key Joins Safe to Avoid when Learning High-Capacity Classifiers? | 2018 | VLDB | 6.428887e-05
4,576 | The Missing Piece in Complex Analytics: Low Latency, Scalable Model Management and Serving with Velox | 2015 | CIDR | 6.0721464e-05
5,257 | Probabilistic Demand Forecasting at Scale | 2017 | VLDB | 5.6003925e-05
6,404 | ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation | 2019 | VLDB | 5.0786954e-05
6,538 | Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent | 2019 | SIGMOD | 5.023239e-05
6,986 | A Cost-based Optimizer for Gradient Descent Optimization | 2017 | SIGMOD | 4.8727048e-05
7,306 | DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines | 2022 | CIDR | 4.7678574e-05
7,664 | Schema Independent Relational Learning | 2017 | SIGMOD | 4.6857329e-05
7,920 | JoinBoost: Grow Trees Over Normalized Data Using Only SQL | 2023 | VLDB | 4.6163888e-05
8,300 | sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms | 2015 | SIGMOD | 4.5435639e-05
9,437 | BlockJoin: Efficient Matrix Partitioning Through Joins | 2017 | VLDB | 4.3425552e-05
12,020 | The Case for Personal Data-Driven Decision Making | 2014 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers