Back to papers
A Comparative Evaluation of Systems for Scalable Linear Algebra-based Analytics
Summary: LA-focused benchmarks compare scalable linear-algebra systems (MADlib, MLlib, SystemML, ScaLAPACK, SciDB, TensorFlow) on real and synthetic data. Reveals bottlenecks, unusual performance trends, and bugs; guides improvements and provides open code/data for replication.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 11729
- Venue
- VLDB
- Year
- 2018
- Pagerank
- 6.5959084e-05
- Overall Rank
- 3,948 | 72.54%
- DOI
-
10.14778/3275366.3275367
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 15 of 15 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 683 |
Cerebro: A Data System for Optimized Deep Learning Model Selection |
2020 |
VLDB |
0.00018195476 |
| 1,940 |
SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging |
2021 |
SIGMOD |
0.00010020173 |
| 2,350 |
An Intermediate Representation for Optimizing Machine Learning Pipelines |
2019 |
VLDB |
8.9788641e-05 |
| 5,088 |
TCUDB: Accelerating Database with Tensor Processors |
2022 |
SIGMOD |
5.7072189e-05 |
| 5,605 |
TPCx-AI - An Industry Standard Benchmark for Artificial Intelligence and Machine Learning Systems |
2023 |
VLDB |
5.4142007e-05 |
| 6,745 |
DistME: A Fast and Elastic Distributed Matrix Computation Engine using GPUs |
2019 |
SIGMOD |
4.9417155e-05 |
| 7,917 |
Array DBMS: Past, Present, and (Near) Future |
2021 |
VLDB |
4.6173899e-05 |
| 8,514 |
UPLIFT: Parallelization Strategies for Feature Transformations in Machine Learning Workloads |
2022 |
VLDB |
4.4944285e-05 |
| 8,595 |
Towards A Polyglot Framework for Factorized ML |
2021 |
VLDB |
4.4889397e-05 |
| 8,620 |
PreVision: An Out-of-Core Matrix Computation System with Optimal Buffer Replacement |
2024 |
SIGMOD |
4.4837361e-05 |
| 8,864 |
Cerebro: A Layered Data Platform for Scalable Deep Learning |
2021 |
CIDR |
4.4326439e-05 |
| 8,980 |
HADAD: A Lightweight Approach for Optimizing Hybrid Complex Analytics Queries |
2021 |
SIGMOD |
4.4169807e-05 |
| 10,177 |
InferF: Declarative Factorization of AI/ML Inferences over Joins |
2026 |
SIGMOD |
4.1945683e-05 |
| 11,339 |
Redundancy Elimination in Distributed Matrix Computation |
2022 |
SIGMOD |
4.1945683e-05 |
| 11,472 |
Hybrid Evaluation for Distributed Iterative Matrix Computation |
2021 |
SIGMOD |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 15 of 15 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 140 |
The MADlib Analytics Library or MAD Skills, the SQL |
2012 |
VLDB |
0.00042270404 |
| 183 |
Automatic Database Management System Tuning Through Large-scale Machine Learning |
2017 |
SIGMOD |
0.00036721403 |
| 318 |
Overview of SciDB: Large Scale Array Storage, Processing and Analysis |
2010 |
SIGMOD |
0.00027795661 |
| 543 |
MLbase: A Distributed Machine-learning System |
2013 |
CIDR |
0.00020526854 |
| 557 |
SystemML: Declarative Machine Learning on Spark |
2016 |
VLDB |
0.00020197988 |
| 1,071 |
Starfish: A Self-tuning System for Big Data Analytics |
2011 |
CIDR |
0.00014312777 |
| 1,158 |
Simulation of Database-Valued Markov Chains Using SimSQL |
2013 |
SIGMOD |
0.0001361064 |
| 1,167 |
Learning Generalized Linear Models Over Normalized Data |
2015 |
SIGMOD |
0.00013547713 |
| 1,279 |
Towards Linear Algebra over Normalized Data |
2017 |
VLDB |
0.00012868394 |
| 1,402 |
Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML |
2014 |
VLDB |
0.00012180605 |
| 2,623 |
GenBase: A Complex Analytics Genomics Benchmark |
2014 |
SIGMOD |
8.4374366e-05 |
| 3,343 |
Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads |
2017 |
VLDB |
7.1967343e-05 |
| 3,455 |
A Comparison of Platforms for Implementing and Running Very Large Scale Machine Learning Algorithms |
2014 |
SIGMOD |
7.0771839e-05 |
| 3,982 |
The Myria Big Data Management and Analytics System and Cloud Service |
2017 |
CIDR |
6.5651188e-05 |
| 6,542 |
Profiling R on a Contemporary Processor |
2015 |
VLDB |
5.0216639e-05 |
Semantically Similar Papers