Back to papers
M3: Scaling Up Machine Learning via Memory Mapping
Summary: Demonstrates memory-mapped, out-of-core ML with M3 for single-machine scaling of logistic regression and k-means on datasets up to ~190GB. M3 delivers speeds faster than a 4-node Spark cluster and comparable to an 8-node cluster, enabling data-bound ML on one machine.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 5220
- Venue
- SIGMOD
- Year
- 2016
- Pagerank
- -
- Overall Rank
- 13,343 | 7.18%
- DOI
-
10.1145/2882903.2914830
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 0 of 0 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 9,596 |
Scalable Graph Convolutional Network Training on Distributed-Memory Systems |
2023 |
VLDB |
4.319218e-05 |
| 9,222 |
Towards an Optimized GROUP BY Abstraction for Large-Scale Machine Learning |
2021 |
VLDB |
4.3698672e-05 |
| 11,423 |
Scalable Robust Graph Embedding with Spark |
2022 |
VLDB |
4.1945683e-05 |
| 6,404 |
ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation |
2019 |
VLDB |
5.0786954e-05 |
| 3,087 |
Scalable and Efficient Full-Graph GNN Training for Large Graphs |
2023 |
SIGMOD |
7.5939896e-05 |
| 6,330 |
Efficient Construction of Approximate Ad-Hoc ML models Through Materialization and Reuse |
2018 |
VLDB |
5.1077416e-05 |
| 8,300 |
sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms |
2015 |
SIGMOD |
4.5435639e-05 |
| 1,402 |
Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML |
2014 |
VLDB |
0.00012180605 |
| 1,967 |
Compressed Linear Algebra for Large-Scale Machine Learning |
2016 |
VLDB |
9.9131712e-05 |
| 4,802 |
Resource Elasticity for Large-Scale Machine Learning |
2015 |
SIGMOD |
5.9114415e-05 |