Database Paper Browser

Back to papers

Cumulon: Optimizing Statistical Data Analysis in the Cloud

Summary: Cumulon: cloud-native system for rapid matrix analytics development and deployment. Automatic optimization across operators, parameters, and hardware provisioning under time/budget constraints; implemented atop Hadoop/HDFS to avoid MapReduce limits. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4682
Venue
SIGMOD
Year
2013
Pagerank
8.3413995e-05
Overall Rank
2,667 | 81.45%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 21 of 21 citing papers.

Rank Citing Paper Year Venue Pagerank
557 SystemML: Declarative Machine Learning on Spark 2016 VLDB 0.00020197988
1,402 Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML 2014 VLDB 0.00012180605
1,532 Data Management in Machine Learning: Challenges, Techniques, and Systems 2017 SIGMOD 0.00011472681
1,940 SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging 2021 SIGMOD 0.00010020173
1,967 Compressed Linear Algebra for Large-Scale Machine Learning 2016 VLDB 9.9131712e-05
3,878 Data Canopy: Accelerating Exploratory Statistical Analysis 2017 SIGMOD 6.6731435e-05
3,918 On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML 2018 VLDB 6.6315176e-05
4,505 SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning 2017 CIDR 6.1327108e-05
4,774 LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems 2021 SIGMOD 5.9316087e-05
4,802 Resource Elasticity for Large-Scale Machine Learning 2015 SIGMOD 5.9114415e-05
4,833 MNC: Structure-Exploiting Sparsity Estimation for Matrix Expressions 2019 SIGMOD 5.8916346e-05
5,487 SPORES: Sum-Product Optimization via Relational Equality Saturation for Large Scale Linear Algebra 2020 VLDB 5.4791501e-05
6,191 Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra 2021 SIGMOD 5.1642282e-05
6,986 A Cost-based Optimizer for Gradient Descent Optimization 2017 SIGMOD 4.8727048e-05
8,262 FuseME: Distributed Matrix Computation Engine based on Cuboid-based Fused Operator and Plan Generation 2022 SIGMOD 4.5467867e-05
9,437 BlockJoin: Efficient Matrix Partitioning Through Joins 2017 VLDB 4.3425552e-05
9,596 Scalable Graph Convolutional Network Training on Distributed-Memory Systems 2023 VLDB 4.319218e-05
9,947 Cumulon: Matrix-Based Data Analytics in the Cloud with Spot Instances 2016 VLDB 4.2431724e-05
10,381 LCP: Enhancing Scientific Data Management with Lossy Compression for Particles 2025 SIGMOD 4.1945683e-05
11,472 Hybrid Evaluation for Distributed Iterative Matrix Computation 2021 SIGMOD 4.1945683e-05
13,339 Cumulon-D: Data Analytics in a Dynamic Spot Market 2017 VLDB -
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers