An Efficient MapReduce Cube Algorithm for Varied Data Distributions
Summary: MapReduce cube algorithm using SP-Sketch to detect skew and balance workload, enabling robust cube computation across diverse data distributions. Theory and experiments show speedups and lower communication versus prior MapReduce methods and Pig/Hive. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Tova Milo
- 2. Eyal Altshuler
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,300 | sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms | 2015 | SIGMOD | 4.5435639e-05 |
| 11,933 | FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data | 2015 | VLDB | 4.1945683e-05 |
| 3,703 | Multi-Query Optimization in MapReduce Framework | 2014 | VLDB | 6.8289978e-05 |
| 8,108 | Execution Primitives for Scalable Joins and Aggregations in Map Reduce | 2014 | VLDB | 4.5846987e-05 |
| 4,061 | Advanced Partitioning Techniques for Massively Distributed Computation | 2012 | SIGMOD | 6.483587e-05 |
| 8,978 | SpongeFiles: Mitigating Data Skew in MapReduce Using Distributed Memory | 2014 | SIGMOD | 4.417225e-05 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 2,674 | Minimal MapReduce Algorithms | 2013 | SIGMOD | 8.3328645e-05 |
| 1,191 | Fast Computation of Sparse Datacubes | 1997 | VLDB | 0.00013434201 |