SkewTune in Action: Mitigating Skew in MapReduce Applications
Summary: SkewTune automatically mitigates data skew in MapReduce jobs and serves as a drop-in replacement for Hadoop. The demonstration shows runtime skew mitigation on real cloud workloads, along with an interactive GUI that details how skew is handled in both real and synthetic cases. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. YongChul Kwon
2. Magdalena Balazinska
3. Bill Howe
4. Jerome Rolia
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,334 | SkewTune: Mitigating Skew in MapReduce Applications | 2012 | SIGMOD | 0.0001250413 |
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 413 | HaLoop: Efficient Iterative Data Processing on Large Clusters | 2010 | VLDB | 0.00023904409 |
| 1,334 | SkewTune: Mitigating Skew in MapReduce Applications | 2012 | SIGMOD | 0.0001250413 |
| 2,208 | Clustera: An Integrated Computation And Data Management System | 2008 | VLDB | 9.2873257e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,266 | Redoop Infrastructure for Recurring Big Data Queries | 2014 | VLDB | 4.3667196e-05 |
| 7,304 | MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs | 2014 | VLDB | 4.7684491e-05 |
| 11,797 | Runtime Optimization of Join Location in Parallel Data Management Systems | 2017 | VLDB | 4.1945683e-05 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 588 | Practical Skew Handling in Parallel Joins | 1992 | VLDB | 0.00019604754 |
| 8,978 | SpongeFiles: Mitigating Data Skew in MapReduce Using Distributed Memory | 2014 | SIGMOD | 4.417225e-05 |
| 11,835 | An Efficient MapReduce Cube Algorithm for Varied Data Distributions | 2016 | SIGMOD | 4.1945683e-05 |
| 1,365 | Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning | 1991 | VLDB | 0.00012368421 |
| 11,933 | FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data | 2015 | VLDB | 4.1945683e-05 |
| 1,334 | SkewTune: Mitigating Skew in MapReduce Applications | 2012 | SIGMOD | 0.0001250413 |