Stubby: A Transformation-based Optimizer for MapReduce Workflows
Summary: Stubby is a transformation-based, cost-aware optimizer for MapReduce workflows, enabling cost-driven search over a defined plan space via transformations. It targets a tractable subspace, extensible to new interfaces and optimizations, with evaluation on diverse workflows. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Harold Lim
- 2. Herodotos Herodotou
- 3. Shivnath Babu
Incoming Citations (Sorted by Pagerank)
Showing 13 of 13 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298 |
| 132 | The EXODUS Optimizer Generator | 1987 | SIGMOD | 0.00042994082 |
| 780 | Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience | 2009 | VLDB | 0.00016775082 |
| 868 | Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs | 2011 | VLDB | 0.00015789681 |
| 947 | MRShare: Sharing Across Multiple Queries in MapReduce | 2010 | VLDB | 0.00015114576 |
| 1,280 | Automatic Optimization for MapReduce Programs | 2011 | VLDB | 0.0001285503 |
| 1,754 | Querying Multiple Features of Groups in Relational Databases | 1996 | VLDB | 0.00010670609 |
| 2,575 | A Latency and Fault-Tolerance Optimizer for Online Parallel Query Plans | 2011 | SIGMOD | 8.5133576e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,462 | Efficient and Provable Multi-Query Optimization | 2017 | PODS | 7.0703696e-05 |
| 979 | Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads | 2012 | VLDB | 0.0001488055 |
| 8,108 | Execution Primitives for Scalable Joins and Aggregations in Map Reduce | 2014 | VLDB | 4.5846987e-05 |
| 2,205 | ReStore: Reusing Results of MapReduce Jobs | 2012 | VLDB | 9.2920002e-05 |
| 11,976 | Anti-Combining for MapReduce | 2014 | SIGMOD | 4.1945683e-05 |
| 7,304 | MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs | 2014 | VLDB | 4.7684491e-05 |
| 8,358 | MapReduce Programming and Cost-based Optimization? Crossing this Chasm with Starfish | 2011 | VLDB | 4.5372998e-05 |
| 1,280 | Automatic Optimization for MapReduce Programs | 2011 | VLDB | 0.0001285503 |
| 3,703 | Multi-Query Optimization in MapReduce Framework | 2014 | VLDB | 6.8289978e-05 |
| 868 | Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs | 2011 | VLDB | 0.00015789681 |