Multi-Query Optimization in MapReduce Framework
Summary: MapReduce multi-job optimization: grouping merges jobs for shared scans and map outputs; partial-materialization shares map outputs. An algorithm partitions the batch and assigns per-group techniques; Hadoop experiments show large gains over MRShare. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Guoping Wang
- 2. Chee-Yong Chan
Incoming Citations (Sorted by Pagerank)
Showing 13 of 13 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166 |
| 780 | Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience | 2009 | VLDB | 0.00016775082 |
| 868 | Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs | 2011 | VLDB | 0.00015789681 |
| 947 | MRShare: Sharing Across Multiple Queries in MapReduce | 2010 | VLDB | 0.00015114576 |
| 1,280 | Automatic Optimization for MapReduce Programs | 2011 | VLDB | 0.0001285503 |
| 2,205 | ReStore: Reusing Results of MapReduce Jobs | 2012 | VLDB | 9.2920002e-05 |
| 2,747 | Stubby: A Transformation-based Optimizer for MapReduce Workflows | 2012 | VLDB | 8.1828918e-05 |
| 5,903 | Building Wavelet Histograms on Large Data in MapReduce | 2012 | VLDB | 5.2791351e-05 |
| 8,358 | MapReduce Programming and Cost-based Optimization? Crossing this Chasm with Starfish | 2011 | VLDB | 4.5372998e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 960 | A Comparison of Join Algorithms for Log Processing in MapReduce | 2010 | SIGMOD | 0.00015012242 |
| 1,280 | Automatic Optimization for MapReduce Programs | 2011 | VLDB | 0.0001285503 |
| 42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298 |
| 7,294 | Optimization for iterative queries on MapReduce | 2014 | VLDB | 4.773119e-05 |
| 868 | Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs | 2011 | VLDB | 0.00015789681 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |
| 15 | Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters | 2007 | SIGMOD | 0.0010654262 |
| 3,062 | Efficient Multi-way Theta-Join Processing Using MapReduce | 2012 | VLDB | 7.6343994e-05 |
| 2,674 | Minimal MapReduce Algorithms | 2013 | SIGMOD | 8.3328645e-05 |
| 947 | MRShare: Sharing Across Multiple Queries in MapReduce | 2010 | VLDB | 0.00015114576 |