Database Paper Browser

Back to papers

Minimal MapReduce Algorithms

Summary: Introduces the 'minimal algorithm' notion for MapReduce, optimizing load balancing, space, CPU, I/O, and network cost within a small constant factor. Shows existence of elegant minimal algorithms for fundamental database problems, validated by extensive experiments. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4652
Venue
SIGMOD
Year
2013
Pagerank
8.3328645e-05
Overall Rank
2,674 | 81.40%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 11 of 11 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 36 of 36 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
3 Pig Latin: A Not-So-Foreign Language for Data Processing 2008 SIGMOD 0.0024183614
22 SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets 2008 VLDB 0.0008456613
109 Dremel: Interactive Analysis of Web-Scale Datasets 2010 VLDB 0.00048186983
157 HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads 2009 VLDB 0.00040397359
447 Efficient Parallel Set-Similarity Joins Using MapReduce 2010 SIGMOD 0.00022900171
644 Densest Subgraph in Streaming and MapReduce 2012 VLDB 0.00018748988
794 Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) 2010 VLDB 0.00016605103
868 Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs 2011 VLDB 0.00015789681
886 Fast Personalized PageRank on MapReduce 2011 SIGMOD 0.00015597161
913 Tenzing A SQL Implementation On The MapReduce Framework 2011 VLDB 0.00015408131
947 MRShare: Sharing Across Multiple Queries in MapReduce 2010 VLDB 0.00015114576
960 A Comparison of Join Algorithms for Log Processing in MapReduce 2010 SIGMOD 0.00015012242
1,074 Processing Theta-Joins using MapReduce* 2011 SIGMOD 0.00014260096
1,110 Parallel Evaluation of Conjunctive Queries 2011 PODS 0.00013968198
1,265 Jaql: A Scripting Language for Large Scale Semistructured Data Analysis 2011 VLDB 0.00012947629
1,280 Automatic Optimization for MapReduce Programs 2011 VLDB 0.0001285503
1,334 SkewTune: Mitigating Skew in MapReduce Applications 2012 SIGMOD 0.0001250413
1,464 Online Aggregation for Large MapReduce Jobs 2011 VLDB 0.00011865546
1,534 PerfXplain: Debugging MapReduce Job Performance 2012 VLDB 0.00011468393
1,715 V-SMART-Join: A Scalable MapReduce Framework for All-Pair Similarity Joins of Multisets and Vectors 2012 VLDB 0.00010803271
1,770 ParaTimer: A Progress Indicator for MapReduce DAGs 2010 SIGMOD 0.00010618229
1,863 Cheetah: A High Performance, Custom Data Warehouse on Top of MapReduce 2010 VLDB 0.00010286531
1,886 Social Content Matching in MapReduce 2011 VLDB 0.00010208945
1,931 Efficient Processing of k Nearest Neighbor Joins using MapReduce 2012 VLDB 0.00010040427
2,205 ReStore: Reusing Results of MapReduce Jobs 2012 VLDB 9.2920002e-05
2,439 CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop 2011 VLDB 8.8190594e-05
2,630 PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce 2009 VLDB 8.4128091e-05
2,747 Stubby: A Transformation-based Optimizer for MapReduce Workflows 2012 VLDB 8.1828918e-05
3,062 Efficient Multi-way Theta-Join Processing Using MapReduce 2012 VLDB 7.6343994e-05
3,115 Llama: Leveraging Columnar Storage for Scalable Join Processing in the MapReduce Framework 2011 SIGMOD 7.543505e-05
3,180 Energy Management for MapReduce Clusters 2010 VLDB 7.4302009e-05
3,208 Column-Oriented Storage Techniques for MapReduce 2011 VLDB 7.3781897e-05
3,279 Early Accurate Results for Advanced Analytics on MapReduce 2012 VLDB 7.2855494e-05
3,504 M3R: Increased Performance for In-Memory Hadoop Jobs 2012 VLDB 7.0347515e-05
3,626 Behavioral Simulations in MapReduce 2010 VLDB 6.9047458e-05
5,903 Building Wavelet Histograms on Large Data in MapReduce 2012 VLDB 5.2791351e-05
Previous Page 1 / 1 Next

Semantically Similar Papers