Database Paper Browser

Only Aggressive Elephants are Fast Elephants

Summary: HAIL extends the HDFS upload pipeline to build a per-block clustered index on every replica, enabling fast selective access from MapReduce jobs. With the default three replicas, HAIL achieves up to 60% faster uploads and up to 68x faster MapReduce queries, demonstrated across six large clusters. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10394
Venue: VLDB
Year: 2012
Pagerank: 5.694494e-05
Overall Rank: 5,105 | 64.49%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 8 of 8 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 20 of 20 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298
80 | Weaving Relations for Cache Performance | 2001 | VLDB | 0.00055721729
103 | Making B+-Trees Cache Conscious in Main Memory | 2000 | SIGMOD | 0.00049150032
661 | Database Tuning Advisor for Microsoft SQL Server 2005 | 2004 | VLDB | 0.00018481174
794 | Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) | 2010 | VLDB | 0.00016605103
868 | Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs | 2011 | VLDB | 0.00015789681
947 | MRShare: Sharing Across Multiple Queries in MapReduce | 2010 | VLDB | 0.00015114576
953 | Runtime Measurements in the Cloud: Observing, Analyzing, and Reducing Variance | 2010 | VLDB | 0.00015095431
960 | A Comparison of Join Algorithms for Log Processing in MapReduce | 2010 | SIGMOD | 0.00015012242
1,280 | Automatic Optimization for MapReduce Programs | 2011 | VLDB | 0.0001285503
1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319
1,863 | Cheetah: A High Performance, Custom Data Warehouse on Top of MapReduce | 2010 | VLDB | 0.00010286531
2,367 | Here are my Data Files. Here are my Queries. Where are my Results? | 2011 | CIDR | 8.9511058e-05
2,439 | CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop | 2011 | VLDB | 8.8190594e-05
2,470 | CoPhy: A Scalable, Portable, and Interactive Index Advisor for Large Workloads | 2011 | VLDB | 8.7333019e-05
2,658 | Data Warehousing and Analytics Infrastructure at Facebook | 2010 | SIGMOD | 8.3607429e-05
3,072 | Constrained Physical Design Tuning | 2008 | VLDB | 7.6114086e-05
3,180 | Energy Management for MapReduce Clusters | 2010 | VLDB | 7.4302009e-05
3,208 | Column-Oriented Storage Techniques for MapReduce | 2011 | VLDB | 7.3781897e-05
9,349 | A Framework for Supporting DBMS-like Indexes in the Cloud | 2011 | VLDB | 4.3526413e-05

Semantically Similar Papers