Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing)
Summary: Introduces Hadoop++, a UDF-driven acceleration layer that speeds MapReduce tasks inside Hadoop without modifying the framework. UDF injections at key points boost indexing and joins, beating Hadoop and HadoopDB while staying compatible with future changes. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Jens Dittrich
2. Yagiz Kargin
3. Jorge-Arnulfo Quiané-Ruiz
4. Vinay Setty
5. Alekh Jindal
6. Jörg Schad
Incoming Citations (Sorted by Pagerank)
Showing 39 of 39 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 15 | Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters | 2007 | SIGMOD | 0.0010654262 |
| 22 | SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets | 2008 | VLDB | 0.0008456613 |
| 42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298 |
| 70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166 |
| 157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359 |
| 168 | MAD Skills: New Analysis Practices for Big Data | 2009 | VLDB | 0.00038946305 |
| 238 | Cache Conscious Indexing for Decision-Support in Main Memory | 1999 | VLDB | 0.00031642425 |
| 780 | Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience | 2009 | VLDB | 0.00016775082 |
| 953 | Runtime Measurements in the Cloud: Observing, Analyzing, and Reducing Variance | 2010 | VLDB | 0.00015095431 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,933 | FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data | 2015 | VLDB | 4.1945683e-05 |
| 3,517 | Integrating Hadoop and Parallel DBMS | 2010 | SIGMOD | 7.0199423e-05 |
| 12,101 | Optimization Strategies for A/B Testing on HADOOP | 2013 | VLDB | 4.1945683e-05 |
| 7,270 | Oracle In-Database Hadoop: When MapReduce Meets RDBMS | 2012 | SIGMOD | 4.7813984e-05 |
| 2,337 | Efficient Processing of Data Warehousing Queries in a Split Execution Environment | 2011 | SIGMOD | 9.0098186e-05 |
| 3,208 | Column-Oriented Storage Techniques for MapReduce | 2011 | VLDB | 7.3781897e-05 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 9,375 | Efficient Big Data Processing in Hadoop MapReduce | 2012 | VLDB | 4.347384e-05 |
| 157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |