Database Paper Browser

Back to papers

HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads

Summary: Hybrid architecture blending MapReduce scalability with DBMS-style optimization on shared-nothing hardware. Aims to match parallel DBMS performance while preserving MapReduce fault tolerance, scalability, and flexibility for cloud analytics. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
9957
Venue
VLDB
Year
2009
Pagerank
0.00040397359
Overall Rank
157 | 98.91%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 22 of 72 citing papers.

Rank Citing Paper Year Venue Pagerank
7,294 Optimization for iterative queries on MapReduce 2014 VLDB 4.773119e-05
7,958 CARTILAGE: Adding Flexibility to the Hadoop Skeleton 2013 SIGMOD 4.613363e-05
8,084 ScalaGiST: Scalable Generalized Search Trees for MapReduce Systems [Innovative Systems Paper] 2014 VLDB 4.5902866e-05
8,413 A Batch of PNUTS: Experiences Connecting Cloud Batch and Serving Systems 2011 SIGMOD 4.5203012e-05
8,464 Piranha: Optimizing Short Jobs in Hadoop 2013 VLDB 4.5052127e-05
8,924 QMapper for Smart Grid: Migrating SQL-based Application to Hive 2015 SIGMOD 4.427232e-05
9,004 DataGarage: Warehousing Massive Performance Data on Commodity Servers 2010 VLDB 4.4102022e-05
9,347 Rank Join Queries in NoSQL Databases 2014 VLDB 4.3526718e-05
9,607 Polyglot Data Management: State of the Art & Open Challenges 2022 VLDB 4.3177432e-05
9,894 OceanRT: Real-Time Analytics over Large Temporal Data 2014 SIGMOD 4.2602616e-05
10,591 Accio: Bolt-on Query Federation 2025 VLDB 4.1945683e-05
11,437 Two-Attribute Skew Free, Isolated CP Theorem, and Massively Parallel Joins 2021 PODS 4.1945683e-05
11,635 Automated Performance Management for the Big Data Stack 2019 CIDR 4.1945683e-05
11,690 Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology 2019 VLDB 4.1945683e-05
11,948 Tutorial: SQL-on-Hadoop Systems 2015 VLDB 4.1945683e-05
11,949 Big Data Research: Will Industry Solve all the Problems? 2015 VLDB 4.1945683e-05
11,987 DGFIndex for Smart Grid: Enhancing Hive with a Cost-Effective Multidimensional Range Index 2014 VLDB 4.1945683e-05
12,005 Design and Implementation of a Real-Time Interactive Analytics System for Large Spatio-Temporal Data 2014 VLDB 4.1945683e-05
12,028 D-Hive: Data Bees Pollinating RDF, Text, and Time 2013 CIDR 4.1945683e-05
12,055 ODYS: An Approach to Building a Massively-Parallel Search Engine Using a DB-IR Tightly-Integrated Parallel DBMS for Higher-Level Functionality 2013 SIGMOD 4.1945683e-05
12,203 Resiliency-Aware Data Management 2011 VLDB 4.1945683e-05
12,226 Indexing Multi-dimensional Data in a Cloud System 2010 SIGMOD 4.1945683e-05
Previous Page 2 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers