HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads
Summary: HadoopDB is a hybrid architecture that combines the scalability of MapReduce with DBMS-style optimization on shared-nothing hardware. It aims to match the performance of parallel DBMSs while preserving the fault tolerance, scalability, and flexibility of MapReduce for cloud analytics.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 9957
- Venue: VLDB
- Year: 2009
- Pagerank: 0.00040397359
- Overall Rank: 157 | 98.91%
[Figure: Incoming Non-self Citations Over Time]
Incoming Citations (Sorted by Pagerank)
Showing 22 of 72 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,294 | Optimization for iterative queries on MapReduce | 2014 | VLDB | 4.773119e-05 |
| 7,958 | CARTILAGE: Adding Flexibility to the Hadoop Skeleton | 2013 | SIGMOD | 4.613363e-05 |
| 8,084 | ScalaGiST: Scalable Generalized Search Trees for MapReduce Systems [Innovative Systems Paper] | 2014 | VLDB | 4.5902866e-05 |
| 8,413 | A Batch of PNUTS: Experiences Connecting Cloud Batch and Serving Systems | 2011 | SIGMOD | 4.5203012e-05 |
| 8,464 | Piranha: Optimizing Short Jobs in Hadoop | 2013 | VLDB | 4.5052127e-05 |
| 8,924 | QMapper for Smart Grid: Migrating SQL-based Application to Hive | 2015 | SIGMOD | 4.427232e-05 |
| 9,004 | DataGarage: Warehousing Massive Performance Data on Commodity Servers | 2010 | VLDB | 4.4102022e-05 |
| 9,347 | Rank Join Queries in NoSQL Databases | 2014 | VLDB | 4.3526718e-05 |
| 9,607 | Polyglot Data Management: State of the Art & Open Challenges | 2022 | VLDB | 4.3177432e-05 |
| 9,894 | OceanRT: Real-Time Analytics over Large Temporal Data | 2014 | SIGMOD | 4.2602616e-05 |
| 10,591 | Accio: Bolt-on Query Federation | 2025 | VLDB | 4.1945683e-05 |
| 11,437 | Two-Attribute Skew Free, Isolated CP Theorem, and Massively Parallel Joins | 2021 | PODS | 4.1945683e-05 |
| 11,635 | Automated Performance Management for the Big Data Stack | 2019 | CIDR | 4.1945683e-05 |
| 11,690 | Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology | 2019 | VLDB | 4.1945683e-05 |
| 11,948 | Tutorial: SQL-on-Hadoop Systems | 2015 | VLDB | 4.1945683e-05 |
| 11,949 | Big Data Research: Will Industry Solve all the Problems? | 2015 | VLDB | 4.1945683e-05 |
| 11,987 | DGFIndex for Smart Grid: Enhancing Hive with a Cost-Effective Multidimensional Range Index | 2014 | VLDB | 4.1945683e-05 |
| 12,005 | Design and Implementation of a Real-Time Interactive Analytics System for Large Spatio-Temporal Data | 2014 | VLDB | 4.1945683e-05 |
| 12,028 | D-Hive: Data Bees Pollinating RDF, Text, and Time | 2013 | CIDR | 4.1945683e-05 |
| 12,055 | ODYS: An Approach to Building a Massively-Parallel Search Engine Using a DB-IR Tightly-Integrated Parallel DBMS for Higher-Level Functionality | 2013 | SIGMOD | 4.1945683e-05 |
| 12,203 | Resiliency-Aware Data Management | 2011 | VLDB | 4.1945683e-05 |
| 12,226 | Indexing Multi-dimensional Data in a Cloud System | 2010 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.