Database Paper Browser

Back to papers

Pangea: Monolithic Distributed Storage for Data Analytics

Summary: Pangea replaces multi-layered storage (HDFS, Alluxio, Spark) with a single monolithic distributed storage for both intermediate and long-lived data. It unifies buffering, data placement optimization, and failure recovery, delivering performance competitive with layered systems. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
11996
Venue
VLDB
Year
2019
Pagerank
4.6088289e-05
Overall Rank
8,002 | 44.34%
DOI
10.14778/3311880.3311885

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 6 of 6 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 16 of 16 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
21 C-Store: A Column-oriented DBMS 2005 VLDB 0.00086087497
22 SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets 2008 VLDB 0.0008456613
66 Spark SQL: Relational Data Processing in Spark 2015 SIGMOD 0.00061639801
128 An Evaluation of Buffer Management Strategies for Relational Database Systems 1985 VLDB 0.00044535268
285 Automating Physical Database Design in a Parallel Database 2002 SIGMOD 0.0002899128
286 Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design 2004 SIGMOD 0.00028990057
306 The LRU-K Page Replacement Algorithm For Database Disk Buffering 1993 SIGMOD 0.00028228982
794 Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) 2010 VLDB 0.00016605103
979 Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads 2012 VLDB 0.0001488055
1,873 An Architecture for Compiling UDF-centric Workflows 2015 VLDB 0.00010253002
2,439 CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop 2011 VLDB 8.8190594e-05
3,669 XORing Elephants: Novel Erasure Codes for Big Data 2013 VLDB 6.8584744e-05
4,061 Advanced Partitioning Techniques for Massively Distributed Computation 2012 SIGMOD 6.483587e-05
4,174 Computation Reuse in Analytics Job Service at Microsoft 2018 SIGMOD 6.3856219e-05
5,793 Lifetime-Based Memory Management for Distributed Data Processing Systems 2016 VLDB 5.3258796e-05
9,332 PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development 2018 SIGMOD 4.3556432e-05
Previous Page 1 / 1 Next

Semantically Similar Papers