Pangea: Monolithic Distributed Storage for Data Analytics
Summary: Pangea replaces multi-layered storage stacks (HDFS, Alluxio, Spark) with a single monolithic distributed storage system for both intermediate and long-lived data. It unifies buffering, data placement optimization, and failure recovery, delivering performance competitive with layered systems. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Jia Zou
2. Arun Iyengar
3. Chris Jermaine
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,061 | Serving Deep Learning Models with Deduplication from Relational Databases | 2022 | VLDB | 4.8463881e-05 |
| 7,168 | TimeUnion: An Efficient Architecture with Unified Data Model for Timeseries Management Systems on Hybrid Cloud Storage | 2022 | SIGMOD | 4.8121704e-05 |
| 7,476 | Lachesis: Automatic Partitioning for UDF-Centric Analytics | 2021 | VLDB | 4.7188928e-05 |
| 8,876 | MirrorKV: An Efficient Key-Value Store on Hybrid Cloud Storage with Balanced Performance of Compaction and Querying | 2023 | SIGMOD | 4.4304279e-05 |
| 10,177 | InferF: Declarative Factorization of AI/ML Inferences over Joins | 2026 | SIGMOD | 4.1945683e-05 |
| 10,499 | Privacy and Accuracy-Aware AI/ML Model Deduplication | 2025 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 16 of 16 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.