GraphAr: An Efficient Storage Scheme for Graph Data in Data Lakes
Summary: GraphAr is a Parquet-compatible storage layout that captures LPG semantics and reorganizes/encodes vertices, edges, labels and properties to enable graph-native access patterns (neighbor retrieval, label filtering) inside data lakes. It yields massive speedups versus vanilla Parquet/Acero (e.g., 4452× neighbor, 14.8× label filtering, 29.5× end-to-end). (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Xue Li
- 2. Weibin Zeng
- 3. Zhibin Wang
- 4. Diwen Zhu
- 5. Jingbo Xu
- 6. Wenyuan Yu
- 7. Jingren Zhou
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,672 | Sectric: Towards Accurate, Privacy-preserving and Efficient Triangle Counting | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 21 of 21 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,004 | Efficient Main Memory Data Management Using the DBGraph Storage Model | 1990 | VLDB | 5.76478e-05 |
| 5,338 | Fast In-Memory SQL Analytics on Typed Graphs | 2017 | VLDB | 5.5629772e-05 |
| 8,917 | Data Lakes Empowered by Knowledge Graph Technologies | 2021 | SIGMOD | 4.427232e-05 |
| 8,396 | Optimizing Declarative Graph Queries at Large Scale | 2019 | SIGMOD | 4.5276541e-05 |
| 10,731 | GraphCSR: A Degree-Equalized CSR Format for Large-scale Graph Processing | 2025 | VLDB | 4.1945683e-05 |
| 7,907 | Petabyte-Scale Row-Level Operations in Data Lakehouses | 2024 | VLDB | 4.6205839e-05 |
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 4,012 | Columnar Storage and List-based Processing for Graph Database Management Systems | 2021 | VLDB | 6.5335884e-05 |
| 7,694 | LSMGraph: A High-Performance Dynamic Graph Storage System with Multi-Level CSR | 2024 | SIGMOD | 4.6757592e-05 |
| 2,130 | SQLGraph: An Efficient Relational-Based Property Graph Store | 2015 | SIGMOD | 9.4798556e-05 |