Database Paper Browser

Back to papers

GraphAr: An Efficient Storage Scheme for Graph Data in Data Lakes

Summary: GraphAr is a Parquet-compatible storage layout that captures LPG semantics and reorganizes/encodes vertices, edges, labels and properties to enable graph-native access patterns (neighbor retrieval, label filtering) inside data lakes. It yields massive speedups versus vanilla Parquet/Acero (e.g., 4452× neighbor, 14.8× label filtering, 29.5× end-to-end). (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
14141
Venue
VLDB
Year
2025
Pagerank
4.1945683e-05
Overall Rank
10,803 | 24.85%
DOI
10.14778/3712221.3712223

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank Citing Paper Year Venue Pagerank
10,672 Sectric: Towards Accurate, Privacy-preserving and Efficient Triangle Counting 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 21 of 21 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
167 The Snowflake Elastic Data Warehouse 2016 SIGMOD 0.00039180521
185 DuckDB: an Embeddable Analytical Database 2019 SIGMOD 0.00036538405
305 SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units 2009 VLDB 0.00028248614
536 The LDBC Social Network Benchmark: Interactive Workload 2015 SIGMOD 0.00020722862
746 Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores 2020 VLDB 0.00017326979
789 Cypher: An Evolving Query Language for Property Graphs 2018 SIGMOD 0.00016634256
939 Data Lake Management: Challenges and Opportunities 2019 VLDB 0.00015187344
958 Rethinking SIMD Vectorization for In-Memory Databases 2015 SIGMOD 0.00015045316
1,377 Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics 2021 CIDR 0.00012296941
1,785 PG-Schema: Schemas for Property Graphs 2023 SIGMOD 0.00010560236
2,127 SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures 2014 VLDB 9.4863172e-05
2,473 Photon: A Fast Query Engine for Lakehouse Systems 2022 SIGMOD 8.7237281e-05
2,505 Graph Pattern Matching in GQL and SQL/PGQ 2022 SIGMOD 8.634551e-05
3,038 Azure Data Lake Store: A Hyperscale Distributed File Service for Big Data Analytics 2017 SIGMOD 7.6717218e-05
3,287 GraphScope: A Unified Engine For Big Graph Processing 2021 VLDB 7.2739447e-05
3,668 The LDBC Social Network Benchmark: Business Intelligence Workload 2023 VLDB 6.8591612e-05
3,988 All-in-One: Graph Processing in RDBMSs Revisited 2017 SIGMOD 6.5589605e-05
4,012 Columnar Storage and List-based Processing for Graph Database Management Systems 2021 VLDB 6.5335884e-05
4,667 FlexPushdownDB: Hybrid Pushdown and Caching in a Cloud DBMS 2021 VLDB 6.0116919e-05
6,639 Modern Techniques for Querying Graph-Structured Relations: Foundations, System Implementations, and Open Challenges 2022 VLDB 4.9801324e-05
7,427 Selection Pushdown in Column Stores using Bit Manipulation Instructions 2023 SIGMOD 4.7327406e-05
Previous Page 1 / 1 Next

Semantically Similar Papers