I Can't Believe It's Not (Only) Software! Bionic Distributed Storage for Parquet Files
Summary: FPGA-based distributed storage for Parquet files delivers in-line deduplication and selective column reads at high bandwidth and low latency. Application logic implemented as a software library lets Python data-science workloads access and filter Parquet data directly on the storage node, balancing generality with efficiency. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Lucas Kuhring
- 2. Zsolt István
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,824 | BlueCache: A Scalable Distributed Flash-based Key-value Store | 2017 | VLDB | 8.0589366e-05 |
| 3,880 | Caribou: Intelligent Distributed Storage | 2017 | VLDB | 6.6700303e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,701 | Towards Functional Decomposition of Storage Formats | 2025 | CIDR | 4.3008468e-05 |
| 11,545 | Pixels: Multiversion Wide Table Store for Data Lakes | 2020 | CIDR | 4.1945683e-05 |
| 8,221 | Enabling Transparent Acceleration of Big Data Frameworks Using Heterogeneous Hardware | 2022 | VLDB | 4.5556812e-05 |
| 7,427 | Selection Pushdown in Column Stores using Bit Manipulation Instructions | 2023 | SIGMOD | 4.7327406e-05 |
| 6,279 | Self-Organizing Data Containers | 2022 | CIDR | 5.1295282e-05 |
| 3,058 | Rethinking Data-Intensive Science Using Scalable Analytics Systems | 2015 | SIGMOD | 7.6410159e-05 |
| 7,876 | Two Birds With One Stone: Designing a Hybrid Cloud Storage Engine for HTAP | 2024 | VLDB | 4.6298182e-05 |
| 8,002 | Pangea: Monolithic Distributed Storage for Data Analytics | 2019 | VLDB | 4.6088289e-05 |
| 7,907 | Petabyte-Scale Row-Level Operations in Data Lakehouses | 2024 | VLDB | 4.6205839e-05 |
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |