F3: The Open-Source Data File Format for the Future
Summary: F3: a self‑describing columnar format that embeds metadata and portable WebAssembly decoders in each file for platform‑agnostic, forward‑compatible decoding. Offers an extensible storage layout and API for adding encodings, avoiding repeated format rewrites; evaluated against Parquet/ORC with low Wasm overhead. (summarized by gpt-5-mini on Feb 11 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Xinyu Zeng
- 2. Ruijun Meng
- 3. Martin Prammer
- 4. Wes McKinney
- 5. Jignesh M. Patel
- 6. Andrew Pavlo
- 7. Huanchen Zhang
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,248 | Active Data Lakes: Regaining Physical Data Independence Without Losing Interoperability | 2026 | VLDB | 4.1945683e-05 |
| 10,854 | LiquidCache: Efficient Pushdown Caching for Cloud-Native Data Analytics | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 25 of 25 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,644 | BtrBlocks: Efficient Columnar Compression for Data Lakes | 2023 | SIGMOD | 6.8854928e-05 |
| 10,714 | Towards Designing Future-Proof Data Processing Systems | 2025 | VLDB | 4.1945683e-05 |
| 9,701 | Towards Functional Decomposition of Storage Formats | 2025 | CIDR | 4.3008468e-05 |
| 6,279 | Self-Organizing Data Containers | 2022 | CIDR | 5.1295282e-05 |
| 9,901 | AnyBlox: A Framework for Self-Decoding Datasets | 2025 | VLDB | 4.258022e-05 |
| 11,679 | I Can't Believe It's Not (Only) Software! Bionic Distributed Storage for Parquet Files | 2019 | VLDB | 4.1945683e-05 |
| 6,666 | Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats | 2021 | VLDB | 4.9691571e-05 |
| 5,562 | A Deep Dive into Common Open Formats for Analytical DBMSs | 2023 | VLDB | 5.4331334e-05 |
| 9,645 | The FastLanes File Format | 2025 | VLDB | 4.3109001e-05 |
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |