Back to papers
AnyBlox: A Framework for Self-Decoding Datasets
Summary: AnyBlox bundles lightweight WebAssembly decoders with datasets so systems can read arbitrary encodings without native format support or spec changes. Decoupling decoders from systems and format specs enables transparent format evolution, instance‑optimized encodings, and secure, high‑performance integration with DuckDB, Spark, and Umbra.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 14021
- Venue
- VLDB
- Year
- 2025
- Pagerank
- 4.258022e-05
- Overall Rank
- 9,901 | 31.13%
- DOI
-
10.14778/3749646.3749672
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 26 of 26 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 109 |
Dremel: Interactive Analysis of Web-Scale Datasets |
2010 |
VLDB |
0.00048186983 |
| 113 |
Encapsulation of Parallelism in the Volcano Query Processing System |
1990 |
SIGMOD |
0.00046764513 |
| 185 |
DuckDB: an Embeddable Analytical Database |
2019 |
SIGMOD |
0.00036538405 |
| 418 |
Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age |
2014 |
SIGMOD |
0.00023729211 |
| 527 |
Rethinking Database System Architecture: Towards a Self-tuning RISC-style Database System |
2000 |
VLDB |
0.00020868847 |
| 735 |
Umbra: A Disk-Based System with In-Memory Performance |
2020 |
CIDR |
0.00017452467 |
| 853 |
Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask |
2018 |
VLDB |
0.00015940507 |
| 1,284 |
Amazon Redshift Re-invented |
2022 |
SIGMOD |
0.00012837822 |
| 1,377 |
Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics |
2021 |
CIDR |
0.00012296941 |
| 2,134 |
How to Wring a Table Dry: Entropy Compression of Relations and Querying of Compressed Relations |
2006 |
VLDB |
9.4741038e-05 |
| 3,644 |
BtrBlocks: Efficient Columnar Compression for Data Lakes |
2023 |
SIGMOD |
6.8854928e-05 |
| 3,745 |
DeepSqueeze: Deep Semantic Compression for Tabular Data |
2020 |
SIGMOD |
6.7926132e-05 |
| 3,787 |
White-box Compression: Learning and Exploiting Compact Table Representations |
2020 |
CIDR |
6.7674374e-05 |
| 4,239 |
The Composable Data Management System Manifesto |
2023 |
VLDB |
6.3318452e-05 |
| 4,495 |
ClickHouse - Lightning Fast Analytics for Everyone |
2024 |
VLDB |
6.1410277e-05 |
| 4,507 |
ALP: Adaptive Lossless floating-Point Compression |
2023 |
SIGMOD |
6.131017e-05 |
| 4,518 |
The FastLanes Compression Layout: Decoding >100 Billion Integers per Second with Scalar Code |
2023 |
VLDB |
6.117844e-05 |
| 5,476 |
Containerized Execution of UDFs: An Experimental Evaluation |
2022 |
VLDB |
5.4866534e-05 |
| 5,640 |
AutoSteer: Learned Query Optimization for Any SQL Database |
2023 |
VLDB |
5.3933314e-05 |
| 6,340 |
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine |
2024 |
SIGMOD |
5.1051018e-05 |
| 6,402 |
BigLake: BigQuery’s Evolution toward a Multi-Cloud Lakehouse |
2024 |
SIGMOD |
5.079818e-05 |
| 6,525 |
Database Technology for the Masses: Sub-Operators as First-Class Entities |
2021 |
VLDB |
5.027205e-05 |
| 6,863 |
Declarative Sub-Operators for Universal Data Processing |
2023 |
VLDB |
4.905092e-05 |
| 8,582 |
Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Ecosystem: Can One QO Rule Them All? |
2025 |
CIDR |
4.492033e-05 |
| 9,701 |
Towards Functional Decomposition of Storage Formats |
2025 |
CIDR |
4.3008468e-05 |
| 9,702 |
Evaluating Query Languages and Systems for High-Energy Physics Data |
2022 |
VLDB |
4.3008468e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 5,563 |
AnyLog: a Grand Unification of the Internet of Things |
2020 |
CIDR |
5.4328568e-05 |
| 7,429 |
CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases |
2022 |
SIGMOD |
4.7320139e-05 |
| 4,514 |
An Empirical Evaluation of Columnar Storage Formats |
2024 |
VLDB |
6.1204636e-05 |
| 11,430 |
AnyDB: An Architecture-less DBMS for Any Workload |
2021 |
CIDR |
4.1945683e-05 |
| 6,367 |
Good to the Last Bit: Data-Driven Encoding with CodecDB |
2021 |
SIGMOD |
5.0941072e-05 |
| 12,228 |
SecureBlox: Customizable Secure Distributed Data Processing |
2010 |
SIGMOD |
4.1945683e-05 |
| 6,455 |
DuckDB-Wasm: Fast Analytical Processing for the Web |
2022 |
VLDB |
5.0569626e-05 |
| 3,644 |
BtrBlocks: Efficient Columnar Compression for Data Lakes |
2023 |
SIGMOD |
6.8854928e-05 |
| 9,201 |
F3: The Open-Source Data File Format for the Future |
2026 |
SIGMOD |
4.3743539e-05 |
| 4,870 |
Exploiting Cloud Object Storage for High-Performance Analytics |
2023 |
VLDB |
5.8613885e-05 |