Self-Organizing Data Containers
Summary: Self-Organizing Data Containers (SDCs): a metadata-rich cloud storage format that instance-optimizes data representation to exploit actual data distributions and workloads, promising large speedups over static formats like Arrow/Parquet. Unifies partitioning, replication, indexing and materialization while capturing access histories and distributional statistics; presents a preliminary design, motivating experiments, and open optimization/systems challenges. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Samuel Madden
- 2. Jialin Ding
- 3. Tim Kraska
- 4. Sivaprasad Sudhir
- 5. David Cohen
- 6. Timothy Mattson
- 7. Nesime Tatbul
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 5,318 | Analyzing and Comparing Lakehouse Storage Systems | 2023 | CIDR | 5.5715872e-05 |
| 5,562 | A Deep Dive into Common Open Formats for Analytical DBMSs | 2023 | VLDB | 5.4331334e-05 |
| 6,466 | Pando: Enhanced Data Skipping with Logical Data Partitioning | 2023 | VLDB | 5.0528281e-05 |
| 9,760 | Adaptive data transformations for QaaS | 2025 | CIDR | 4.2856106e-05 |
| 9,806 | The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage Format | 2024 | SIGMOD | 4.2805224e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,164 | Cloud Resource Orchestration: A Data-Centric Approach | 2011 | CIDR | 4.1945683e-05 |
| 11,067 | Partition, Don’t Sort! Compression Boosters for Cloud Data Ingestion Pipelines | 2024 | VLDB | 4.1945683e-05 |
| 3,779 | Instance-Optimized Data Layouts for Cloud Analytics Workloads | 2021 | SIGMOD | 6.7747205e-05 |
| 7,557 | Invisible Glue: Scalable Self-Tuning Multi-Stores | 2015 | CIDR | 4.7112819e-05 |
| 8,366 | WWHow! Freeing Data Storage from Cages | 2013 | CIDR | 4.5357016e-05 |
| 6,104 | Automating Distributed Tiered Storage Management in Cluster Computing | 2020 | VLDB | 5.2080102e-05 |
| 6,456 | From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems | 2019 | SIGMOD | 5.0564619e-05 |
| 2,261 | Towards Elastic Transactional Cloud Storage with Range Query Support | 2010 | VLDB | 9.1629995e-05 |
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 9,701 | Towards Functional Decomposition of Storage Formats | 2025 | CIDR | 4.3008468e-05 |