Storing Matrices on Disk: Theory and Practice Revisited
Summary: Proposes the Linearized Array B-tree (LAB-tree), an on-disk matrix storage structure whose flexible layouts adapt to sparsity patterns that vary across portions of an array and over time. Revisits B-tree insert/split strategies and flushing policies, and proposes alternatives with theoretical guarantees and strong empirical performance for scalable matrix storage. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Yi Zhang
2. Kamesh Munagala
3. Jun Yang
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681 |
| 4,259 | Optimizing I/O for Big Array Analytics | 2012 | VLDB | 6.3147285e-05 |
| 10,378 | HyperMR: Efficient Hypergraph-enhanced Matrix Storage on Compute-in-Memory Architecture | 2025 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 168 | MAD Skills: New Analysis Practices for Big Data | 2009 | VLDB | 0.00038946305 |
| 318 | Overview of SciDB: Large Scale Array Storage, Processing and Analysis | 2010 | SIGMOD | 0.00027795661 |
| 1,076 | RIOT: I/O-Efficient Numerical Computing without SQL | 2009 | CIDR | 0.00014248449 |
| 2,622 | Optimal Policy For Batch Operations: Backup, Checkpointing, Reorganization, And Updating | 1977 | SIGMOD | 8.4378381e-05 |
| 3,280 | The Case for RodentStore, an Adaptive, Declarative Storage System | 2009 | CIDR | 7.2828962e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,134 | Incremental Elasticity For Array Databases | 2014 | SIGMOD | 4.822331e-05 |
| 1,876 | ArrayStore: A Storage Manager for Complex Parallel Array Processing | 2011 | SIGMOD | 0.00010239284 |
| 11,704 | Splaying Log-Structured Merge-Trees | 2018 | SIGMOD | 4.1945683e-05 |
| 8,657 | Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices | 2022 | VLDB | 4.4730648e-05 |
| 11,822 | Anti-Persistence on Persistent Storage: History-Independent Sparse Tables and Dictionaries | 2016 | PODS | 4.1945683e-05 |
| 7,390 | Making In-Memory Learned Indexes Efficient on Disk | 2024 | SIGMOD | 4.7431654e-05 |
| 5,098 | Multi-Disk B-trees | 1991 | SIGMOD | 5.7007294e-05 |
| 1,967 | Compressed Linear Algebra for Large-Scale Machine Learning | 2016 | VLDB | 9.9131712e-05 |
| 10,368 | B-Trees Are Back: Engineering Fast and Pageable Node Layouts | 2025 | SIGMOD | 4.1945683e-05 |
| 6,191 | Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra | 2021 | SIGMOD | 5.1642282e-05 |