Progressive Compressed Records: Taking a Byte out of Deep Learning Data
Summary: Progressive Compressed Records (PCRs) are a data format that combines progressive compression with an efficient layout, so a dataset can be viewed at multiple levels and less data is transferred during training. Training tolerates more than 50% compression, the level can be selected automatically, and data can be accessed at runtime using roughly 50% of the bandwidth, enabling speedups. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,180 | FastFlow: Accelerating Deep Learning Model Training with Smart Offloading of Input Data Pipeline | 2023 | VLDB | 6.3793352e-05 |
| 8,786 | AWARE: Workload-aware, Redundancy-exploiting Linear Algebra | 2023 | SIGMOD | 4.4521262e-05 |
| 11,224 | Homomorphic Compression: Making Text Processing on Compression Unlimited | 2023 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 131 | Integrating Compression and Execution in Column-Oriented Database Systems | 2006 | SIGMOD | 0.0004370331 |
| 1,504 | Analyzing and Mitigating Data Stalls in DNN Training | 2021 | VLDB | 0.00011642333 |
| 2,067 | HippogriffDB: Balancing I/O and GPU Bandwidth in Big Data Analytics | 2016 | VLDB | 9.6392739e-05 |
| 2,170 | tf.data: A Machine Learning Data Processing Framework | 2021 | VLDB | 9.3821603e-05 |
| 5,123 | Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-Precision Learning | 2019 | VLDB | 5.6796998e-05 |