FastFlow: Accelerating Deep Learning Model Training with Smart Offloading of Input Data Pipeline
Summary: FastFlow automatically mitigates CPU-side input-pipeline bottlenecks by smartly offloading preprocessing to remote CPUs and jointly leveraging local+remote resources to maximize GPU utilization. Integrated into TensorFlow, its performance-driven offloading policy yields 1–4.5× throughput gains vs TensorFlow/tf.data.service and up to 2.06× vs DALI. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Taegeon Um
- 2. Byungsoo Oh
- 3. Byeongchan Seo
- 4. Minhyeok Kweun
- 5. Goeun Kim
- 6. Woo-Yeon Lee
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,504 | Analyzing and Mitigating Data Stalls in DNN Training | 2021 | VLDB | 0.00011642333 |
| 2,170 | tf.data: A Machine Learning Data Processing Framework | 2021 | VLDB | 9.3821603e-05 |
| 3,698 | Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines | 2022 | SIGMOD | 6.8340435e-05 |
| 6,057 | Progressive Compressed Records: Taking a Byte out of Deep Learning Data | 2021 | VLDB | 5.2317752e-05 |
Previous
Page 1 / 1
Next