Database Paper Browser

Back to papers

FusionFlow: Accelerating Data Preprocessing for Machine Learning with CPU-GPU Cooperation

Summary: FusionFlow runs data-augmentation on CPUs and GPUs with GPU-aware allocations inside GPU free space and adaptive scheduling to avoid interfering with model training. Yields 16–285% single-machine throughput gains and cuts CPU needs ~50–60% versus CPU-only or remote preprocessing. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13760
Venue
VLDB
Year
2024
Pagerank
4.5410024e-05
Overall Rank
8,348 | 41.93%
DOI
10.14778/3636218.3636238

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank Citing Paper Year Venue Pagerank
8,735 TensorSocket: Shared Data Loading for Deep Learning Training 2026 SIGMOD 4.456315e-05
10,770 cedar: Optimized and Unified Machine Learning Input Data Pipelines 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers