Database Paper Browser

Back to papers

GoldMiner: Elastic Scaling of Training Data Pre-Processing Pipelines for Deep Learning

Summary: GoldMiner decouples data pre-processing from model training with stateless data workers that elastically pool cluster resources. By automatically extracting stateless pre-processing from pipelines, it scales across nodes, delivering up to 12.1x faster jobs and up to 2.5x better GPU utilization in large clusters. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6696
Venue
SIGMOD
Year
2023
Pagerank
5.4402488e-05
Overall Rank
5,552 | 61.38%
DOI
10.1145/3589773

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers