Database Paper Browser

Back to papers

MiCS: Near-linear Scaling for Training Gigantic Model on Public Cloud

Summary: MiCS minimizes communication by shrinking collective participant sets to exploit heterogeneous cloud bandwidth, avoid slow links, and amortize global gradient synchronization. On AWS it achieves up to 2.89x throughput vs prior systems and near-linear weak scaling (99.4% at 512 GPUs for a 100B model), improving GPU utilization over DeepSpeed in constrained public-cloud networks. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13199
Venue
VLDB
Year
2023
Pagerank
8.9766205e-05
Overall Rank
2,352 | 83.64%
DOI
10.14778/3561261.3561265

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 0 of 0 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Semantically Similar Papers