Attribution
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel (arXiv 2023)
PyTorch Distributed: Experiences on Accelerating Data Parallel Training (arXiv 2020)
from 2 papers
Rohan Varma
Shen Li
Yanli Zhao
Adam Paszke
Ajit Mathews
Alban Desmaison
Andrew Gu
Bernard Nguyen
Brian Vaughan
Can Balioglu