Re-evaluating the Memory-balanced Pipeline Parallelism: BPipe
Pipeline parallelism is an essential technique in the training of large-scale Transformer models. However, it suffers from imbalanced memory consumption, leading to insufficient memory utilization.
- Year
- 2024
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.