Attribution
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences (arXiv 2025)
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference (arXiv 2024)
from 2 papers
Jeff Rasley
Samyam Rajbhandari
Yuxiong He
Ammar Ahmad Awan
Arash Bakhtiari
Aurick Qiao
Connor Holmes
Heyang Qin
Lev Kurilenko
Masahiro Tanaka