Attribution
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference
arXiv 2024
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention
arXiv 2023
from 2 papers
Heyang Qin
Samyam Rajbhandari
Yuxiong He
Arash Bakhtiari
Conglong Li
Connor Holmes
Jeff Rasley
Lev Kurilenko
Masahiro Tanaka
Michael Wyatt