TAGC: Optimizing Gradient Communication in Distributed Transformer Training
arXiv 2025
Alexey Dukhanov
Egor Spirin