Cite
Notes
Only stored in your browser.
Attribution
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
arXiv 2024
Zyda: A 1.3T Dataset for Open Language Modeling
from 2 papers
Beren Millidge
Quentin Anthony
Adam Ibrahim
Emily Shepperd
James Whittington
Paolo Glorioso
Vasudev Shyam
Yury Tokpanov