Yutao Zeng
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Mixture-of-Depths Attention
arXiv 2026
UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
arXiv 2025
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
arXiv 2025
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models
arXiv 2025
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers