Cite
Notes
Only stored in your browser.
Attribution
Yuan 2.0-M32: Mixture of Experts with Attention Router
arXiv 2024
YUAN 2.0: A Large Language Model with Localized Filtering-based Attention
arXiv 2023
from 2 papers
Chao Wang
Jiangang Luo
Shaohua Wu
Tong Yu
Xi Chen
Xudong Zhao
Bing Zhao
Chong Shen
Fei Wang
Houbo He