Attribution
Muon is Scalable for LLM Training
arXiv 2025
MoBA: Mixture of Block Attention for Long-Context LLMs
Kimi-VL Technical Report
Kimi k1.5: Scaling Reinforcement Learning with LLMs
from 4 papers
Enzhe Lu
Guokun Lai
Huabin Zheng
Jianlin Su
Junjie Yan
Shaowei Liu
Weiran He
Xinyu Zhou
Yanru Chen
Yulun Du