Cite
Notes
Only stored in your browser.
Attribution
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
arXiv 2025
CPM: A Large-scale Generative Chinese Pre-trained Language Model
arXiv 2020
from 2 papers
Maosong Sun
professor
Zhiyuan Liu
Daixuan Li
Deming Ye
Fanchao Qi
Guoyang Zeng
Hao Zhou
Haodong Wen
Haozhe Ji
Huanqi Cao