Cite
Notes
Only stored in your browser.
Attribution
Farseer: A Refined Scaling Law in Large Language Models
arXiv 2025
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining
from 2 papers
Daxin Jiang
founder
Houyi Li
Qiufeng Wang
Shuigeng Zhou
Wenzhen Zheng
Xiangyu Zhang
Zili Wang
Hanshan Zhang
Haoying Wang
Ning Ding
researcher