Cite
Notes
Only stored in your browser.
Attribution
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
arXiv 2026
Farseer: A Refined Scaling Law in Large Language Models
arXiv 2025
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining
from 3 papers
Daxin Jiang
founder
Xiangyu Zhang
Zili Wang
Hanshan Zhang
Houyi Li
Qiufeng Wang
Shijie Xuyang
Shuigeng Zhou
Yuantao Fan
Ailin Huang