Attribution
Open-domain Implicit Format Control for Large Language Model Generation (arXiv 2024)
nanoLM: an Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales (arXiv 2023)
Masked Structural Growth for 2x Faster Language Model Pre-training
from 3 papers
Yequan Wang
Aixin Sun
Jing Li
Peng Han
Xiang Li
Xin Jiang
Xuezhi Fang
Xuying Meng
Kang Liu
Shuo Shang