Cite
Notes
Only stored in your browser.
Attribution
Baichuan-M1: Pushing the Medical Capability of Large Language Models
arXiv 2025
Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers
Baichuan 2: Open Large-scale Language Models
arXiv 2023
from 3 papers
Bingning Wang
Da Pan
Guosheng Dong
Haizhou Zhao
Han Liu
Hongda Zhang
Lei Su
Liang Song
Mang Wang
Rihui Xin