Yupeng Zhang

3 papers

Authored papers

Baichuan-M1: Pushing the Medical Capability of Large Language Models

arXiv 2025

Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers

arXiv 2025

Baichuan 2: Open Large-scale Language Models

arXiv 2023

No known affiliations.

Co-authors (from 3 papers)

Bingning Wang

Da Pan

Guosheng Dong

Haizhou Zhao

Han Liu

Hongda Zhang

Lei Su

Liang Song

Mang Wang

Rihui Xin