Zecheng Wang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Baichuan-M1: Pushing the Medical Capability of Large Language Models
arXiv 2025
Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers
arXiv 2025
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging
arXiv 2024
Pre-training with Synthetic Data Helps Offline Reinforcement Learning
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers