Cite
Notes
Only stored in your browser.
Attribution
A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning
arXiv 2025
from 1 papers
Hiroshi Yoshihara
Yuichi Inoue