Taiki Yamaguchi

Attribution

1 paper

Authored papers

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

arXiv 2025

No known affiliations.

from 1 paper

Hiroshi Yoshihara

Yuichi Inoue