Cite
Notes
Only stored in your browser.
Attribution
What Makes a Reward Model a Good Teacher? An Optimization Perspective
arXiv 2025
from 1 papers
Jason D. Lee
Noam Razin
Sanjeev Arora
professor
Stanley Wei
Zixuan Wang