Cite
Notes
Only stored in your browser.
Attribution
Process Reward Models That Think
arXiv 2025
MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?
Small Language Models Need Strong Verifiers to Self-Correct Reasoning
arXiv 2024
from 3 papers
Honglak Lee
Lajanugen Logeswaran
Lu Wang
Moontae Lee
Muhammad Khalifa
Yunxiang Zhang
Grant D Murphy
Hao Peng
Rishabh Agarwal
Shitanshu Bhushan