Cite
Notes
Only stored in your browser.
Attribution
Training Language Models to Self-Correct via Reinforcement Learning
arXiv 2024
Evolving Reinforcement Learning Algorithms
evolving-reinforcement-learning-algorithms
from 2 papers
Aleksandra Faust
Avi Singh
Aviral Kumar
Colton Bishop
Cosmin Paduraru
Daiyi Peng
Disha Shrivastava
Doina Precup
Esteban Real
Feryal Behbahani