Cite
Notes
Only stored in your browser.
Attribution
Training Language Models to Self-Correct via Reinforcement Learning
arXiv 2024
from 1 papers
Aleksandra Faust
Avi Singh
Aviral Kumar
Cosmin Paduraru
Disha Shrivastava
Doina Precup
Feryal Behbahani
George Tucker
John D Co-Reyes
Kate Baumli