Cite
Notes
Only stored in your browser.
Attribution
Training Language Models to Self-Correct via Reinforcement Learning
arXiv 2024
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
arXiv 2022
Evolving Reinforcement Learning Algorithms
evolving-reinforcement-learning-algorithms
from 3 papers
John D Co-Reyes
Abdus Salam Azad
Avi Singh
Aviral Kumar
Colton Bishop
Cosmin Paduraru
Daiyi Peng
Disha Shrivastava
Doina Precup
Esteban Real