Cite
Notes
Only stored in your browser.
Attribution
Training Language Models to Self-Correct via Reinforcement Learning
arXiv 2024
Structured State Space Models for In-Context Reinforcement Learning
structured-state-space-models-for-in-context
from 2 papers
Albert Gu
Aleksandra Faust
Avi Singh
Aviral Kumar
Chris Lu
Colton Bishop
Cosmin Paduraru
Disha Shrivastava
Doina Precup
Emilio Parisotto