Cite
Notes
Only stored in your browser.
Attribution
Training Language Models to Self-Correct via Reinforcement Learning
arXiv 2024
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
arXiv 2020
Soft Actor-Critic Algorithms and Applications
arXiv 2018
from 3 papers
Aviral Kumar
Sergey Levine
professor
Abhishek Gupta
Aleksandra Faust
Aurick Zhou
Avi Singh
Colton Bishop
Cosmin Paduraru
Disha Shrivastava
Doina Precup