Cite
Notes
Only stored in your browser.
Attribution
Training Language Models to Self-Correct via Reinforcement Learning
arXiv 2024
Visual Dialog
visual-dialog-1
from 2 papers
Abhishek Das
Aleksandra Faust
Aviral Kumar
Colton Bishop
Cosmin Paduraru
Deshraj Yadav
Devi Parikh
Dhruv Batra
Disha Shrivastava
Doina Precup