Attribution
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models
arXiv 2023
Reframing Human-AI Collaboration for Generating Free-Text Explanations
NAACL 2022
from 2 papers
Ashutosh Baheti
Faeze Brahman
researcher
Jack Hessel
Maarten Sap
Ronan Le Bras
Sarah Wiegreffe
Swabha Swayamdipta
Ximing Lu
Yejin Choi
professor