Cite
Notes
Only stored in your browser.
Attribution
AgentKit: Structured LLM Reasoning with Dynamic Graphs
arXiv 2024
Language Models can Solve Computer Tasks
NeurIPS 2023 11
Confronting Reward Model Overoptimization with Constrained RLHF
arXiv 2023
from 3 papers
Ruslan Salakhutdinov
professor
Aaditya K. Singh
Anca D. Dragan
DJ Strouse
Geunwoo Kim
Pierre Baldi
Shrimai Prabhumoye
So Yeon Min
Ted Moskovitz
Tom Mitchell