Tomasz Korbak
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
arXiv 2023
Pretraining Language Models with Human Preferences
arXiv 2023
Aligning Language Models with Preferences through f-divergence Minimization
arXiv 2023
Training Language Models with Language Feedback at Scale
arXiv 2023
Compositional preference models for aligning LMs
arXiv 2023
Improving Code Generation by Training with Natural Language Feedback
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers