Akbir Khan
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Language Models Learn to Mislead Humans via RLHF
arXiv 2024
Debating with More Persuasive LLMs Leads to More Truthful Answers
arXiv 2024
Alignment faking in large language models
arXiv 2024
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
arXiv 2023
The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs
the-goldilocks-of-pragmatic-understanding
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers