Kenneth Li
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
arXiv 2024
Designing a Dashboard for Transparency and Control of Conversational AI
arXiv 2024
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
arXiv 2024
Measuring and Controlling Instruction (In)Stability in Language Model Dialogs
arXiv 2024
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
NeurIPS 2023 11
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers