Oam Patel

3 papers

Authored papers

Designing a Dashboard for Transparency and Control of Conversational AI

arXiv 2024

Defending Against Unforeseen Failure Modes with Latent Adversarial Training

arXiv 2024

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

NeurIPS 2023

No known affiliations.

Co-authors from 3 papers

Fernanda Viégas

Kenneth Li

Martin Wattenberg

Aoyu Wu

Catherine Yeh

Dylan Hadfield-Menell

Hanspeter Pfister

Jan Riecke

Lennart Schulze

Nicholas Castillo Marin