Alex Mallen
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Balancing Label Quantity and Quality for Scalable Elicitation
arXiv 2024
Representation Engineering: A Top-Down Approach to AI Transparency
arXiv 2023
Eliciting Latent Knowledge from Quirky Language Models
arXiv 2023
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers
Nora Belrose
Akari Asai
Alexander Pan
Andy Zou
founder
Ann-Kathrin Dombrowski
Dan Hendrycks
director
Daniel Khashabi
Dawn Song
professor
Hannaneh Hajishirzi
professor
J. Zico Kolter