Attribution
Recommendations and Reporting Checklist for Rigorous & Transparent Human Baselines in Model Evaluations
arXiv 2025
Representation Engineering: A Top-Down Approach to AI Transparency
arXiv 2023
from 2 papers
Alex Mallen
Alexander Pan
Andy Zou
founder
Anka Reuel
Ann-Kathrin Dombrowski
Chinmay Deshpande
Dan Hendrycks
director
Dawn Song
professor
Evie Coxon
J. Zico Kolter