Cite
Notes
Only stored in your browser.
Attribution
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
arXiv 2025
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples
arXiv 2024
from 2 papers
Adam Khoja
Alice Gatti
researcher
Arunim Agarwal
Brad Kenstler
Cristina Menghini
Dan Hendrycks
director
Dean Lee
Eduardo Trevino
Isabelle Barrass
Mantas Mazeika