Cite
Notes
Only stored in your browser.
Attribution
Fantastic Bugs and Where to Find Them in AI Benchmarks
arXiv 2025
Reliable and Efficient Amortized Model-based Evaluation
from 2 papers
Sang Truong
Sanmi Koyejo
professor
Anka Reuel
Ben Domingue
Bo Li
Chibuike Uwakwe
Jirayu Burapacheep
Jonathan Perera
Michael Hardy
Nick Haber