Cite
Notes
Only stored in your browser.
Attribution
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory
arXiv 2025
from 1 papers
Bing Xu
Conghui Zhu
Hailong Cao
Hongli Zhou
Hui Huang
Huicheng Wang
Jian Dong
Kehai Chen
Lvyuan Han
Muyun Yang