Dieuwke Hupkes
5 papers
Authored papers
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
arXiv 2025
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
arXiv 2025
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
arXiv 2024
The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks
arXiv 2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
TMLR
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers
Nicholas Roberts
Aarohi Srivastava (researcher)
Abhinav Rastogi (researcher)
Abhishek Rao
Abu Awal Md Shoeb
Abubakar Abid
Adam Fisch
Adam R. Brown
Adam Santoro
Adina Williams