Hannah Rose Kirk
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Clinical knowledge in LLMs does not translate to human interactions
arXiv 2025
Multilingual != Multicultural: Evaluating Gaps Between Multilingual Capabilities and Cultural Alignment in LLMs
arXiv 2025
Introducing v0.5 of the AI Safety Benchmark from MLCommons
arXiv 2024
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models
arXiv 2024
LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages
arXiv 2024
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
arXiv 2024
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
arXiv 2023
Indian-BhED: A Dataset for Measuring India-Centric Biases in Large Language Models
arXiv 2023
Assessing Language Model Deployment with Risk Cards
arXiv 2023
DataPerf: Benchmarks for Data-Centric AI Development
dataperf-benchmarks-for-data-centric-ai
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate
NAACL 2022 7
Affiliations
Frequent co-authors
10from 11 papers