Dirk Hovy
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance
arXiv 2025
MSTS: A Multimodal Safety Test Suite for Vision-Language Models
arXiv 2025
Can Reasoning Help Large Language Models Capture Human Annotator Disagreement?
arXiv 2025
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
arXiv 2024
"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
arXiv 2024
DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods
arXiv 2024
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
arXiv 2023
Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features
arXiv 2023
Towards Human-Level Text Coding with LLMs: The Case of Fatherhood Roles in Public Policy Documents
arXiv 2023
Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists
Findings (ACL) 2022 5
Affiliations
Frequent co-authors
10from 10 papers
Paul Röttger
Giuseppe Attanasio
Bertie Vidgen
Donya Rooein
Elena Baralis
Hannah Rose Kirk
Lorenzo Lupo
Musashi Hinck
Valentin Hofmann
Valentina Pyatkin
research-scientist