Bertie Vidgen
- Papers: 8
Authored papers
MSTS: A Multimodal Safety Test Suite for Vision-Language Models
arXiv 2025
TrustLLM: Trustworthiness in Large Language Models
arXiv 2024
Introducing v0.5 of the AI Safety Benchmark from MLCommons
arXiv 2024
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models
arXiv 2024
FinanceBench: A New Benchmark for Financial Question Answering
arXiv 2023
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
arXiv 2023
Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models
NAACL (WOAH) 2022
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection
ACL 2021
Affiliations
Frequent co-authors
10 from 8 papers