Max Bartolo
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Atla Selene Mini: A General Purpose Evaluation Model
arXiv 2025
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
arXiv 2024
Introducing v0.5 of the AI Safety Benchmark from MLCommons
arXiv 2024
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models
arXiv 2024
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models
arXiv 2024
Human Feedback is not Gold Standard
arXiv 2023
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks
ACL 2022 5
DataPerf: Benchmarks for Data-Centric AI Development
dataperf-benchmarks-for-data-centric-ai
Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension
arXiv 2020
Affiliations
Frequent co-authors
10from 9 papers