Adina Williams
- Papers: 14
Authored papers (14)
IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic Environments
arXiv 2025
Introducing v0.5 of the AI Safety Benchmark from MLCommons
arXiv 2024
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
arXiv 2024
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models
arXiv 2024
Transformers Can Navigate Mazes With Multi-Step Prediction
arXiv 2024
Llama 2: Open Foundation and Fine-Tuned Chat Models
arXiv 2023
The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks
arXiv 2023
DataPerf: Benchmarks for Data-Centric AI Development
Perturbation Augmentation for Fairer NLP
arXiv 2022
"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset
arXiv 2022
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks
ACL 2022
A Latent-Variable Model for Intrinsic Probing
arXiv 2022
Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition
Adversarial NLI: A New Benchmark for Natural Language Understanding
Affiliations
Frequent co-authors
10 from 14 papers