Cite
Notes
Only stored in your browser.
Attribution
FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights
arXiv 2026
What do Language Models Learn and When? The Implicit Curriculum Hypothesis
The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks
arXiv 2023
from 3 papers
Adina Williams
Claire Cardie
professor
Dieuwke Hupkes
Emmy Liu
Eric P. Xing
Fan Bai
Graham Neubig
Isabelle Lee
Jen-tse Huang
Jieyuan Liu