Attribution
CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning
arXiv 2025
FEABench: Evaluating Language Models on Multiphysics Reasoning Ability
from 2 papers
Hao Cui
Michael P Brenner
Paul Raccuglia
Peter Norgaard
Subhashini Venugopalan
Amil Merchant
Brian Rohr
Chenfei Jiang
Dan Morris
Drew Purves