Cite
Notes
Only stored in your browser.
Attribution
FEABench: Evaluating Language Models on Multiphysics Reasoning Ability
arXiv 2025
CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
arXiv 2024
from 3 papers
HAO CUI
Michael P Brenner
Nayantara Mudur
Paul Raccuglia
Peter Norgaard
Amil Merchant
Brian Rohr
Chenfei Jiang
Dan Morris
Drew Purves