Cite
Notes
Only stored in your browser.
Attribution
PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles
arXiv 2025
FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains
arXiv 2023
DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents
from 3 papers
Arman Cohan
Yilun Zhao
Yitao Long
Chen Zhao
Rui Zhang
Dennis Shasha
Jingchen Sun
Linyong Nan
Lyuhao Chen
Ryo Kamoi