Cite
Notes
Only stored in your browser.
Attribution
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks
arXiv 2026
AMO-Bench: Large Language Models Still Struggle in High School Math Competitions
arXiv 2025
from 2 papers
Dan Ma
Shengnan An
Shuang Zhou
Xiaoyu Li
Xuezhi Cao
Xunliang Cai
Ziwen Wang
Shixiong Luo
Wenling Yuan
Xinxuan Lv