Attribution
BabyVision: Visual Reasoning Beyond Language
arXiv 2026
$OneMillion-Bench: How Far are Language Agents from Human Experts?
xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations
arXiv 2025
from 3 papers
Kaiyuan Chen
Xiaobo Hu
Yang Liu
Yuan Gong
Fangfu Liu
Baobao Chang
Chen Sun
Chun Zhang
Gang Yao
Ge Zhang
researcher