0

math

Slug
math
Evals
12
Tools
58
Models
495
Papers
8

Evals testing this capability

12
View all

Tools lifting evals here

58
View all

Top models on this capability

495

by avg parsed score across evals here

mathBar chart with 21 bars. Highest value: R1 1776 at 95.4.
21 models

Papers in this area

8