GoodSirMath8k

GSM8K with an added reward based on how Shakespearean the model's responses are.
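The listing does not say how the two rewards are computed or combined. As a rough sketch only, one way such an environment could work is to add a weighted style score to a GSM8K-style correctness check. Everything below is an assumption for illustration: the function names, the `ARCHAIC_WORDS` list, the keyword-counting heuristic, and the `style_weight` parameter are hypothetical and not taken from the actual environment, which may well use a different scorer (for example, a model-based judge).

```python
import re

# Hypothetical marker words; the real environment's style scorer is not documented.
ARCHAIC_WORDS = {"thou", "thee", "thy", "thine", "hath", "doth",
                 "art", "ere", "prithee", "forsooth"}


def extract_final_number(text):
    """Return the last number found in the text as a float, or None."""
    matches = re.findall(r"-?\d[\d,]*(?:\.\d+)?", text)
    if not matches:
        return None
    # Drop thousands separators (and any trailing comma) before parsing.
    return float(matches[-1].replace(",", ""))


def correctness_reward(completion, reference_answer):
    """1.0 if the last number in the completion equals the reference, else 0.0."""
    predicted = extract_final_number(completion)
    expected = extract_final_number(reference_answer)
    if predicted is None or expected is None:
        return 0.0
    return 1.0 if abs(predicted - expected) < 1e-6 else 0.0


def style_reward(completion, saturation=3):
    """Assumed heuristic: count archaic words, capped at 1.0 after `saturation` hits."""
    tokens = re.findall(r"[a-z']+", completion.lower())
    hits = sum(1 for token in tokens if token in ARCHAIC_WORDS)
    return min(1.0, hits / saturation)


def combined_reward(completion, reference_answer, style_weight=0.5):
    """Correctness plus a weighted style bonus (the weighting is an assumption)."""
    return (correctness_reward(completion, reference_answer)
            + style_weight * style_reward(completion))


if __name__ == "__main__":
    plain = "The answer is 18."
    bard = "Prithee, thou hast 18 ducats, as the sum doth show."
    print(combined_reward(plain, "18"))
    print(combined_reward(bard, "18"))
```

With an additive design like this, a correct but plainly worded answer still earns the full correctness reward, and the style term only adds a bonus on top. A multiplicative combination would instead require both correctness and style to score at all.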

Domain: rl-env
License: unknown
Published: Oct 2025

Attribution

Leaderboard scores: prime-hub

Top score: 0.0% by Claude Sonnet 4.5 (1 model reporting, 1 from frontier labs)

Top models (1 model)

[Bar chart: GoodSirMath8k, 1 bar. Highest value: Claude Sonnet 4.5 at 0.]

Related tools (1)

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is GoodSirMath8k?
GSM8K with an added reward based on how Shakespearean the model's responses are.
What is the current top score on GoodSirMath8k?
The top reported score is 0.0% by Claude Sonnet 4.5, across 1 model reporting (1 from frontier labs).
How can a model improve its GoodSirMath8k score?
Tools linked to GoodSirMath8k on Sophon include Goodsirmath8k RL Env (Kunumi). Linked tools are RL environments, datasets, and scaffolds that target this eval.
What license is GoodSirMath8k under?
The license for GoodSirMath8k is unknown.