GoodSirMath8k
Just GSM8K with the added reward based on how shakespearean the model is.
- Domain
- rl-env
- License
- unknown
- Published
- Oct 2025
Cite
Notes
Only stored in your browser.
Top score 0.0% by Claude Sonnet 4.5 - 1 model reporting (1 frontier)
Top models
1Related tools
1Implementations, trainers, datasets and scaffolds linked to this eval.
FAQ
- What is GoodSirMath8k?
- Just GSM8K with the added reward based on how shakespearean the model is.
- What is the current top score on GoodSirMath8k?
- The top reported score is 0.0% by Claude Sonnet 4.5, across 1 model reporting (1 from frontier labs).
- How can a model improve its GoodSirMath8k score?
- Tools linked to GoodSirMath8k on Sophon include Goodsirmath8k RL Env (Kunumi) - RL environments, datasets, and scaffolds that target this eval.
- What license is GoodSirMath8k under?
- GoodSirMath8k is available under unknown.