Banking Mrm

Frontier

Minimal RL and evaluation environment for banking model risk management (MRM), grounded in SR 11‑7 and the OCC Model Risk Management Handbook.

Domain
rl-env
License
unknown
Published
Sep 2025

Attribution

Leaderboard scores
prime-hub

Top score 1.64 by GPT-4.1 - 5 models reporting (5 frontier)

Score history

[Chart: score (axis 0 to 2) by month, Apr 25 to Aug 25; series: GPT-4.1 Mini, GPT-4.1]

Top models

[Bar chart: 5 models. Highest value: GPT-4.1 at 1.6.]

Related tools

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Banking Mrm?
Minimal RL and evaluation environment for banking model risk management (MRM), grounded in SR 11‑7 and the OCC Model Risk Management Handbook.
What is the current top score on Banking Mrm?
The top reported score is 1.64 by GPT-4.1, across 5 models reporting (5 from frontier labs).
How can a model improve its Banking Mrm score?
Tools linked to Banking Mrm on Sophon include Banking MRM RL Env (Community). Linked tools are RL environments, datasets, and scaffolds that target this eval.
What license is Banking Mrm under?
The license for Banking Mrm is listed as unknown.