OAB Bench RL Env (Kunumi)
Fresh
Benchmark made to evaluate llms in the Brazilian Bar Examination, using a multi-judge system.
- Type
- RL Env
- Publisher
- Kunumi
- Runtime
single-turn- License
- unknown
- Size
- v0.1.0
- Published
- Sep 2025
Cite
Notes
Only stored in your browser.
Public scores on this env
11 vf-eval report across 1 model