Defend Concede RL Env (Community)
Fresh
GSM8K defend/concede environment for training calibrated self-assessment and sycophancy resistance
- Type
- RL Env
- Capabilities
- Math
- Runtime
multi-turn- License
- unknown
- Size
- v0.1.0
- Published
- Jan 2026
Cite
Notes
Only stored in your browser.
Lift evidence
1| Eval | Tools known to lift | Source paper |
|---|---|---|
| GSM8K | Defend Concede RL Env (Community) | - |