Defend Concede RL Env (Community)

GSM8K defend/concede environment for training calibrated self-assessment and sycophancy resistance

Type: RL Env
Capabilities: Math
Runtime: multi-turn
License: unknown
Size: v0.1.0
Published: Jan 2026

Lift evidence

1
Eval | Tools known to lift | Source paper
GSM8K | Defend Concede RL Env (Community) | -

Contributors

1