Sensible Thinker RL Env (Community)
Fresh
Encouraging reasoning model to produce more sensible thinking process by ask other model to understand and predict zeroshot answer from only reasoning text
- Type
- RL Env
- License
- unknown
- Size
- v0.1.0
- Published
- Sep 2025
Cite
Notes
Only stored in your browser.
Lift evidence
1| Eval | Tools known to lift | Source paper |
|---|---|---|
| GSM8K | Sensible Thinker RL Env (Community) | - |