0

Sensible Thinker RL Env (Community)

Fresh

Encouraging reasoning model to produce more sensible thinking process by ask other model to understand and predict zeroshot answer from only reasoning text

Type
RL Env
License
unknown
Size
v0.1.0
Published
Sep 2025

Cite

Notes

Only stored in your browser.

Lift evidence

1
EvalTools known to liftSource paper
GSM8KSensible Thinker RL Env (Community)-

Contributors

1