Py Bug Trace Level 1

Python output-prediction eval: trace subtly broken code and predict its output. Aimed at harder-than-SWE-bench reasoning.
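
As a rough illustration of what "tracing subtly broken code" can look like, here is a hypothetical item. It is not taken from the environment itself; the function name and the bug are assumptions chosen for the example. The task would be to predict the printed output without running the code.

```python
# Hypothetical example item (not from the actual environment).
# Subtle bug: the default list is created once, at function definition time,
# so every call that omits `bucket` shares the same list object.
def append_item(item, bucket=[]):
    bucket.append(item)
    return bucket


first = append_item(1)
second = append_item(2)

# Both names refer to the same shared list.
print(first, second)  # prints: [1, 2] [1, 2]
```

A reader expecting `[1] [2]` has missed that `first` and `second` alias a single list, which is the kind of detail an output-prediction task rewards catching.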

Type
RL Env
License
unknown
Version
v0.3.0
Published
May 2026

Contributors

1