mohamed313

Role: RL env contributor

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

2tool contribs

Tool contributions

Python Coding Eval

HumanEval Python code generation. Model completes a function, run against unit tests in an isolated subprocess. Pass/fail reward. Lightweight baseline

RL Env

Python Coding Pro

HumanEval Python code-gen with sandboxed unit-test execution, timeouts, import-safety, partial credit, and a syntactic-validity metric. For RL & eval.

RL Env

Affiliations

No known affiliations.