BigCodeBench Hard Complete

A Harbor adapter for the BigCodeBench-Hard complete benchmark: challenging Python programming tasks with reward-based verification.
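
To make "reward-based verification" concrete, the following is a minimal sketch of one way a verifier could turn unit-test results into a scalar reward. It is not Harbor's or BigCodeBench's actual API: the function name `compute_reward`, the all-or-nothing 1.0/0.0 scoring, and the in-process `exec` are assumptions made purely for illustration.

```python
import unittest


def compute_reward(solution_code: str, test_code: str) -> float:
    """Hypothetical helper: 1.0 if every unittest passes, otherwise 0.0."""
    namespace: dict = {}
    try:
        # Define the candidate solution, then the tests, in one shared namespace.
        exec(solution_code, namespace)
        exec(test_code, namespace)
    except Exception:
        return 0.0  # code that fails to load earns no reward

    # Collect every TestCase subclass that the test code defined.
    loader = unittest.defaultTestLoader
    suite = unittest.TestSuite()
    for obj in list(namespace.values()):
        if (
            isinstance(obj, type)
            and issubclass(obj, unittest.TestCase)
            and obj is not unittest.TestCase
        ):
            suite.addTests(loader.loadTestsFromTestCase(obj))

    if suite.countTestCases() == 0:
        return 0.0  # nothing to verify (an assumption of this sketch)

    result = unittest.TestResult()
    suite.run(result)
    return 1.0 if result.wasSuccessful() else 0.0


# Illustrative task: the tests expect a function named `add`.
TESTS = (
    "import unittest\n"
    "class TestAdd(unittest.TestCase):\n"
    "    def test_add(self):\n"
    "        self.assertEqual(add(2, 3), 5)\n"
)
GOOD = "def add(a, b):\n    return a + b\n"
BAD = "def add(a, b):\n    return a - b\n"

print(compute_reward(GOOD, TESTS))
print(compute_reward(BAD, TESTS))
```

Because `exec` runs the candidate code inside the verifier's own process, a sketch like this is only suitable for trusted inputs; an evaluation harness would normally run solutions in an isolated environment.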

Domain
agent-eval
Published
Nov 2025
