Rexbench
A benchmark that evaluates how well AI agents can extend existing AI research by implementing research experiments. Original benchmark: https://github.com/tinlaboratory/rexbench. Website: https://rexbench.com/.
- Domain: agent-eval
- Published: Nov 2025