ReplicationBench

ReplicationBench is a benchmark for evaluating AI agents on reproducing computational results from astrophysics research papers. It is adapted from Christine8888/replicationbench-release.

Domain
agent-eval
Published
Nov 2025

