0

CORE-Bench

Active

Evaluate how well an LLM Agent is at computationally reproducing the results of a set of scientific papers.

Domain
Coding
License
mit
Published
Feb 2025
Notable for
Benchmark for evaluating Coding.

Cite

Notes

Only stored in your browser.

FAQ

What is CORE-Bench?
Evaluate how well an LLM Agent is at computationally reproducing the results of a set of scientific papers.
What license is CORE-Bench under?
CORE-Bench is available under mit.