CORE-Bench
Active
Evaluate how well an LLM Agent is at computationally reproducing the results of a set of scientific papers.
- Publisher
- Princeton University
- Domain
- Coding
- License
- mit
- Published
- Feb 2025
- Notable for
- Benchmark for evaluating Coding.
Cite
Notes
Only stored in your browser.
FAQ
- What is CORE-Bench?
- Evaluate how well an LLM Agent is at computationally reproducing the results of a set of scientific papers.
- What license is CORE-Bench under?
- CORE-Bench is available under mit.