Llmsr Bench Full
Fresh
A comprehensive benchmark of challenging problems across four scientific domains, designed specifically to evaluate LLM-based scientific equation discovery methods while preventing trivial memorization.
- Type: RL Env
- Capabilities: Scientific Reasoning
- Runtime: ORS
- License: unknown
- Size: 240 tasks
- Published: Feb 2026