EdisonScientific

EdisonScientific is an org.

Type: org

Cite

Notes

Only stored in your browser.

Evals

Tools

Models

Papers

Boards

People

Tools

Ether0

The dataset used to test the ether0 scientific reasoning model.

RL EnvScientific ReasoningScience

LAB Bench

The Language Agent Biology Benchmark, or LAB-Bench, is an evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology. This is an implementation of a benchmark made by FutureHouse.

RL EnvScientific Research Assistance

BixBench

Bioinformatics Benchmark (BixBench) is a dataset comprising over 50 real-world scenarios of practical biological data analysis with nearly 300 associated open-answer questions designed to measure the ability of LLM-based agents to explore biological datasets, perform long, mul…

RL EnvScientific ReasoningScience