0

Fh Aviary

Saturated

Future House Aviary wrapper for verifiers - Scientific reasoning environments with tools

Domain
rl-env
License
mit
Published
Mar 2026

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 100.0% by GPT-4.1 Mini - 3 models reporting (3 frontier)

Score history

3
0%25%50%75%100%Mar 23Sep 23Mar 24Sep 24Mar 25GPT-4GPT-4.1 Mini

Top models

3
Fh AviaryBar chart with 3 bars. Highest value: GPT-4.1 Mini at 100.
3 models

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Fh Aviary?
Future House Aviary wrapper for verifiers - Scientific reasoning environments with tools
What is the current top score on Fh Aviary?
The top reported score is 100.0% by GPT-4.1 Mini, across 3 models reporting (3 from frontier labs).
How can a model improve its Fh Aviary score?
Tools linked to Fh Aviary on Sophon include FH Aviary RL Env (Prime Community) - RL environments, datasets, and scaffolds that target this eval.
What license is Fh Aviary under?
Fh Aviary is available under mit.