Atari 57

Active

57 Atari 2600 games played from raw pixels - the foundational reinforcement-learning benchmark from DeepMind's DQN era.

Domain
agentic
Format
OpenEnv
Size
57 tasks
License
GPL-2.0
Published
Jul 2012
Notable for
Benchmark for evaluating planning and image understanding in the agentic domain.

Related tools

1

Implementations, trainers, datasets and scaffolds linked to this eval.

Papers

2

FAQ

What is Atari 57?
Atari 57 is a suite of 57 Atari 2600 games played from raw pixels, the foundational reinforcement-learning benchmark from DeepMind's DQN era.
What capabilities does Atari 57 test?
Atari 57 evaluates planning and image understanding.
How can a model improve its Atari 57 score?
Sophon links tools that target this eval, including RL environments, datasets, and scaffolds. For Atari 57, these include OpenEnv Atari (ALE).
What license is Atari 57 under?
Atari 57 is available under GPL-2.0.