0

Androidworld RL Env (Prime Community)

Fresh

AndroidWorld benchmark for evaluating autonomous agents on real Android apps with 116 tasks across 20 apps

Type
RL Env
Runtime
multi-turn
License
unknown
Size
v0.1.0
Published
Mar 2026

Cite

Notes

Only stored in your browser.

Public scores on this env

1

1 vf-eval report across 1 model

Open the scoring view →

Lift evidence

1
EvalTools known to liftSource paper
MiniWoB++Androidworld RL Env (Prime Community)-