RealWorldQA RL Env (Community)
Fresh
RealWorldQA environment for evaluating vision-language models on real-world question answering
- Type
- RL Env
- Runtime
- single-turn
- License
- unknown
- Size
- v0.1.0
- Published
- Oct 2025
Public scores on this env
44 vf-eval reports across 4 models
1. Mistral Small 3.2 24B Instruct (Mistral AI): 80.0%
2. Gemini 2.5 Flash (Google (Alphabet Inc.)): 60.0%
3. GLM 4.5V (Zai): 40.0%
4. Gemini 2.5 Flash Image Preview (Google (Alphabet Inc.)): 40.0%