0

Evaluating Model Perception of Color Illusions in Photorealistic Scenes

Vision-language models exhibit perceptual biases similar to human vision when confronted with color illusions, demonstrated through a new dataset of realistic illusion images.

Year
2024
Venue
CVPR 2025 1
Authors
3
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2412.06184ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

We study the perception of color illusions by vision-language models. Color illusion, where a person's visual system perceives color differently from actual color, is well-studied in human vision. However, it remains underexplored whether vision-language models (VLMs), trained on large-scale human data, exhibit similar perceptual biases when confronted with such color illusions. We propose an automated framework for generating color illusion images, resulting in RCID (Realistic Color Illusion Dataset), a dataset of 19,000 realistic illusion images. Our experiments show that all studied VLMs exhibit perceptual biases similar human vision. Finally, we train a model to distinguish both human perception and actual pixel differences.

Authors

3