Cite
Notes
Only stored in your browser.
Attribution
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models
arXiv 2024
Are Vision-Language Models Truly Understanding Multi-vision Sensor?
from 2 papers
Byung-Kwan Lee
Sangyun Chung
Yong Man Ro
Se Yeon Kim
Youngchae Chee