Cite
Notes
Only stored in your browser.
Attribution
UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning
arXiv 2025
STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs
E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding
arXiv 2024
from 3 papers
Chang Wen Chen
Yang Wu
Ye Liu
Ying Shan
Zhongang Qi
Deli Zhao
Junfu Pu
Mingze Li
Songyou Li
Tingyang Xu