Cite
Notes
Only stored in your browser.
Attribution
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
arXiv 2025
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder
VPTracker: Global Vision-Language Tracking via Visual Prompt and MLLM
from 3 papers
Jingchao Wang
Yefeng Zheng
Hong Wang
Kunhua Ji
Zhijian Wu
Kaiwen Zhou
Wenlong Zhang