Cite
Notes
Only stored in your browser.
Attribution
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
arXiv 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
arXiv 2024
from 2 papers
Jifeng Dai
Muyan Zhong
Xizhou Zhu
Zhe Chen
Bin Wang
Carson Chen
Chenxia Han
Chenxin Tao
Chenyu Yang
Guanzheng Chen