Yuqian Yuan
- Papers: 11
Authored papers (11)
InstructSAM: Segment Any Instance with Any Instructions
arXiv 2026
RynnBrain: Open Embodied Foundation Models
arXiv 2026
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation
arXiv 2025
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
arXiv 2025
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
arXiv 2025
RynnVLA-002: A Unified Vision-Language-Action and World Model
arXiv 2025
RynnEC: Bringing MLLMs into Embodied World
arXiv 2025
TokenPacker: Efficient Visual Projector for Multimodal LLM
arXiv 2024
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
CVPR 2025
Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents
arXiv 2024
Osprey: Pixel Understanding with Visual Instruction Tuning
CVPR 2024
Frequent co-authors
10 from 11 papers