Wenqiao Zhang
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10InstructSAM: Segment Any Instance with Any Instructions
arXiv 2026
RynnBrain: Open Embodied Foundation Models
arXiv 2026
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation
arXiv 2025
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
arXiv 2025
MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models
arXiv 2025
Think Twice, Click Once: Enhancing GUI Grounding via Fast and Slow Systems
arXiv 2025
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
CVPR 2025 1
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
arXiv 2024
Align$^2$LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation
arXiv 2024
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
arXiv 2023
Affiliations
Frequent co-authors
10from 10 papers