Zuyan Liu
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents
arXiv 2026
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning
arXiv 2026
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
arXiv 2025
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
ICCV 2025
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark
arXiv 2025
Ola: Pushing the Frontiers of Omni-Modal Language Model
arXiv 2025
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
arXiv 2024
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
CVPR 2025 1
Efficient Inference of Vision Instruction-Following Models with Elastic Cache
arXiv 2024
Unleashing Text-to-Image Diffusion Models for Visual Perception
unleashing-text-to-image-diffusion-models-for
Affiliations
Frequent co-authors
10from 10 papers