Xinyu Chen
- Papers: 10
Authored papers
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling
arXiv 2026
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
arXiv 2025
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
arXiv 2025
VideoVista-CulturalLingo: 360° Horizons-Bridging Cultures, Languages, and Domains in Video Comprehension
arXiv 2025
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data
arXiv 2025
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE
arXiv 2025
VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Guided Iterative Policy Optimization
arXiv 2025
Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
arXiv 2024
LMEye: An Interactive Perception Network for Large Language Models
arXiv 2023
A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues
arXiv 2023