Xuanyu Zhang
- Papers: 8
Authored papers (8)
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
arXiv 2025
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data
arXiv 2025
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE
arXiv 2025
Q-Insight: Understanding Image Quality via Visual Reinforcement Learning
arXiv 2025
Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling
arXiv 2024
Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training
arXiv 2024
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
arXiv 2024
Recurrent Drafter for Fast Speculative Decoding in Large Language Models
arXiv 2024
Frequent co-authors
10 from 8 papers