Yushuo Guan
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning
arXiv 2026
Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers
arXiv 2026
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
arXiv 2025
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers