Jiashuo Yu
- Papers: 9
Authored papers (9)
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling
arXiv 2025
VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos
ICCV 2025
ExpVid: A Benchmark for Experiment Video Understanding & Reasoning
arXiv 2025
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling
arXiv 2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
arXiv 2024
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
arXiv 2023
Long-Term Rhythmic Video Soundtracker
arXiv 2023
Modality-Aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection
arXiv 2022
MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
arXiv 2021
Frequent co-authors
10 from 9 papers