Yunsheng Wu
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding
arXiv 2026
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
arXiv 2026
Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding
arXiv 2026
Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation
arXiv 2025
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
arXiv 2025
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
arXiv 2025
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
CVPR 2025 1
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
arXiv 2024
DF40: Toward Next-Generation Deepfake Detection
arXiv 2024
Aligning and Prompting Everything All at Once for Universal Visual Perception
arXiv 2023
MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
arXiv 2023
Affiliations
Frequent co-authors
10from 11 papers