Yu Wu
- Papers: 20
Authored papers
Qwen-Image-VAE-2.0 Technical Report
arXiv 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
arXiv 2025
Rethinking Query-based Transformer for Continual Image Segmentation
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
arXiv 2024
The Silent Prompt: Initial Noise as Implicit Guidance for Goal-Driven Image Generation
ICCV 2025
Towards Open Respiratory Acoustic Foundation Models: Pretraining and Benchmarking
arXiv 2024
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning
arXiv 2024
DVIS: Decoupled Video Instance Segmentation Framework
ICCV 2023
DVIS++: Improved Decoupled Framework for Universal Video Segmentation
arXiv 2023
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World
ICCV 2023
Boundary Guided Learning-Free Semantic Control with Diffusion Models
Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation
arXiv 2023
BEATs: Audio Pre-Training with Acoustic Tokenizers
arXiv 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
arXiv 2022
Quantized GAN for Complex Music Generation from Dance Videos
arXiv 2022
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training
arXiv 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
arXiv 2021
MuTual: A Dataset for Multi-Turn Dialogue Reasoning
Frequent co-authors
10 from 20 papers