Yuping Wang
- Papers: 12
Authored papers
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
arXiv 2025
Generative AI for Autonomous Driving: Frontiers and Opportunities
arXiv 2025
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation
arXiv 2025
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving
ICCV 2025
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization
arXiv 2025
Can Large Vision Language Models Read Maps Like a Human?
arXiv 2025
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models
arXiv 2024
CMP: Cooperative Motion Prediction with Multi-Agent Communication
arXiv 2024
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving
arXiv 2024
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
arXiv 2023
Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models
arXiv 2023
NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism
arXiv 2022
Frequent co-authors
10 from 12 papers