Chen Yang
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12WorldAct: Activating Monolithic 3D Worlds into Interactive-Ready Object-Centric Scenes
arXiv 2026
EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting
arXiv 2025
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
arXiv 2025
URO-Bench: A Comprehensive Benchmark for End-to-End Spoken Dialogue Models
arXiv 2025
Few-step Flow for 3D Generation via Marginal-Data Transport Distillation
arXiv 2025
Goal2Story: A Multi-Agent Fleet based on Privately Enabled sLLMs for Impacting Mapping on Requirements Elicitation
arXiv 2025
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
arXiv 2024
GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting
arXiv 2024
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
arXiv 2024
A Survey of Resource-efficient LLM and Multimodal Foundation Models
arXiv 2024
EndoGaussian: Real-time Gaussian Splatting for Dynamic Endoscopic Scene Reconstruction
arXiv 2024
A Survey of Large Language Models
arXiv 2023
Affiliations
Frequent co-authors
10from 12 papers