Attribution
OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning
arXiv 2025
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
arXiv 2023
from 3 papers
Ambrish Dantrey
An-Chieh Cheng
Andrew Tao
Arushi Goel
Bryan Catanzaro
researcher
Chao-Han Huck Yang
Chyi-Jiunn Lin
Daguang Xu
Danial Mohseni Taheri
Dong Yang