Attribution
World Action Models: The Next Frontier in Embodied AI
arXiv 2026
MOVA: Towards Scalable and Synchronized Video-Audio Generation
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems
arXiv 2025
from 3 papers
Xipeng Qiu
Zhaoye Fei
Qinyuan Cheng
ShiMin Li
Cheng Chang
Chenhui Li
Chunguo Li
Chushu Zhou
Dong Zhang
Donghua Yu