Attribution
Tell What You Hear From What You See -- Video to Audio Generation Through Text
arXiv 2024
UniMuMo: Unified Text, Music and Motion Generation
from 2 papers
Chuang Gan
Eli Shlizerman
Gaowen Liu
Han Yang
Jiaben Chen
Kaizhi Qian
Xiulong Liu
Yutong Zhang