Cite
Notes
Only stored in your browser.
Attribution
Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning
arXiv 2025
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025 1
from 2 papers
James T. Kwok
Kai Chen
Lanqing Hong
Yu Zhang
Zhenguo Li
Zhili Liu
Chunwei Wang
Daxin Tan
Dingdong Wang
Dit-yan Yeung