Cite
Notes
Only stored in your browser.
Attribution
Reasoning-Augmented Representations for Multimodal Retrieval
arXiv 2026
See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models
arXiv 2025
UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios
from 3 papers
Yong Jae Lee
Le Thien Phuc Nguyen
Zhuoran Yu
Anirudh Sundara Rajan
Brandon Han
Jeongik Lee
Jianrui Zhang
JuWan Maeng
Khoa Quang Nhat Cao
Lucas Poon