Attribution
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
ICCV 2023
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
from 2 papers
Aoxiong Yin
Xize Cheng
Zehan Wang
Zhou Zhao
Haifeng Huang
Huangdai Liu
Rongjie Huang
Tao Jin
Wang Lin
Yang Zhao