Aoxiong Yin
4 papers
Authored papers
Kimi-Audio Technical Report
arXiv 2025
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
ICCV 2025
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
ICCV 2023 1
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
ICCV 2023 1
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers