Guangzhi Sun
7 papers
Authored papers
video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models
arXiv 2025
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model
arXiv 2025
ACVUBench: Audio-Centric Video Understanding Benchmark
arXiv 2025
Large language models surpass human experts in predicting neuroscience results
arXiv 2024
SALMONN: Towards Generic Hearing Abilities for Large Language Models
arXiv 2023
Can Contextual Biasing Remain Effective with Whisper and GPT-2?
arXiv 2023
Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers