Cite
Notes
Only stored in your browser.
Attribution
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
arXiv 2025
Visually Interpretable Subtask Reasoning for Visual Question Answering
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities
arXiv 2024
from 3 papers
Bryan Catanzaro
researcher
Rafael Valle
Ambrish Dantrey
An-Chieh Cheng
Andrew Tao
Chao-Han Huck Yang
Daguang Xu
Danial Mohseni Taheri
Dong Yang
Ehsan Hosseini-Asl