Cite
Notes
Only stored in your browser.
Attribution
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
arXiv 2024
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
from 3 papers
Dinesh Manocha
Sonal Kumar
Sreyan Ghosh
Chandra Kiran Reddy Evuru
Ramani Duraiswami
Utkarsh Tyagi
Ashish Seth
Ramaneswaran Selvakumar
S Sakshi
Zeyu Jin