Sreyan Ghosh
- Papers: 14
Authored papers (14)
Do Audio-Visual Large Language Models Really See and Hear?
arXiv 2026
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
arXiv 2025
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
arXiv 2025
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
arXiv 2024
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
arXiv 2024
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
arXiv 2024
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data
arXiv 2024
CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP
arXiv 2024
UNFUSED: UNsupervised Finetuning Using SElf supervised Distillation
arXiv 2023
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
arXiv 2023
DALE: Generative Data Augmentation for Low-Resource Legal NLP
arXiv 2023
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations
arXiv 2022
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations
arXiv 2022
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup
arXiv 2022
Frequent co-authors
10 from 14 papers