Sreyan Ghosh

Papers: 14


Attribution

Affiliations & profile: Semantic Scholar



Authored papers

Do Audio-Visual Large Language Models Really See and Hear?

arXiv 2026


Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

arXiv 2025


OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

arXiv 2025


MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

arXiv 2024


ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

arXiv 2024


Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs

arXiv 2024


Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

arXiv 2024


CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP

arXiv 2024


UNFUSED: UNsupervised Finetuning Using SElf supervised Distillation

arXiv 2023


DALE: Generative Data Augmentation for Low-Resource Legal NLP

arXiv 2023


ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations

arXiv 2023


CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations

arXiv 2022


data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

arXiv 2022


PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations

arXiv 2022


Affiliations

No known affiliations.

Frequent co-authors

from 14 papers

Dinesh Manocha

10 shared papers

Sonal Kumar

8 shared papers

Utkarsh Tyagi

5 shared papers

Chandra Kiran Reddy Evuru

4 shared papers

S Sakshi

4 shared papers

S. Umesh

4 shared papers

Bryan Catanzaro


Oriol Nieto

Rafael Valle

Ashish Seth