Rita Singh
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Mellow: a small audio language model for reasoning
arXiv 2025
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
arXiv 2024
Audio Entailment: Assessing Deductive Reasoning for Audio Understanding
arXiv 2024
$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations
arXiv 2024
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization?
arXiv 2024
Pengi: An Audio Language Model for Audio Tasks
pengi-an-audio-language-model-for-audio-tasks
Token Prediction as Implicit Classification to Identify LLM-Generated Text
arXiv 2023
Training Audio Captioning Models without Audio
arXiv 2023
GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers