Navonil Majumder
- Papers: 16
Authored papers
NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
arXiv 2025
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
arXiv 2025
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
arXiv 2025
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization
arXiv 2024
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
arXiv 2024
Inference Time Alignment with Reward-Guided Tree Search
arXiv 2024
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
arXiv 2024
Sentence Embedder Guided Utterance Encoder (SEGUE) for Spoken Language Understanding
arXiv 2023
Mustango: Toward Controllable Text-to-Music Generation
arXiv 2023
Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning
arXiv 2023
WikiDes: A Wikipedia-Based Dataset for Generating Short Descriptions from Paragraphs
arXiv 2022
Multiview Contextual Commonsense Inference: A New Dataset and Task
arXiv 2022
COSMIC: COmmonSense knowledge for eMotion Identification in Conversations
Findings of the Association for Computational Linguistics 2020
Recognizing Emotion Cause in Conversations
MIME: MIMicking Emotions for Empathetic Response Generation
EMNLP 2020
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
Affiliations
Frequent co-authors
10 from 16 papers