Rishabh Bhardwaj
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always!
arXiv 2025
MSTS: A Multimodal Safety Test Suite for Vision-Language Models
arXiv 2025
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
arXiv 2024
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
arXiv 2024
Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
arXiv 2024
WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models
arXiv 2024
Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment
arXiv 2023
Recognizing Emotion Cause in Conversations
recognizing-emotion-cause-in-conversations
Affiliations
Frequent co-authors
10from 8 papers