Kavel Rao
- Papers: 4
4 papers
Authored papers
4ColorGrid: A Multi-Agent Non-Stationary Environment for Goal Inference and Assistance
arXiv 2025
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
arXiv 2024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
arXiv 2024
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers