Weiyan Shi
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming
arXiv 2026
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
arXiv 2025
GEM: A Gym for Agentic LLMs
arXiv 2025
Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity
arXiv 2025
PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action
arXiv 2024
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
arXiv 2024
Controllable Mixed-Initiative Dialogue Generation through Prompting
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers