Andy Zhou
5 papers
Authored papers
Tamper-Resistant Safeguards for Open-Weight LLMs
arXiv 2024
RedCode: Risky Code Execution and Generation Benchmark for Code Agents
arXiv 2024
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks
arXiv 2024
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models
arXiv 2023
Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers