Zeming Wei
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
63DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians
arXiv 2025
False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
arXiv 2025
Boosting Jailbreak Attack with Momentum
arXiv 2024
Fight Back Against Jailbreaking via Prompt Adversarial Tuning
arXiv 2024
Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models
arXiv 2024
Jatmo: Prompt Injection Defense by Task-Specific Finetuning
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers