Zifan Wang

Papers: 10

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

10papers

Authored papers

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

arXiv 2026

2026

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

arXiv 2025

2025

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

arXiv 2024

2024

Arm-Constrained Curriculum Learning for Loco-Manipulation of the Wheel-Legged Robot

arXiv 2024

2024

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents

arXiv 2024

2024

Universal and Transferable Adversarial Attacks on Aligned Language Models

arXiv 2023

2023

Representation Engineering: A Top-Down Approach to AI Transparency

arXiv 2023

2023

Can LLMs Follow Simple Rules?

arXiv 2023

2023

Unlocking Deterministic Robustness Certification on ImageNet

unlocking-deterministic-robustness

2023

Score-CAM: Score-Weighted Visual Explanations for Convolutional Neural Networks

arXiv 2019

2019

Affiliations

No known affiliations.

Frequent co-authors

from 10 papers

Andy Zou

founder

5 shared papers

Matt Fredrikson

5 shared papers

Dan Hendrycks

director

3 shared papers

J. Zico Kolter

2 shared papers

Long Phan

researcher

2 shared papers

Mantas Mazeika

researcher

2 shared papers

Nathaniel Li

grad-student

Norman Mu

Sarah Chen

Sean Hendryx