StrongREJECT: Measuring LLM susceptibility to jailbreak attacks
Active
A benchmark that evaluates the susceptibility of LLMs to various jailbreak attacks.
- Publisher: University of California, Berkeley
- Domain: Safeguards
- License: MIT
- Published: Feb 2025
- Notable for: Benchmark for evaluating Safeguards.
FAQ
- What is StrongREJECT: Measuring LLM susceptibility to jailbreak attacks?
- A benchmark that evaluates the susceptibility of LLMs to various jailbreak attacks.
- What license is StrongREJECT: Measuring LLM susceptibility to jailbreak attacks under?
- StrongREJECT: Measuring LLM susceptibility to jailbreak attacks is available under the MIT License.