Cite
Notes
Only stored in your browser.
Attribution
Effective Red-Teaming of Policy-Adherent Agents
arXiv 2025
A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios
arXiv 2024
from 2 papers
Eitan Farchi
Ella Rabinovich
George Kour
Guy Uziel
Itay Nakash
Koren Lazar
Matan Vetzler
Samuel Ackerman