Symmetry Defeats Auditing

Year: 2026
Full text: arxiv.org/abs/2605.27836 (CC-BY-4.0)

Abstract

We demonstrate an attack on Introspection Adapters (Shenoy et al., 2026).