Aleksandar Makelov

Cite

Notes

Only stored in your browser.

Attribution

2papers

Authored papers

Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching

arXiv 2023

Towards Deep Learning Models Resistant to Adversarial Attacks

towards-deep-learning-models-resistant-to-1

No known affiliations.

from 2 papers

Adrian Vladu

Aleksander Mądry

Dimitris Tsipras

researcher

Georg Lange

Ludwig Schmidt

professor

Neel Nanda

researcher