Enhancing Automated Interpretability with Output-Centric Feature Descriptions
arXiv 2025
Atticus Geiger
Chen Agassy
Mor Geva
Yoav Gur-Arieh