Cite
Notes
Only stored in your browser.
Attribution
Training Language Models to Explain Their Own Computations
arXiv 2025
from 1 papers
Belinda Z. Li
Jacob Andreas
Jacob Steinhardt
founder
Zifan Carl Guo