Cite
Notes
Only stored in your browser.
Attribution
On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking
arXiv 2026
In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
arXiv 2025
from 2 papers
Siyu Chen
Zhuoran Yang
Leda Wang
Xintian Pan