Cite
Notes
Only stored in your browser.
Attribution
Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls
arXiv 2025
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step
arXiv 2024
Implicit Chain of Thought Reasoning via Knowledge Distillation
arXiv 2023
from 3 papers
Yuntian Deng
professor
Andrew Lee
Chenhao Tan
Fernanda Viégas
Itamar Pres
Kiran Prasad
Martin Wattenberg
Paul Smolensky
Roland Fernandez
Vishrav Chaudhary