Stuart Shieber

3 papers

Authored papers

Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls

arXiv 2025

From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step

arXiv 2024

Implicit Chain of Thought Reasoning via Knowledge Distillation

arXiv 2023

No known affiliations.

Co-authors from 3 papers

Yuntian Deng

professor

Andrew Lee

Chenhao Tan

Fernanda Viégas

Itamar Pres

Kiran Prasad

Martin Wattenberg

Paul Smolensky

Roland Fernandez

Vishrav Chaudhary