Gregor Bachmann

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

A Language Model's Guide Through Latent Space

arXiv 2024

The pitfalls of next-token prediction

arXiv 2024

Scaling MLPs: A Tale of Inductive Bias

scaling-mlps-a-tale-of-inductive-bias

Random Teachers are Good Teachers

arXiv 2023

No known affiliations.

from 4 papers

Sotiris Anagnostidis

Thomas Hofmann

Dimitri von Rütte

Felix Sarnthein

Vaishnavh Nagarajan