Cite
Notes
Only stored in your browser.
Attribution
Towards Meta-Pruning via Optimal Transport
arXiv 2024
A Language Model's Guide Through Latent Space
Scaling MLPs: A Tale of Inductive Bias
scaling-mlps-a-tale-of-inductive-bias
Transformer Fusion with Optimal Transport
arXiv 2023
Random Teachers are Good Teachers
from 5 papers
Thomas Hofmann
Gregor Bachmann
Sidak Pal Singh
Alexander Theus
Dimitri von Rütte
Felix Sarnthein
Friedrich Wicke
Jacopo Graldi
Marco Giordano
Moritz Imfeld