Cite
Notes
Only stored in your browser.
Attribution
Pretraining Large Language Models with NVFP4
arXiv 2025
A Theoretical Explanation for Perplexing Behaviors of Backpropagation-based Visualizations
a-theoretical-explanation-for-perplexing-1
from 2 papers
Aaron Blakeman
Abhijit Paithankar
Abhinav Goel
Aditya Vavre
Alex Kondratenko
Alexis Bjorlin
Anjulie Agrusa
Ashwin Poojary
Asit Mishra
Ben Lanir