On the Interplay Between Stepsize Tuning and Progressive Sharpening
Recent empirical work has revealed an intriguing property of deep learning models by which the sharpness (largest eigenvalue of the Hessian) increases throughout optimization until it stabilizes around a critical value at which the optimizer operates at the edge of stability,…
- Year
- 2023
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.