The loss landscape of neural networks is a critical aspect of their training, and understanding its properties is essential for improving their performance. In this paper, we investigate how the loss surface changes when the sample size increases, a previously unexplored issue. We theoretically analyze the convergence of the loss landscape in a fully connected neural network and derive upper bounds for the difference in loss function values when adding a new object to the sample. Our empirical study confirms these results on various datasets, demonstrating the convergence of the loss function surface for image classification tasks. Our findings provide insights into the local geometry of neural loss landscapes and have implications for the development of sample size determination techniques.
Unraveling the Hessian: A Key to Smooth Convergence in Loss Function Landscapes
The study explores the convergence of the loss surface in neural networks as sample size increases, offering insights into the local geometry of loss landscapes and sample size determination techniques.
- Year
- 2024
- Venue
- arXiv 2024
- Authors
- 2
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2409.11995ARXIV-DEFAULT
- TL;DR
- Semantic Scholar