The lottery ticket hypothesis conjectures the existence of sparse subnetworks of large randomly initialized deep neural networks that can be successfully trained in isolation. Recent work has experimentally observed that some of these tickets can be practically reused across a variety of tasks, hinting at some form of universality. We formalize this concept and theoretically prove that not only do such universal tickets exist but they also do not require further training. Our proofs introduce a couple of technical innovations related to pruning for strong lottery tickets, including extensions of subset sum results and a strategy to leverage higher amounts of depth. Our explicit sparse constructions of universal function families might be of independent interest, as they highlight representational benefits induced by univariate convolutional architectures.
On the Existence of Universal Lottery Tickets
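Formalizes universal lottery tickets and proves that sparse subnetworks of large randomly initialized deep neural networks exist that work across tasks without any further training. The proofs extend subset sum results and exploit greater depth during pruning, and the explicit sparse constructions highlight representational benefits of univariate convolutional architectures.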
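The subset sum idea mentioned in the abstract can be illustrated with a small sketch: pruning a set of random weights amounts to choosing a subset, and with enough random candidates some subset sum usually lands close to a desired target weight. The code below is only an illustration under stated assumptions, not the paper's construction or proof. The names `best_subset_mask`, `candidates`, and `target` are hypothetical, the candidates are assumed to be uniform on [-1, 1], and the exhaustive search is chosen for clarity rather than efficiency.

```python
import itertools
import random


def best_subset_mask(candidates, target):
    """Brute-force the subset of `candidates` whose sum is closest to `target`.

    Returns (mask, error). mask[i] == 1 means candidate i is kept,
    mask[i] == 0 means it is pruned. Illustrative helper only.
    """
    best_mask, best_err = None, float("inf")
    # Enumerate every keep/prune pattern (2**n masks), so keep n small.
    for mask in itertools.product((0, 1), repeat=len(candidates)):
        total = sum(c for c, keep in zip(candidates, mask) if keep)
        err = abs(total - target)
        if err < best_err:
            best_mask, best_err = mask, err
    return best_mask, best_err


# Assumed setup: 14 random weights, one target weight to approximate.
rng = random.Random(0)
target = 0.3141
candidates = [rng.uniform(-1.0, 1.0) for _ in range(14)]

mask, err = best_subset_mask(candidates, target)
print("kept weights:", sum(mask), "of", len(candidates))
print("approximation error:", err)
```

Because the empty subset is among the options, the returned error is never worse than `abs(target)`. How quickly the error shrinks as candidates are added is what subset sum results of this kind quantify.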
- Year
- 2021
- Authors
- 4
- Hosting
- Abstract only (arXiv)
Attribution
- Abstract & full text
- arxiv.org/abs/2111.11146v2
- TL;DR
- Semantic Scholar