We introduce a model-based asynchronous multi-fidelity method for hyperparameter and neural architecture search that combines the strengths of asynchronous Hyperband and Gaussian process-based Bayesian optimization. At the heart of our method is a probabilistic model that can simultaneously reason across hyperparameters and resource levels, and supports decision-making in the presence of pending evaluations. We demonstrate the effectiveness of our method on a wide range of challenging benchmarks, for tabular data, image classification and language modelling, and report substantial speed-ups over current state-of-the-art methods. Our new methods, along with asynchronous baselines, are implemented in a distributed framework which will be open sourced along with this publication.
Model-based Asynchronous Hyperparameter and Neural Architecture Search
A model-based asynchronous multi-fidelity method combining Hyperband and Bayesian optimization achieves significant speed-ups in hyperparameter and neural architecture search across various benchmarks.
- Year
- 2020
- Venue
- arXiv 2020
- Authors
- 5
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2003.10865v2ARXIV-DEFAULT
- TL;DR
- Semantic Scholar