0

Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks

Flexible parametric rational functions called Padé Activation Units (PAUs) are used in deep networks to improve performance and learn new activation functions.

Year
2019
Venue
ICLR 2020 1
Authors
3
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/1907.06732v3ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

The performance of deep network learning strongly depends on the choice of the non-linear activation function associated with each neuron. However, deciding on the best activation is non-trivial, and the choice depends on the architecture, hyper-parameters, and even on the dataset. Typically these activations are fixed by hand before training. Here, we demonstrate how to eliminate the reliance on first picking fixed activation functions by using flexible parametric rational functions instead. The resulting Pad'e Activation Units (PAUs) can both approximate common activation functions and also learn new ones while providing compact representations. Our empirical evidence shows that end-to-end learning deep networks with PAUs can increase the predictive performance. Moreover, PAUs pave the way to approximations with provable robustness. https://github.com/ml-research/pau

Authors

3