0

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

Advanced recurrent units with gating mechanisms, such as LSTM and GRU, outperform traditional recurrent units like tanh units in tasks like polyphonic music and speech signal modeling, with GRU being comparable to LSTM.

Year
2014
Venue
arXiv 2014
Authors
4
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/1412.3555ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

In this paper we compare different types of recurrent units in recurrent neural networks (RNNs). Especially, we focus on more sophisticated units that implement a gating mechanism, such as a long short-term memory (LSTM) unit and a recently proposed gated recurrent unit (GRU). We evaluate these recurrent units on the tasks of polyphonic music modeling and speech signal modeling. Our experiments revealed that these advanced recurrent units are indeed better than more traditional recurrent units such as tanh units. Also, we found GRU to be comparable to LSTM.

Authors

4