On the effectiveness of Gated Echo State Networks for data exhibiting long-term dependencies
Computer Science and Information Systems, Tome 19 (2022) no. 1.

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

In the context of recurrent neural networks, gated architectures such as the GRU have contributed to the development of highly accurate machine learning models that can tackle long-term dependencies in the data. However, the training of such networks is performed by the expensive algorithm of gradient descent with backpropagation through time. On the other hand, reservoir computing approaches such as Echo State Networks (ESNs) can produce models that can be trained efficiently thanks to the use of fixed random parameters, but are not ideal for dealing with data presenting long-term dependencies. We explore the problem of employing gated architectures in ESNs from both theoretical and empirical perspectives.We do so by deriving and evaluating a necessary condition for the non-contractivity of the state transition function, which is important to overcome the fading-memorycharacterization of conventional ESNs. We find that using pure reservoir computing methodologies is not sufficient for effective gating mechanisms, while insteadtraining even only the gates is highly effective in terms of predictive accuracy.
Keywords: echo state networks, gated recurrent neural networks, reservoir computing
@article{CSIS_2022_19_1_a18,
     author = {Daniele Di Sarli and Claudio Gallicchio and Alessio Micheli},
     title = {On the effectiveness of {Gated} {Echo} {State} {Networks} for data exhibiting long-term dependencies},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {19},
     number = {1},
     year = {2022},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2022_19_1_a18/}
}
TY  - JOUR
AU  - Daniele Di Sarli
AU  - Claudio Gallicchio
AU  - Alessio Micheli
TI  - On the effectiveness of Gated Echo State Networks for data exhibiting long-term dependencies
JO  - Computer Science and Information Systems
PY  - 2022
VL  - 19
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2022_19_1_a18/
ID  - CSIS_2022_19_1_a18
ER  - 
%0 Journal Article
%A Daniele Di Sarli
%A Claudio Gallicchio
%A Alessio Micheli
%T On the effectiveness of Gated Echo State Networks for data exhibiting long-term dependencies
%J Computer Science and Information Systems
%D 2022
%V 19
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2022_19_1_a18/
%F CSIS_2022_19_1_a18
Daniele Di Sarli; Claudio Gallicchio; Alessio Micheli. On the effectiveness of Gated Echo State Networks for data exhibiting long-term dependencies. Computer Science and Information Systems, Tome 19 (2022) no. 1. http://geodesic.mathdoc.fr/item/CSIS_2022_19_1_a18/