Markov decision processes with time-varying discount factors and random horizon
Kybernetika, Tome 53 (2017) no. 1, pp. 82-98.

Voir la notice de l'article provenant de la source Czech Digital Mathematics Library

This paper is related to Markov Decision Processes. The optimal control problem is to minimize the expected total discounted cost, with a non-constant discount factor. The discount factor is time-varying and it could depend on the state and the action. Furthermore, it is considered that the horizon of the optimization problem is given by a discrete random variable, that is, a random horizon is assumed. Under general conditions on Markov control model, using the dynamic programming approach, an optimality equation for both cases is obtained, namely, finite support and infinite support of the random horizon. The obtained results are illustrated by two examples, one of them related to optimal replacement.
DOI : 10.14736/kyb-2017-1-0082
Classification : 90C39, 90C40, 93E20
Keywords: Markov decision process; dynamic programming; varying discount factor; random horizon
@article{10_14736_kyb_2017_1_0082,
     author = {Ilhuicatzi-Rold\'an, Rocio and Cruz-Su\'arez, Hugo and Ch\'avez-Rodr{\'\i}guez, Selene},
     title = {Markov decision processes with time-varying discount factors and random horizon},
     journal = {Kybernetika},
     pages = {82--98},
     publisher = {mathdoc},
     volume = {53},
     number = {1},
     year = {2017},
     doi = {10.14736/kyb-2017-1-0082},
     mrnumber = {3638557},
     zbl = {06738595},
     language = {en},
     url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2017-1-0082/}
}
TY  - JOUR
AU  - Ilhuicatzi-Roldán, Rocio
AU  - Cruz-Suárez, Hugo
AU  - Chávez-Rodríguez, Selene
TI  - Markov decision processes with time-varying discount factors and random horizon
JO  - Kybernetika
PY  - 2017
SP  - 82
EP  - 98
VL  - 53
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2017-1-0082/
DO  - 10.14736/kyb-2017-1-0082
LA  - en
ID  - 10_14736_kyb_2017_1_0082
ER  - 
%0 Journal Article
%A Ilhuicatzi-Roldán, Rocio
%A Cruz-Suárez, Hugo
%A Chávez-Rodríguez, Selene
%T Markov decision processes with time-varying discount factors and random horizon
%J Kybernetika
%D 2017
%P 82-98
%V 53
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2017-1-0082/
%R 10.14736/kyb-2017-1-0082
%G en
%F 10_14736_kyb_2017_1_0082
Ilhuicatzi-Roldán, Rocio; Cruz-Suárez, Hugo; Chávez-Rodríguez, Selene. Markov decision processes with time-varying discount factors and random horizon. Kybernetika, Tome 53 (2017) no. 1, pp. 82-98. doi : 10.14736/kyb-2017-1-0082. http://geodesic.mathdoc.fr/articles/10.14736/kyb-2017-1-0082/

Cité par Sources :