The exponential cost optimality for finite horizon semi-Markov decision processes
Kybernetika, Tome 58 (2022) no. 3, pp. 301-319.

Voir la notice de l'article provenant de la source Czech Digital Mathematics Library

This paper considers an exponential cost optimality problem for finite horizon semi-Markov decision processes (SMDPs). The objective is to calculate an optimal policy with minimal exponential costs over the full set of policies in a finite horizon. First, under the standard regular and compact-continuity conditions, we establish the optimality equation, prove that the value function is the unique solution of the optimality equation and the existence of an optimal policy by using the minimum nonnegative solution approach. Second, we establish a new value iteration algorithm to calculate both the value function and the $\epsilon$-optimal policy. Finally, we give a computable machine maintenance system to illustrate the convergence of the algorithm.
DOI : 10.14736/kyb-2022-3-0301
Classification : 60E20, 90C40
Keywords: semi-Markov decision processes; exponential cost; finite horizon; optimality equation; optimal policy
@article{10_14736_kyb_2022_3_0301,
     author = {Huo, Haifeng and Wen, Xian},
     title = {The exponential cost optimality for finite horizon {semi-Markov} decision processes},
     journal = {Kybernetika},
     pages = {301--319},
     publisher = {mathdoc},
     volume = {58},
     number = {3},
     year = {2022},
     doi = {10.14736/kyb-2022-3-0301},
     mrnumber = {4494093},
     zbl = {07613047},
     language = {en},
     url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-3-0301/}
}
TY  - JOUR
AU  - Huo, Haifeng
AU  - Wen, Xian
TI  - The exponential cost optimality for finite horizon semi-Markov decision processes
JO  - Kybernetika
PY  - 2022
SP  - 301
EP  - 319
VL  - 58
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-3-0301/
DO  - 10.14736/kyb-2022-3-0301
LA  - en
ID  - 10_14736_kyb_2022_3_0301
ER  - 
%0 Journal Article
%A Huo, Haifeng
%A Wen, Xian
%T The exponential cost optimality for finite horizon semi-Markov decision processes
%J Kybernetika
%D 2022
%P 301-319
%V 58
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-3-0301/
%R 10.14736/kyb-2022-3-0301
%G en
%F 10_14736_kyb_2022_3_0301
Huo, Haifeng; Wen, Xian. The exponential cost optimality for finite horizon semi-Markov decision processes. Kybernetika, Tome 58 (2022) no. 3, pp. 301-319. doi : 10.14736/kyb-2022-3-0301. http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-3-0301/

Cité par Sources :