The exponential cost optimality for finite horizon semi-Markov decision processes
Kybernetika, Tome 58 (2022) no. 3, pp. 301-319
Voir la notice de l'article provenant de la source Czech Digital Mathematics Library
This paper considers an exponential cost optimality problem for finite horizon semi-Markov decision processes (SMDPs). The objective is to calculate an optimal policy with minimal exponential costs over the full set of policies in a finite horizon. First, under the standard regular and compact-continuity conditions, we establish the optimality equation, prove that the value function is the unique solution of the optimality equation and the existence of an optimal policy by using the minimum nonnegative solution approach. Second, we establish a new value iteration algorithm to calculate both the value function and the $\epsilon$-optimal policy. Finally, we give a computable machine maintenance system to illustrate the convergence of the algorithm.
DOI :
10.14736/kyb-2022-3-0301
Classification :
60E20, 90C40
Keywords: semi-Markov decision processes; exponential cost; finite horizon; optimality equation; optimal policy
Keywords: semi-Markov decision processes; exponential cost; finite horizon; optimality equation; optimal policy
@article{10_14736_kyb_2022_3_0301,
author = {Huo, Haifeng and Wen, Xian},
title = {The exponential cost optimality for finite horizon {semi-Markov} decision processes},
journal = {Kybernetika},
pages = {301--319},
publisher = {mathdoc},
volume = {58},
number = {3},
year = {2022},
doi = {10.14736/kyb-2022-3-0301},
mrnumber = {4494093},
zbl = {07613047},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-3-0301/}
}
TY - JOUR AU - Huo, Haifeng AU - Wen, Xian TI - The exponential cost optimality for finite horizon semi-Markov decision processes JO - Kybernetika PY - 2022 SP - 301 EP - 319 VL - 58 IS - 3 PB - mathdoc UR - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-3-0301/ DO - 10.14736/kyb-2022-3-0301 LA - en ID - 10_14736_kyb_2022_3_0301 ER -
%0 Journal Article %A Huo, Haifeng %A Wen, Xian %T The exponential cost optimality for finite horizon semi-Markov decision processes %J Kybernetika %D 2022 %P 301-319 %V 58 %N 3 %I mathdoc %U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-3-0301/ %R 10.14736/kyb-2022-3-0301 %G en %F 10_14736_kyb_2022_3_0301
Huo, Haifeng; Wen, Xian. The exponential cost optimality for finite horizon semi-Markov decision processes. Kybernetika, Tome 58 (2022) no. 3, pp. 301-319. doi: 10.14736/kyb-2022-3-0301
Cité par Sources :