Strong average optimality criterion for continuous-time Markov decision processes
Kybernetika, Tome 50 (2014) no. 6, pp. 950-977
Voir la notice de l'article provenant de la source Czech Digital Mathematics Library
This paper deals with continuous-time Markov decision processes with the unbounded transition rates under the strong average cost criterion. The state and action spaces are Borel spaces, and the costs are allowed to be unbounded from above and from below. Under mild conditions, we first prove that the finite-horizon optimal value function is a solution to the optimality equation for the case of uncountable state spaces and unbounded transition rates, and that there exists an optimal deterministic Markov policy. Then, using the two average optimality inequalities, we show that the set of all strong average optimal policies coincides with the set of all average optimal policies, and thus obtain the existence of strong average optimal policies. Furthermore, employing the technique of the skeleton chains of controlled continuous-time Markov chains and Chapman-Kolmogorov equation, we give a new set of sufficient conditions imposed on the primitive data of the model for the verification of the uniform exponential ergodicity of continuous-time Markov chains governed by stationary policies. Finally, we illustrate our main results with an example.
DOI :
10.14736/kyb-2014-6-0950
Classification :
49K45, 90C40, 93E20
Keywords: continuous-time Markov decision processes; strong average optimality criterion; finite-horizon expected total cost criterion; unbounded transition rates; optimal policy; optimal value function
Keywords: continuous-time Markov decision processes; strong average optimality criterion; finite-horizon expected total cost criterion; unbounded transition rates; optimal policy; optimal value function
@article{10_14736_kyb_2014_6_0950,
author = {Wei, Qingda and Chen, Xian},
title = {Strong average optimality criterion for continuous-time {Markov} decision processes},
journal = {Kybernetika},
pages = {950--977},
publisher = {mathdoc},
volume = {50},
number = {6},
year = {2014},
doi = {10.14736/kyb-2014-6-0950},
mrnumber = {3301781},
zbl = {1307.93467},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2014-6-0950/}
}
TY - JOUR AU - Wei, Qingda AU - Chen, Xian TI - Strong average optimality criterion for continuous-time Markov decision processes JO - Kybernetika PY - 2014 SP - 950 EP - 977 VL - 50 IS - 6 PB - mathdoc UR - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2014-6-0950/ DO - 10.14736/kyb-2014-6-0950 LA - en ID - 10_14736_kyb_2014_6_0950 ER -
%0 Journal Article %A Wei, Qingda %A Chen, Xian %T Strong average optimality criterion for continuous-time Markov decision processes %J Kybernetika %D 2014 %P 950-977 %V 50 %N 6 %I mathdoc %U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2014-6-0950/ %R 10.14736/kyb-2014-6-0950 %G en %F 10_14736_kyb_2014_6_0950
Wei, Qingda; Chen, Xian. Strong average optimality criterion for continuous-time Markov decision processes. Kybernetika, Tome 50 (2014) no. 6, pp. 950-977. doi: 10.14736/kyb-2014-6-0950
Cité par Sources :