Another set of verifiable conditions for average Markov decision processes with Borel spaces
Kybernetika, Volume 51 (2015) no. 2, pp. 276-292
See the article record from the source Czech Digital Mathematics Library
In this paper we give a new set of verifiable conditions for the existence of average optimal stationary policies in discrete-time Markov decision processes with Borel spaces and unbounded reward/cost functions. More precisely, we provide another set of conditions, which consists only of a Lyapunov-type condition and the common continuity-compactness conditions. These conditions are imposed on the primitive data of the model of Markov decision processes and are thus easy to verify. We also give two examples for which all our conditions are satisfied, but some of the conditions in the related literature fail to hold.
DOI: 10.14736/kyb-2015-2-0276
Classification: 90C40, 93E20
Keywords: discrete-time Markov decision processes; average reward criterion; optimal stationary policy; Lyapunov-type condition; unbounded reward/cost function
@article{10_14736_kyb_2015_2_0276,
author = {Zou, Xiaolong and Guo, Xianping},
title = {Another set of verifiable conditions for average {Markov} decision processes with {Borel} spaces},
journal = {Kybernetika},
pages = {276--292},
publisher = {mathdoc},
volume = {51},
number = {2},
year = {2015},
doi = {10.14736/kyb-2015-2-0276},
mrnumber = {3350562},
zbl = {06487079},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2015-2-0276/}
}
TY - JOUR
AU - Zou, Xiaolong
AU - Guo, Xianping
TI - Another set of verifiable conditions for average Markov decision processes with Borel spaces
JO - Kybernetika
PY - 2015
SP - 276
EP - 292
VL - 51
IS - 2
PB - mathdoc
UR - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2015-2-0276/
DO - 10.14736/kyb-2015-2-0276
LA - en
ID - 10_14736_kyb_2015_2_0276
ER -
%0 Journal Article
%A Zou, Xiaolong
%A Guo, Xianping
%T Another set of verifiable conditions for average Markov decision processes with Borel spaces
%J Kybernetika
%D 2015
%P 276-292
%V 51
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2015-2-0276/
%R 10.14736/kyb-2015-2-0276
%G en
%F 10_14736_kyb_2015_2_0276
Zou, Xiaolong; Guo, Xianping. Another set of verifiable conditions for average Markov decision processes with Borel spaces. Kybernetika, Volume 51 (2015) no. 2, pp. 276-292. doi: 10.14736/kyb-2015-2-0276