Constrained optimality problem of Markov decision processes with Borel spaces and varying discount factors
Kybernetika, Tome 57 (2021) no. 2, pp. 295-311
Voir la notice de l'article provenant de la source Czech Digital Mathematics Library
This paper focuses on the constrained optimality of discrete-time Markov decision processes (DTMDPs) with state-dependent discount factors, Borel state and compact Borel action spaces, and possibly unbounded costs. By means of the properties of so-called occupation measures of policies and the technique of transforming the original constrained optimality problem of DTMDPs into a convex program one, we prove the existence of an optimal randomized stationary policies under reasonable conditions.
DOI :
10.14736/kyb-2021-2-0295
Classification :
60J27, 90C40
Keywords: constrained optimality problem; discrete-time Markov decision processes; Borel state and action spaces; varying discount factors; unbounded costs
Keywords: constrained optimality problem; discrete-time Markov decision processes; Borel state and action spaces; varying discount factors; unbounded costs
@article{10_14736_kyb_2021_2_0295,
author = {Wu, Xiao and Tang, Yanqiu},
title = {Constrained optimality problem of {Markov} decision processes with {Borel} spaces and varying discount factors},
journal = {Kybernetika},
pages = {295--311},
publisher = {mathdoc},
volume = {57},
number = {2},
year = {2021},
doi = {10.14736/kyb-2021-2-0295},
mrnumber = {4273577},
zbl = {07396268},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2021-2-0295/}
}
TY - JOUR AU - Wu, Xiao AU - Tang, Yanqiu TI - Constrained optimality problem of Markov decision processes with Borel spaces and varying discount factors JO - Kybernetika PY - 2021 SP - 295 EP - 311 VL - 57 IS - 2 PB - mathdoc UR - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2021-2-0295/ DO - 10.14736/kyb-2021-2-0295 LA - en ID - 10_14736_kyb_2021_2_0295 ER -
%0 Journal Article %A Wu, Xiao %A Tang, Yanqiu %T Constrained optimality problem of Markov decision processes with Borel spaces and varying discount factors %J Kybernetika %D 2021 %P 295-311 %V 57 %N 2 %I mathdoc %U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2021-2-0295/ %R 10.14736/kyb-2021-2-0295 %G en %F 10_14736_kyb_2021_2_0295
Wu, Xiao; Tang, Yanqiu. Constrained optimality problem of Markov decision processes with Borel spaces and varying discount factors. Kybernetika, Tome 57 (2021) no. 2, pp. 295-311. doi: 10.14736/kyb-2021-2-0295
Cité par Sources :