Constrained optimality problem of Markov decision processes with Borel spaces and varying discount factors

Wu, Xiao; Tang, Yanqiu

doi:10.14736/kyb-2021-2-0295

Wu, Xiao ; Tang, Yanqiu

Kybernetika, Tome 57 (2021) no. 2, pp. 295-311

Voir la notice de l'article provenant de la source Czech Digital Mathematics Library

Résumé

This paper focuses on the constrained optimality of discrete-time Markov decision processes (DTMDPs) with state-dependent discount factors, Borel state and compact Borel action spaces, and possibly unbounded costs. By means of the properties of so-called occupation measures of policies and the technique of transforming the original constrained optimality problem of DTMDPs into a convex program one, we prove the existence of an optimal randomized stationary policies under reasonable conditions.

MR Zbl

DOI : 10.14736/kyb-2021-2-0295

Classification : 60J27, 90C40
Keywords: constrained optimality problem; discrete-time Markov decision processes; Borel state and action spaces; varying discount factors; unbounded costs

@article{10_14736_kyb_2021_2_0295,
     author = {Wu, Xiao and Tang, Yanqiu},
     title = {Constrained optimality problem of {Markov} decision processes with {Borel} spaces and varying discount factors},
     journal = {Kybernetika},
     pages = {295--311},
     publisher = {mathdoc},
     volume = {57},
     number = {2},
     year = {2021},
     doi = {10.14736/kyb-2021-2-0295},
     mrnumber = {4273577},
     zbl = {07396268},
     language = {en},
     url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2021-2-0295/}
}

TY  - JOUR
AU  - Wu, Xiao
AU  - Tang, Yanqiu
TI  - Constrained optimality problem of Markov decision processes with Borel spaces and varying discount factors
JO  - Kybernetika
PY  - 2021
SP  - 295
EP  - 311
VL  - 57
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2021-2-0295/
DO  - 10.14736/kyb-2021-2-0295
LA  - en
ID  - 10_14736_kyb_2021_2_0295
ER  -

%0 Journal Article
%A Wu, Xiao
%A Tang, Yanqiu
%T Constrained optimality problem of Markov decision processes with Borel spaces and varying discount factors
%J Kybernetika
%D 2021
%P 295-311
%V 57
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2021-2-0295/
%R 10.14736/kyb-2021-2-0295
%G en
%F 10_14736_kyb_2021_2_0295

Wu, Xiao; Tang, Yanqiu. Constrained optimality problem of Markov decision processes with Borel spaces and varying discount factors. Kybernetika, Tome 57 (2021) no. 2, pp. 295-311. doi: 10.14736/kyb-2021-2-0295

Cité par Sources :

Parcourir par

Geodesic

Parcourir par