Partially observable Markov decision processes with partially observable random discount factors
Kybernetika, Tome 58 (2022) no. 6, pp. 960-983.

Voir la notice de l'article provenant de la source Czech Digital Mathematics Library

This paper deals with a class of partially observable discounted Markov decision processes defined on Borel state and action spaces, under unbounded one-stage cost. The discount rate is a stochastic process evolving according to a difference equation, which is also assumed to be partially observable. Introducing a suitable control model and filtering processes, we prove the existence of optimal control policies. In addition, we illustrate our results in a class of GI/GI/1 queueing systems where we obtain explicitly the corresponding optimality equation and the filtering process.
DOI : 10.14736/kyb-2022-6-0960
Classification : 90B22, 90C39
Keywords: partially observable systems; discounted criterion; random discount factors; queueing models; optimal policies
@article{10_14736_kyb_2022_6_0960,
     author = {Martinez-Garcia, E. Everardo and Minj\'arez-Sosa, J. Adolfo and Vega-Amaya, Oscar},
     title = {Partially observable {Markov} decision processes with partially observable random discount factors},
     journal = {Kybernetika},
     pages = {960--983},
     publisher = {mathdoc},
     volume = {58},
     number = {6},
     year = {2022},
     doi = {10.14736/kyb-2022-6-0960},
     mrnumber = {4548223},
     zbl = {07655866},
     language = {en},
     url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-6-0960/}
}
TY  - JOUR
AU  - Martinez-Garcia, E. Everardo
AU  - Minjárez-Sosa, J. Adolfo
AU  - Vega-Amaya, Oscar
TI  - Partially observable Markov decision processes with partially observable random discount factors
JO  - Kybernetika
PY  - 2022
SP  - 960
EP  - 983
VL  - 58
IS  - 6
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-6-0960/
DO  - 10.14736/kyb-2022-6-0960
LA  - en
ID  - 10_14736_kyb_2022_6_0960
ER  - 
%0 Journal Article
%A Martinez-Garcia, E. Everardo
%A Minjárez-Sosa, J. Adolfo
%A Vega-Amaya, Oscar
%T Partially observable Markov decision processes with partially observable random discount factors
%J Kybernetika
%D 2022
%P 960-983
%V 58
%N 6
%I mathdoc
%U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-6-0960/
%R 10.14736/kyb-2022-6-0960
%G en
%F 10_14736_kyb_2022_6_0960
Martinez-Garcia, E. Everardo; Minjárez-Sosa, J. Adolfo; Vega-Amaya, Oscar. Partially observable Markov decision processes with partially observable random discount factors. Kybernetika, Tome 58 (2022) no. 6, pp. 960-983. doi : 10.14736/kyb-2022-6-0960. http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-6-0960/

Cité par Sources :