Partially observable Markov decision processes with partially observable random discount factors
Kybernetika, Tome 58 (2022) no. 6, pp. 960-983
Cet article a éte moissonné depuis la source Czech Digital Mathematics Library
This paper deals with a class of partially observable discounted Markov decision processes defined on Borel state and action spaces, under unbounded one-stage cost. The discount rate is a stochastic process evolving according to a difference equation, which is also assumed to be partially observable. Introducing a suitable control model and filtering processes, we prove the existence of optimal control policies. In addition, we illustrate our results in a class of GI/GI/1 queueing systems where we obtain explicitly the corresponding optimality equation and the filtering process.
This paper deals with a class of partially observable discounted Markov decision processes defined on Borel state and action spaces, under unbounded one-stage cost. The discount rate is a stochastic process evolving according to a difference equation, which is also assumed to be partially observable. Introducing a suitable control model and filtering processes, we prove the existence of optimal control policies. In addition, we illustrate our results in a class of GI/GI/1 queueing systems where we obtain explicitly the corresponding optimality equation and the filtering process.
DOI :
10.14736/kyb-2022-6-0960
Classification :
90B22, 90C39
Keywords: partially observable systems; discounted criterion; random discount factors; queueing models; optimal policies
Keywords: partially observable systems; discounted criterion; random discount factors; queueing models; optimal policies
@article{10_14736_kyb_2022_6_0960,
author = {Martinez-Garcia, E. Everardo and Minj\'arez-Sosa, J. Adolfo and Vega-Amaya, Oscar},
title = {Partially observable {Markov} decision processes with partially observable random discount factors},
journal = {Kybernetika},
pages = {960--983},
year = {2022},
volume = {58},
number = {6},
doi = {10.14736/kyb-2022-6-0960},
mrnumber = {4548223},
zbl = {07655866},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-6-0960/}
}
TY - JOUR AU - Martinez-Garcia, E. Everardo AU - Minjárez-Sosa, J. Adolfo AU - Vega-Amaya, Oscar TI - Partially observable Markov decision processes with partially observable random discount factors JO - Kybernetika PY - 2022 SP - 960 EP - 983 VL - 58 IS - 6 UR - http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-6-0960/ DO - 10.14736/kyb-2022-6-0960 LA - en ID - 10_14736_kyb_2022_6_0960 ER -
%0 Journal Article %A Martinez-Garcia, E. Everardo %A Minjárez-Sosa, J. Adolfo %A Vega-Amaya, Oscar %T Partially observable Markov decision processes with partially observable random discount factors %J Kybernetika %D 2022 %P 960-983 %V 58 %N 6 %U http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-6-0960/ %R 10.14736/kyb-2022-6-0960 %G en %F 10_14736_kyb_2022_6_0960
Martinez-Garcia, E. Everardo; Minjárez-Sosa, J. Adolfo; Vega-Amaya, Oscar. Partially observable Markov decision processes with partially observable random discount factors. Kybernetika, Tome 58 (2022) no. 6, pp. 960-983. doi: 10.14736/kyb-2022-6-0960
Cité par Sources :