Markov decision processes on finite spaces with fuzzy total rewards
Kybernetika, Volume 58 (2022) no. 2, pp. 180-199.

See the article record from the source Czech Digital Mathematics Library

The paper concerns Markov decision processes (MDPs) with finite state and decision spaces and with the total reward as the objective function. For this class of MDPs, the authors assume that the reward function is fuzzy: specifically, it has a suitable trapezoidal shape that is a function of a standard non-fuzzy reward. The fuzzy control problem consists of determining a control policy that maximizes the fuzzy expected total reward, where the maximization is taken with respect to the partial order on the $\alpha$-cuts of fuzzy numbers. The optimal policy and the optimal value function of the fuzzy optimal control problem are characterized by means of the dynamic programming equation of the standard optimal control problem. The main conclusions are that the optimal policies of the standard problem and of the fuzzy problem coincide, and that the fuzzy optimal value function has a convenient trapezoidal form. As illustrations, fuzzy extensions of an optimal stopping problem and of a red-black gambling model are presented.
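To make the ordering used in the abstract concrete, the following is a minimal Python sketch of trapezoidal fuzzy numbers, their $\alpha$-cuts, and the usual endpoint-wise partial order on those cuts. It relies only on the textbook definitions; the class name `Trapezoid`, the function `leq`, and the sample values are hypothetical and are not taken from the paper, whose specific trapezoidal reward shape is not given in this abstract.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Trapezoid:
    """Trapezoidal fuzzy number (a, b, c, d) with a <= b <= c <= d.

    Membership rises linearly on [a, b], equals 1 on [b, c],
    and falls linearly on [c, d]. (Illustrative class, not from the paper.)
    """
    a: float
    b: float
    c: float
    d: float

    def __post_init__(self) -> None:
        if not (self.a <= self.b <= self.c <= self.d):
            raise ValueError("need a <= b <= c <= d")

    def alpha_cut(self, alpha: float) -> Tuple[float, float]:
        """Return the closed interval of points with membership >= alpha.

        For alpha = 0 this returns the closure of the support, [a, d].
        """
        if not 0.0 <= alpha <= 1.0:
            raise ValueError("alpha must lie in [0, 1]")
        lower = self.a + alpha * (self.b - self.a)
        upper = self.d - alpha * (self.d - self.c)
        return lower, upper


def leq(x: Trapezoid, y: Trapezoid) -> bool:
    """Partial order: x <= y iff both endpoints of every alpha-cut of x
    are <= the corresponding endpoints of the alpha-cut of y.

    The endpoints are linear in alpha for trapezoidal numbers, so it is
    enough to compare the cuts at alpha = 0 and alpha = 1.
    """
    for alpha in (0.0, 1.0):
        x_lo, x_hi = x.alpha_cut(alpha)
        y_lo, y_hi = y.alpha_cut(alpha)
        if x_lo > y_lo or x_hi > y_hi:
            return False
    return True


# Hypothetical sample values, chosen only for illustration.
x = Trapezoid(1, 2, 3, 4)
y = Trapezoid(2, 3, 4, 5)
z = Trapezoid(0, 2.5, 2.5, 6)

print(x.alpha_cut(0.5))        # cut of x at level 0.5
print(leq(x, y))               # y is x shifted right by 1
print(leq(x, z), leq(z, x))    # z is wider than x on both sides
```

In this sketch `x` and `z` are incomparable, which is why the order is only partial: a "maximizing" policy in the fuzzy problem has to dominate every other policy at all $\alpha$-levels simultaneously.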
DOI: 10.14736/kyb-2022-2-0180
Classification: 90C40, 93C40
Keywords: Markov decision process; total reward; fuzzy reward; trapezoidal fuzzy number; optimal stopping problem; gambling model
@article{10_14736_kyb_2022_2_0180,
     author = {Carrero-Vera, Karla and Cruz-Su\'arez, Hugo and Montes-de-Oca, Ra\'ul},
     title = {Markov decision processes on finite spaces with fuzzy total rewards},
     journal = {Kybernetika},
     pages = {180--199},
     publisher = {mathdoc},
     volume = {58},
     number = {2},
     year = {2022},
     doi = {10.14736/kyb-2022-2-0180},
     mrnumber = {4467492},
     zbl = {07584152},
     language = {en},
     url = {http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-2-0180/}
}
Carrero-Vera, Karla; Cruz-Suárez, Hugo; Montes-de-Oca, Raúl. Markov decision processes on finite spaces with fuzzy total rewards. Kybernetika, Volume 58 (2022) no. 2, pp. 180-199. doi: 10.14736/kyb-2022-2-0180. http://geodesic.mathdoc.fr/articles/10.14736/kyb-2022-2-0180/
