An optimality system for finite average Markov decision chains under risk-aversion

Alanís-Durán, Alfredo; Cavazos-Cadena, Rolando

Alanís-Durán, Alfredo ; Cavazos-Cadena, Rolando

Kybernetika, Tome 48 (2012) no. 1, pp. 83-104

Voir la notice de l'article provenant de la source Czech Digital Mathematics Library

Résumé

This work concerns controlled Markov chains with finite state space and compact action sets. The decision maker is risk-averse with constant risk-sensitivity, and the performance of a control policy is measured by the long-run average cost criterion. Under standard continuity-compactness conditions, it is shown that the (possibly non-constant) optimal value function is characterized by a system of optimality equations which allows to obtain an optimal stationary policy. Also, it is shown that the optimal superior and inferior limit average cost functions coincide.

MR Zbl

Classification : 60J05, 93C55, 93E20
Keywords: partition of the state space; nonconstant optimal average cost; discounted approximations to the risk-sensitive average cost criterion; equality of superior and inferior limit risk-averse average criteria

@article{KYB_2012__48_1_a4,
     author = {Alan{\'\i}s-Dur\'an, Alfredo and Cavazos-Cadena, Rolando},
     title = {An optimality system for finite average {Markov} decision chains under risk-aversion},
     journal = {Kybernetika},
     pages = {83--104},
     publisher = {mathdoc},
     volume = {48},
     number = {1},
     year = {2012},
     mrnumber = {2932929},
     zbl = {1243.93127},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/KYB_2012__48_1_a4/}
}

TY  - JOUR
AU  - Alanís-Durán, Alfredo
AU  - Cavazos-Cadena, Rolando
TI  - An optimality system for finite average Markov decision chains under risk-aversion
JO  - Kybernetika
PY  - 2012
SP  - 83
EP  - 104
VL  - 48
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/KYB_2012__48_1_a4/
LA  - en
ID  - KYB_2012__48_1_a4
ER  -

%0 Journal Article
%A Alanís-Durán, Alfredo
%A Cavazos-Cadena, Rolando
%T An optimality system for finite average Markov decision chains under risk-aversion
%J Kybernetika
%D 2012
%P 83-104
%V 48
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/KYB_2012__48_1_a4/
%G en
%F KYB_2012__48_1_a4

Alanís-Durán, Alfredo; Cavazos-Cadena, Rolando. An optimality system for finite average Markov decision chains under risk-aversion. Kybernetika, Tome 48 (2012) no. 1, pp. 83-104. http://geodesic.mathdoc.fr/item/KYB_2012__48_1_a4/

Parcourir par

Geodesic

Parcourir par