An optimality system for finite average Markov decision chains under risk-aversion
Kybernetika, Tome 48 (2012) no. 1, pp. 83-104.

Voir la notice de l'article provenant de la source Czech Digital Mathematics Library

This work concerns controlled Markov chains with finite state space and compact action sets. The decision maker is risk-averse with constant risk-sensitivity, and the performance of a control policy is measured by the long-run average cost criterion. Under standard continuity-compactness conditions, it is shown that the (possibly non-constant) optimal value function is characterized by a system of optimality equations which allows to obtain an optimal stationary policy. Also, it is shown that the optimal superior and inferior limit average cost functions coincide.
Classification : 60J05, 93C55, 93E20
Keywords: partition of the state space; nonconstant optimal average cost; discounted approximations to the risk-sensitive average cost criterion; equality of superior and inferior limit risk-averse average criteria
@article{KYB_2012__48_1_a4,
     author = {Alan{\'\i}s-Dur\'an, Alfredo and Cavazos-Cadena, Rolando},
     title = {An optimality system for finite average {Markov} decision chains under risk-aversion},
     journal = {Kybernetika},
     pages = {83--104},
     publisher = {mathdoc},
     volume = {48},
     number = {1},
     year = {2012},
     mrnumber = {2932929},
     zbl = {1243.93127},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/KYB_2012__48_1_a4/}
}
TY  - JOUR
AU  - Alanís-Durán, Alfredo
AU  - Cavazos-Cadena, Rolando
TI  - An optimality system for finite average Markov decision chains under risk-aversion
JO  - Kybernetika
PY  - 2012
SP  - 83
EP  - 104
VL  - 48
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/KYB_2012__48_1_a4/
LA  - en
ID  - KYB_2012__48_1_a4
ER  - 
%0 Journal Article
%A Alanís-Durán, Alfredo
%A Cavazos-Cadena, Rolando
%T An optimality system for finite average Markov decision chains under risk-aversion
%J Kybernetika
%D 2012
%P 83-104
%V 48
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/KYB_2012__48_1_a4/
%G en
%F KYB_2012__48_1_a4
Alanís-Durán, Alfredo; Cavazos-Cadena, Rolando. An optimality system for finite average Markov decision chains under risk-aversion. Kybernetika, Tome 48 (2012) no. 1, pp. 83-104. http://geodesic.mathdoc.fr/item/KYB_2012__48_1_a4/