Stationary optimal policies in a class
of multichain positive dynamic programs with
finite state space and risk-sensitive criterion
Applicationes Mathematicae, Tome 28 (2001) no. 1, pp. 93-109
Cet article a éte moissonné depuis la source Institute of Mathematics Polish Academy of Sciences
This work concerns Markov decision processes with finite
state space and compact action sets. The decision maker is
supposed to have a constant-risk sensitivity coefficient, and
a control policy is graded via the risk-sensitive expected
total-reward criterion associated with nonnegative one-step
rewards. Assuming that the optimal value function is finite,
under mild continuity and compactness restrictions the following
result is established: If the number of ergodic classes when a
stationary policy is used to drive the system depends
continuously on the policy employed, then there exists an
optimal stationary policy, extending results obtained by
Schal (1984) for risk-neutral dynamic programming. We use results recently
established for unichain systems, and
analyze the general multichain case via a reduction to a model
with the unichain property.
Keywords:
work concerns markov decision processes finite state space compact action sets decision maker supposed have constant risk sensitivity coefficient control policy graded via risk sensitive expected total reward criterion associated nonnegative one step rewards assuming optimal value function finite under mild continuity compactness restrictions following result established number ergodic classes stationary policy drive system depends continuously policy employed there exists optimal stationary policy extending results obtained schal risk neutral dynamic programming results recently established unichain systems analyze general multichain via reduction model unichain property
Affiliations des auteurs :
Rolando Cavazos-Cadena 1 ; Raul Montes-de-Oca 2
@article{10_4064_am28_1_7,
author = {Rolando Cavazos-Cadena and Raul Montes-de-Oca},
title = {Stationary optimal policies in a class
of multichain positive dynamic programs with
finite state space and risk-sensitive criterion},
journal = {Applicationes Mathematicae},
pages = {93--109},
year = {2001},
volume = {28},
number = {1},
doi = {10.4064/am28-1-7},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.4064/am28-1-7/}
}
TY - JOUR AU - Rolando Cavazos-Cadena AU - Raul Montes-de-Oca TI - Stationary optimal policies in a class of multichain positive dynamic programs with finite state space and risk-sensitive criterion JO - Applicationes Mathematicae PY - 2001 SP - 93 EP - 109 VL - 28 IS - 1 UR - http://geodesic.mathdoc.fr/articles/10.4064/am28-1-7/ DO - 10.4064/am28-1-7 LA - en ID - 10_4064_am28_1_7 ER -
%0 Journal Article %A Rolando Cavazos-Cadena %A Raul Montes-de-Oca %T Stationary optimal policies in a class of multichain positive dynamic programs with finite state space and risk-sensitive criterion %J Applicationes Mathematicae %D 2001 %P 93-109 %V 28 %N 1 %U http://geodesic.mathdoc.fr/articles/10.4064/am28-1-7/ %R 10.4064/am28-1-7 %G en %F 10_4064_am28_1_7
Rolando Cavazos-Cadena; Raul Montes-de-Oca. Stationary optimal policies in a class of multichain positive dynamic programs with finite state space and risk-sensitive criterion. Applicationes Mathematicae, Tome 28 (2001) no. 1, pp. 93-109. doi: 10.4064/am28-1-7
Cité par Sources :