On a class of policies in general Markov decision models
Teoriâ veroâtnostej i ee primeneniâ, Volume 18 (1973) no. 4, pp. 815-817

See the article record from the source Math-Net.Ru

The paper studies stationary policies which, for a suitable final reward, are optimal on every time interval $[0, n]$ and yield a total gain that depends linearly on $n$. Necessary and sufficient conditions for the existence of such policies are given in the form of equations (4), (5). These equations have previously appeared, in various special cases, as sufficient conditions for optimality under the average-reward-per-unit-time criterion.
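As an illustration of what conditions of this kind typically look like, here is a minimal sketch in the standard notation of average-reward Markov decision models. The symbols $r$ (one-step reward), $p$ (transition kernel), $g$, $h$, $\varphi$ and $V_n$ are assumptions introduced here. This record does not reproduce equations (4), (5), so their exact form in the paper may differ, and measurability questions are ignored. Suppose bounded functions $g$, $h$ and a stationary policy $\varphi$ satisfy

$$
g(x) = \sup_{a} \int g(y)\, p(dy \mid x, a), \qquad
g(x) + h(x) = \sup_{a} \Big[ r(x, a) + \int h(y)\, p(dy \mid x, a) \Big],
$$

with both suprema attained at $a = \varphi(x)$. Let $V_n$ denote the optimal total gain on $[0, n]$ with final reward $h$, so that $V_0 = h$. If $V_n = n g + h$, then

$$
V_{n+1}(x) = \sup_{a} \Big[ r(x, a) + n \int g(y)\, p(dy \mid x, a) + \int h(y)\, p(dy \mid x, a) \Big] \le n\, g(x) + g(x) + h(x),
$$

and the action $a = \varphi(x)$ gives equality. By induction $V_n = n g + h$ for every $n$: under these assumptions the policy $\varphi$ is optimal on each interval $[0, n]$ and the total gain is linear in $n$.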
@article{TVP_1973_18_4_a11,
     author = {A. A. Yushkevich},
     title = {On a class of policies in general {Markov} decision models},
     journal = {Teori\^a vero\^atnostej i ee primeneni\^a},
     pages = {815--817},
     publisher = {mathdoc},
     volume = {18},
     number = {4},
     year = {1973},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/TVP_1973_18_4_a11/}
}
TY  - JOUR
AU  - A. A. Yushkevich
TI  - On a class of policies in general Markov decision models
JO  - Teoriâ veroâtnostej i ee primeneniâ
PY  - 1973
SP  - 815
EP  - 817
VL  - 18
IS  - 4
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/TVP_1973_18_4_a11/
LA  - ru
ID  - TVP_1973_18_4_a11
ER  - 
%0 Journal Article
%A A. A. Yushkevich
%T On a class of policies in general Markov decision models
%J Teoriâ veroâtnostej i ee primeneniâ
%D 1973
%P 815-817
%V 18
%N 4
%I mathdoc
%U http://geodesic.mathdoc.fr/item/TVP_1973_18_4_a11/
%G ru
%F TVP_1973_18_4_a11
A. A. Yushkevich. On a class of policies in general Markov decision models. Teoriâ veroâtnostej i ee primeneniâ, Volume 18 (1973) no. 4, pp. 815-817. http://geodesic.mathdoc.fr/item/TVP_1973_18_4_a11/