On a class of policies in general Markov decision models

A. A. Yushkevich

A. A. Yushkevich

Teoriâ veroâtnostej i ee primeneniâ, Tome 18 (1973) no. 4, pp. 815-817

Cet article a éte moissonné depuis la source Math-Net.Ru

Voir la notice de l'article

Résumé

The paper studies stationary policies which, under some final reward, become optimal on each time interval $[0, n]$ and provide a total gain linearly dependent on $n$. Necessary and sufficient conditions for the existence of such policies are given in the form of equations (4), (5). These equations appeared previously in various cases as sufficient optimality conditions for the average-per-unit-time criterion.

Export
Comment citer

@article{TVP_1973_18_4_a11,
     author = {A. A. Yushkevich},
     title = {On a class of policies in general {Markov} decision models},
     journal = {Teori\^a vero\^atnostej i ee primeneni\^a},
     pages = {815--817},
     year = {1973},
     volume = {18},
     number = {4},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/TVP_1973_18_4_a11/}
}

TY  - JOUR
AU  - A. A. Yushkevich
TI  - On a class of policies in general Markov decision models
JO  - Teoriâ veroâtnostej i ee primeneniâ
PY  - 1973
SP  - 815
EP  - 817
VL  - 18
IS  - 4
UR  - http://geodesic.mathdoc.fr/item/TVP_1973_18_4_a11/
LA  - ru
ID  - TVP_1973_18_4_a11
ER  -

%0 Journal Article
%A A. A. Yushkevich
%T On a class of policies in general Markov decision models
%J Teoriâ veroâtnostej i ee primeneniâ
%D 1973
%P 815-817
%V 18
%N 4
%U http://geodesic.mathdoc.fr/item/TVP_1973_18_4_a11/
%G ru
%F TVP_1973_18_4_a11

A. A. Yushkevich. On a class of policies in general Markov decision models. Teoriâ veroâtnostej i ee primeneniâ, Tome 18 (1973) no. 4, pp. 815-817. http://geodesic.mathdoc.fr/item/TVP_1973_18_4_a11/

Parcourir par

Geodesic

Parcourir par