Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion
Kybernetika, Tome 34 (1998) no. 2, p. [217]
Voir la notice de l'article provenant de la source Czech Digital Mathematics Library
We study the adaptive control problem for discrete-time Markov control processes with Borel state and action spaces and possibly unbounded one-stage costs. The processes are given by recurrent equations $x_{t+1}=F(x_t,a_t,\xi _t),\,\,t=0,1,\ldots $ with i.i.d. $\Re ^k$-valued random vectors $\xi _t$ whose density $\rho $ is unknown. Assuming observability of $\xi _t$ we propose the procedure of statistical estimation of $\rho $ that allows us to prove discounted asymptotic optimality of two types of adaptive policies used early for the processes with bounded costs.
Classification :
60J05, 62M05, 93C40, 93E35
Keywords: Markov control process; unbounded costs; discounted asymptotic optimality; density estimator; rate of convergence
Keywords: Markov control process; unbounded costs; discounted asymptotic optimality; density estimator; rate of convergence
@article{KYB_1998__34_2_a8,
author = {Gordienko, Evgueni I. and Minj\'arez-Sosa, J. Adolfo},
title = {Adaptive control for discrete-time {Markov} processes with unbounded costs: {Discounted} criterion},
journal = {Kybernetika},
pages = {[217]},
publisher = {mathdoc},
volume = {34},
number = {2},
year = {1998},
mrnumber = {1621512},
zbl = {1274.90474},
language = {en},
url = {http://geodesic.mathdoc.fr/item/KYB_1998__34_2_a8/}
}
TY - JOUR AU - Gordienko, Evgueni I. AU - Minjárez-Sosa, J. Adolfo TI - Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion JO - Kybernetika PY - 1998 SP - [217] VL - 34 IS - 2 PB - mathdoc UR - http://geodesic.mathdoc.fr/item/KYB_1998__34_2_a8/ LA - en ID - KYB_1998__34_2_a8 ER -
%0 Journal Article %A Gordienko, Evgueni I. %A Minjárez-Sosa, J. Adolfo %T Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion %J Kybernetika %D 1998 %P [217] %V 34 %N 2 %I mathdoc %U http://geodesic.mathdoc.fr/item/KYB_1998__34_2_a8/ %G en %F KYB_1998__34_2_a8
Gordienko, Evgueni I.; Minjárez-Sosa, J. Adolfo. Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion. Kybernetika, Tome 34 (1998) no. 2, p. [217]. http://geodesic.mathdoc.fr/item/KYB_1998__34_2_a8/