Adaptive control of discrete time Markov processes by the large deviations method
Applicationes Mathematicae, Tome 27 (2000) no. 3, pp. 265-285
Voir la notice de l'article provenant de la source Institute of Mathematics Polish Academy of Sciences
Some discrete time controlled Markov processes in a locally compact metric space whose transition operators depend on an unknown parameter are described. The adaptive controls are constructed using the large deviations of empirical distributions which are uniform in the parameter that takes values in a compact set. The adaptive procedure uses a finite family of continuous, almost optimal controls. Using the large deviations property it is shown that an adaptive control which is a fixed almost optimal control after a finite time is almost optimal with probability nearly 1.
DOI :
10.4064/am-27-3-265-285
Keywords:
discrete time controlled Markov processes, adaptive control, large deviations
Affiliations des auteurs :
T. Duncan 1 ; B. Pasik-Duncan 1 ; Łukasz Stettner 1
@article{10_4064_am_27_3_265_285,
author = {T. Duncan and B. Pasik-Duncan and {\L}ukasz Stettner},
title = {Adaptive control of discrete time {Markov} processes by the large deviations method},
journal = {Applicationes Mathematicae},
pages = {265--285},
publisher = {mathdoc},
volume = {27},
number = {3},
year = {2000},
doi = {10.4064/am-27-3-265-285},
zbl = {1006.93071},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.4064/am-27-3-265-285/}
}
TY - JOUR AU - T. Duncan AU - B. Pasik-Duncan AU - Łukasz Stettner TI - Adaptive control of discrete time Markov processes by the large deviations method JO - Applicationes Mathematicae PY - 2000 SP - 265 EP - 285 VL - 27 IS - 3 PB - mathdoc UR - http://geodesic.mathdoc.fr/articles/10.4064/am-27-3-265-285/ DO - 10.4064/am-27-3-265-285 LA - en ID - 10_4064_am_27_3_265_285 ER -
%0 Journal Article %A T. Duncan %A B. Pasik-Duncan %A Łukasz Stettner %T Adaptive control of discrete time Markov processes by the large deviations method %J Applicationes Mathematicae %D 2000 %P 265-285 %V 27 %N 3 %I mathdoc %U http://geodesic.mathdoc.fr/articles/10.4064/am-27-3-265-285/ %R 10.4064/am-27-3-265-285 %G en %F 10_4064_am_27_3_265_285
T. Duncan; B. Pasik-Duncan; Łukasz Stettner. Adaptive control of discrete time Markov processes by the large deviations method. Applicationes Mathematicae, Tome 27 (2000) no. 3, pp. 265-285. doi: 10.4064/am-27-3-265-285
Cité par Sources :