Recursive self-tuning control of finite Markov chains
Applicationes Mathematicae, Tome 24 (1997) no. 2, pp. 169-188
Cet article a éte moissonné depuis la source Institute of Mathematics Polish Academy of Sciences
A recursive self-tuning control scheme for finite Markov chains is proposed wherein the unknown parameter is estimated by a stochastic approximation scheme for maximizing the log-likelihood function and the control is obtained via a relative value iteration algorithm. The analysis uses the asymptotic o.d.e.s associated with these.
DOI :
10.4064/am-24-2-169-188
Keywords:
controlled Markov chains, stochastic approximation, relative value iteration, self-tuning control, adaptive control
Affiliations des auteurs :
Vivek Borkar 1
@article{10_4064_am_24_2_169_188,
author = {Vivek Borkar},
title = {Recursive self-tuning control of finite {Markov} chains},
journal = {Applicationes Mathematicae},
pages = {169--188},
year = {1997},
volume = {24},
number = {2},
doi = {10.4064/am-24-2-169-188},
zbl = {0951.93537},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.4064/am-24-2-169-188/}
}
Vivek Borkar. Recursive self-tuning control of finite Markov chains. Applicationes Mathematicae, Tome 24 (1997) no. 2, pp. 169-188. doi: 10.4064/am-24-2-169-188
Cité par Sources :