Adaptive control of diffusion processes with a discounted reward criterion
Applicationes Mathematicae, Tome 47 (2020) no. 2, pp. 225-253
Cet article a éte moissonné depuis la source Institute of Mathematics Polish Academy of Sciences
The optimal control problem we are dealing with in this paper is to determine control policies that maximize a discounted reward criterion when the dynamic system evolves as a stochastic differential equation (SDE). Both the instantaneous reward function and the SDE’s drift coefficient may depend on an unknown parameter. We give conditions ensuring the existence of an asymptotically optimal policy using the so-called Principle of Estimation and Control. We illustrate our results with several examples.
Keywords:
optimal control problem dealing paper determine control policies maximize discounted reward criterion dynamic system evolves stochastic differential equation sde instantaneous reward function sde drift coefficient may depend unknown parameter conditions ensuring existence asymptotically optimal policy using so called principle estimation control illustrate results several examples
Affiliations des auteurs :
B. A. Escobedo-Trujillo 1 ; O. Hernández-Lerma 2 ; F. A. Alaffita-Hernández 3
@article{10_4064_am2421_10_2020,
author = {B. A. Escobedo-Trujillo and O. Hern\'andez-Lerma and F. A. Alaffita-Hern\'andez},
title = {Adaptive control of diffusion processes with a discounted reward criterion},
journal = {Applicationes Mathematicae},
pages = {225--253},
year = {2020},
volume = {47},
number = {2},
doi = {10.4064/am2421-10-2020},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.4064/am2421-10-2020/}
}
TY - JOUR AU - B. A. Escobedo-Trujillo AU - O. Hernández-Lerma AU - F. A. Alaffita-Hernández TI - Adaptive control of diffusion processes with a discounted reward criterion JO - Applicationes Mathematicae PY - 2020 SP - 225 EP - 253 VL - 47 IS - 2 UR - http://geodesic.mathdoc.fr/articles/10.4064/am2421-10-2020/ DO - 10.4064/am2421-10-2020 LA - en ID - 10_4064_am2421_10_2020 ER -
%0 Journal Article %A B. A. Escobedo-Trujillo %A O. Hernández-Lerma %A F. A. Alaffita-Hernández %T Adaptive control of diffusion processes with a discounted reward criterion %J Applicationes Mathematicae %D 2020 %P 225-253 %V 47 %N 2 %U http://geodesic.mathdoc.fr/articles/10.4064/am2421-10-2020/ %R 10.4064/am2421-10-2020 %G en %F 10_4064_am2421_10_2020
B. A. Escobedo-Trujillo; O. Hernández-Lerma; F. A. Alaffita-Hernández. Adaptive control of diffusion processes with a discounted reward criterion. Applicationes Mathematicae, Tome 47 (2020) no. 2, pp. 225-253. doi: 10.4064/am2421-10-2020
Cité par Sources :