Deterministic optimal policies for
Markov control processes
with pathwise constraints
Applicationes Mathematicae, Tome 39 (2012) no. 2, pp. 185-209
Cet article a éte moissonné depuis la source Institute of Mathematics Polish Academy of Sciences
This paper deals with discrete-time Markov control processes in Borel spaces with unbounded rewards. Under suitable hypotheses, we show that a randomized stationary policy is optimal for a certain expected constrained problem (ECP) if and only if it is optimal for the corresponding pathwise constrained problem (pathwise CP). Moreover, we show that a certain parametric family of unconstrained optimality equations yields convergence properties that lead to an approximation scheme which allows us to obtain constrained optimal policies as the limit of unconstrained deterministic optimal policies. In addition, we give sufficient conditions for the existence of deterministic policies that solve these constrained problems.
Keywords:
paper deals discrete time markov control processes borel spaces unbounded rewards under suitable hypotheses randomized stationary policy optimal certain expected constrained problem ecp only optimal corresponding pathwise constrained problem pathwise moreover certain parametric family unconstrained optimality equations yields convergence properties lead approximation scheme which allows obtain constrained optimal policies limit unconstrained deterministic optimal policies addition sufficient conditions existence deterministic policies solve these constrained problems
Affiliations des auteurs :
Armando F. Mendoza-Pérez 1 ; Onésimo Hernández-Lerma 2
@article{10_4064_am39_2_6,
author = {Armando F. Mendoza-P\'erez and On\'esimo Hern\'andez-Lerma},
title = {Deterministic optimal policies for
{Markov} control processes
with pathwise constraints},
journal = {Applicationes Mathematicae},
pages = {185--209},
year = {2012},
volume = {39},
number = {2},
doi = {10.4064/am39-2-6},
language = {en},
url = {http://geodesic.mathdoc.fr/articles/10.4064/am39-2-6/}
}
TY - JOUR AU - Armando F. Mendoza-Pérez AU - Onésimo Hernández-Lerma TI - Deterministic optimal policies for Markov control processes with pathwise constraints JO - Applicationes Mathematicae PY - 2012 SP - 185 EP - 209 VL - 39 IS - 2 UR - http://geodesic.mathdoc.fr/articles/10.4064/am39-2-6/ DO - 10.4064/am39-2-6 LA - en ID - 10_4064_am39_2_6 ER -
%0 Journal Article %A Armando F. Mendoza-Pérez %A Onésimo Hernández-Lerma %T Deterministic optimal policies for Markov control processes with pathwise constraints %J Applicationes Mathematicae %D 2012 %P 185-209 %V 39 %N 2 %U http://geodesic.mathdoc.fr/articles/10.4064/am39-2-6/ %R 10.4064/am39-2-6 %G en %F 10_4064_am39_2_6
Armando F. Mendoza-Pérez; Onésimo Hernández-Lerma. Deterministic optimal policies for Markov control processes with pathwise constraints. Applicationes Mathematicae, Tome 39 (2012) no. 2, pp. 185-209. doi: 10.4064/am39-2-6
Cité par Sources :