Deterministic optimal policies for Markov control processes with pathwise constraints
Applicationes Mathematicae, Volume 39 (2012) no. 2, pp. 185-209.

See the article record provided by the source Institute of Mathematics Polish Academy of Sciences

This paper deals with discrete-time Markov control processes in Borel spaces with unbounded rewards. Under suitable hypotheses, we show that a randomized stationary policy is optimal for a certain expected constrained problem (ECP) if and only if it is optimal for the corresponding pathwise constrained problem (pathwise CP). Moreover, we show that a certain parametric family of unconstrained optimality equations yields convergence properties that lead to an approximation scheme which allows us to obtain constrained optimal policies as the limit of unconstrained deterministic optimal policies. In addition, we give sufficient conditions for the existence of deterministic policies that solve these constrained problems.
DOI : 10.4064/am39-2-6

Armando F. Mendoza-Pérez 1 ; Onésimo Hernández-Lerma 2

1 CEFyMAP-UNACH Cuarta Oriente Norte 1428 entre 13 y 14 norte C.P. 29040 Tuxtla Gutiérrez, Chiapas, México
2 Mathematics Department CINVESTAV-IPN A. Postal 14-740 México D.F. 07000, México
@article{10_4064_am39_2_6,
     author = {Armando F. Mendoza-P\'erez and On\'esimo Hern\'andez-Lerma},
     title = {Deterministic optimal policies for {Markov} control processes with pathwise constraints},
     journal = {Applicationes Mathematicae},
     pages = {185--209},
     publisher = {mathdoc},
     volume = {39},
     number = {2},
     year = {2012},
     doi = {10.4064/am39-2-6},
     language = {en},
     url = {http://geodesic.mathdoc.fr/articles/10.4064/am39-2-6/}
}
TY  - JOUR
AU  - Armando F. Mendoza-Pérez
AU  - Onésimo Hernández-Lerma
TI  - Deterministic optimal policies for Markov control processes with pathwise constraints
JO  - Applicationes Mathematicae
PY  - 2012
SP  - 185
EP  - 209
VL  - 39
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/articles/10.4064/am39-2-6/
DO  - 10.4064/am39-2-6
LA  - en
ID  - 10_4064_am39_2_6
ER  - 
%0 Journal Article
%A Armando F. Mendoza-Pérez
%A Onésimo Hernández-Lerma
%T Deterministic optimal policies for Markov control processes with pathwise constraints
%J Applicationes Mathematicae
%D 2012
%P 185-209
%V 39
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/articles/10.4064/am39-2-6/
%R 10.4064/am39-2-6
%G en
%F 10_4064_am39_2_6
Armando F. Mendoza-Pérez; Onésimo Hernández-Lerma. Deterministic optimal policies for Markov control processes with pathwise constraints. Applicationes Mathematicae, Volume 39 (2012) no. 2, pp. 185-209. doi: 10.4064/am39-2-6. http://geodesic.mathdoc.fr/articles/10.4064/am39-2-6/
