Existence of average optimal policies in Markov control processes with strictly unbounded costs

Hernández-Lerma, Onésimo

Kybernetika, Volume 29 (1993) no. 1, pp. 1-17. This article was harvested from the Czech Digital Mathematics Library.

Classification: 49L20, 90C40, 93E20

@article{KYB_1993_29_1_a0,
     author = {Hern\'andez-Lerma, On\'esimo},
     title = {Existence of average optimal policies in {Markov} control processes with strictly unbounded costs},
     journal = {Kybernetika},
     pages = {1--17},
     year = {1993},
     volume = {29},
     number = {1},
     mrnumber = {1227738},
     zbl = {0792.93120},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/KYB_1993_29_1_a0/}
}

TY  - JOUR
AU  - Hernández-Lerma, Onésimo
TI  - Existence of average optimal policies in Markov control processes with strictly unbounded costs
JO  - Kybernetika
PY  - 1993
SP  - 1
EP  - 17
VL  - 29
IS  - 1
UR  - http://geodesic.mathdoc.fr/item/KYB_1993_29_1_a0/
LA  - en
ID  - KYB_1993_29_1_a0
ER  -

%0 Journal Article
%A Hernández-Lerma, Onésimo
%T Existence of average optimal policies in Markov control processes with strictly unbounded costs
%J Kybernetika
%D 1993
%P 1-17
%V 29
%N 1
%U http://geodesic.mathdoc.fr/item/KYB_1993_29_1_a0/
%G en
%F KYB_1993_29_1_a0

Hernández-Lerma, Onésimo. Existence of average optimal policies in Markov control processes with strictly unbounded costs. Kybernetika, Volume 29 (1993) no. 1, pp. 1-17. http://geodesic.mathdoc.fr/item/KYB_1993_29_1_a0/

References

[1] D. P. Bertsekas: Dynamic Programming: Deterministic and Stochastic Models. Prentice-Hall, Englewood Cliffs, N. J. 1987.

[2] D. P. Bertsekas, S. E. Shreve: Stochastic Optimal Control: The Discrete Time Case. Academic Press, New York 1978.

[3] P. Billingsley: Convergence of Probability Measures. Wiley, New York 1968.

[4] D. Blackwell: Memoryless strategies in finite-stage dynamic programming. Ann. Math. Statist. 35 (1964), 863-865.

[5] D. Blackwell: Discounted dynamic programming. Ann. Math. Statist. 36 (1965), 226-235.

[6] V. S. Borkar: Control of Markov chains with long-run average cost criterion: the dynamic programming equations. SIAM J. Control Optim. 27 (1989), 642-657.

[7] R. Cavazos-Cadena: Solution to the optimality equation in a class of average Markov decision chains with unbounded costs. Kybernetika 27 (1991), 23-37.

[8] J. Diebolt, D. Guegan: Probabilistic properties of the general nonlinear markovian process of order one and applications to time series modelling. Rapport Technique No. 125, Laboratoire de Statistique Theorique et Appliquee, CNR-URA 1321, Universite Paris VI, 1990.

[9] J. L. Doob: Stochastic Processes. Wiley, New York 1953.

[10] M. Duflo: Methodes Recursives Aleatoires. Masson, Paris 1990.

[11] E. B. Dynkin, A. A. Yushkevich: Controlled Markov Processes. Springer-Verlag, Berlin 1979.

[12] R. Hartley: Dynamic programming and an undiscounted, infinite horizon, convex stochastic control problem. In: Recent Developments in Markov Decision Processes (R. Hartley, L. C. Thomas and D. J. White, eds.). Academic Press, London 1980, pp. 277-300.

[13] O. Hernandez-Lerma: Lyapunov criteria for stability of differential equations with Markov parameters. Boletin Soc. Mat. Mexicana 24 (1979), 27-48.

[14] O. Hernandez-Lerma: Adaptive Markov Control Processes. Springer-Verlag, New York 1989.

[15] O. Hernandez-Lerma: Average optimality in dynamic programming on Borel spaces - unbounded costs and controls. Syst. Control Lett. 17 (1991), 237-242.

[16] O. Hernandez-Lerma, J. B. Lasserre: Average cost optimal policies for Markov control processes with Borel state space and unbounded costs. Syst. Control Lett. 15 (1990), 349-356.

[17] O. Hernandez-Lerma, J. B. Lasserre: Linear programming and average optimality of Markov control processes on Borel spaces - unbounded costs. Rapport LAAS, LAAS-CNRS, Toulouse 1992. To appear in SIAM J. Control Optim.

[18] O. Hernandez-Lerma, R. Montes de Oca, R. Cavazos-Cadena: Recurrence conditions for Markov decision processes with Borel state space: a survey. Ann. Oper. Res. 28 (1991), 29-46.

[19] K. Hinderer: Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter. Springer-Verlag, Berlin 1970.

[20] M. Yu. Kitayev: Semi-Markov and jump Markov control models: average cost criterion. Theory Probab. Appl. 30 (1985), 272-288.

[21] M. Kurano: The existence of a minimum pair of state and policy for Markov decision processes under the hypothesis of Doeblin. SIAM J. Control Optim. 27 (1989), 296-307.

[22] H. J. Kushner: Introduction to Stochastic Control. Holt, Rinehart and Winston, New York 1971.

[23] A. Leizarowitz: Optimal controls for diffusions in $R^n$. J. Math. Anal. Appl. 149 (1990), 180-209.

[24] S. P. Meyn: Ergodic theorems for discrete time stochastic systems using a stochastic Lyapunov function. SIAM J. Control Optim. 27 (1989), 1409-1439.

[25] A. Mokkadem: Sur un modele autoregressif nonlineaire. Ergodicite et ergodicite geometrique. J. Time Series Anal. 8 (1987), 195-205.

[26] D. Revuz: Markov Chains. Second edition. North-Holland, Amsterdam 1984.

[27] U. Rieder: Measurable selection theorems for optimization problems. Manuscripta Math. 24 (1978), 507-518.

[28] V. I. Rotar, T. A. Konyuhova: Two papers on asymptotic optimality in probability and almost surely. Preprint, Central Economic Mathematical Institute (CEMI), Moscow 1991.

[29] R. H. Stockbridge: Time-average control of martingale problems: a linear programming formulation. Ann. Probab. 18 (1990), 206-217.

[30] J. Wijngaard: Existence of average optimal strategies in markovian decision problems with strictly unbounded costs. In: Dynamic Programming and Its Applications (M. L. Puterman, ed.), Academic Press, New York 1978, pp. 369-386.

[31] K. Yosida: Functional Analysis. Fifth edition. Springer-Verlag, Berlin 1978.
