%0 Journal Article
%A Rolando Cavazos-Cadena
%A Raúl Montes-de-Oca
%T Estimation and control
 in finite Markov decision processes
 with the average reward criterion
%J Applicationes Mathematicae
%D 2004
%P 127-154
%V 31
%N 2
%U http://geodesic.mathdoc.fr/articles/10.4064/am31-2-1/
%R 10.4064/am31-2-1
%G en
%F 10_4064_am31_2_1