%0 Journal Article
%A Rolando Cavazos-Cadena
%A Raúl Montes-de-Oca
%T Estimation and control in finite Markov decision processes with the average reward criterion
%J Applicationes Mathematicae
%D 2004
%P 127-154
%V 31
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/articles/10.4064/am31-2-1/
%R 10.4064/am31-2-1
%G en
%F 10_4064_am31_2_1