TY - JOUR AU - M. Goessel AU - V. G. Sragovich TI - Adaptive control of Markov chains with rewards JO - Doklady Akademii Nauk PY - 1980 SP - 523 EP - 527 VL - 254 IS - 3 PB - mathdoc UR - http://geodesic.mathdoc.fr/item/DAN_1980_254_3_a1/ LA - ru ID - DAN_1980_254_3_a1 ER -