%0 Journal Article
%A D. B. Rokhlin
%T $Q$-learning in a stochastic Stackelberg game between an uninformed leader and a naive follower
%J Teoriâ veroâtnostej i ee primeneniâ
%D 2019
%P 53-74
%V 64
%N 1
%U http://geodesic.mathdoc.fr/item/TVP_2019_64_1_a3/
%G ru
%F TVP_2019_64_1_a3