%0 Journal Article
%A D. B. Rokhlin
%T $Q$-learning in a stochastic Stackelberg game between an uninformed leader and a naive follower
%J Teoriâ veroâtnostej i ee primeneniâ
%D 2019
%P 53-74
%V 64
%N 1
%U http://geodesic.mathdoc.fr/item/TVP_2019_64_1_a3/
%G ru
%F TVP_2019_64_1_a3