TY - JOUR AU - M. G. Gorodnichev TI - On the application of reinforcement learning in the task of choosing the optimal trajectory JO - News of the Kabardin-Balkar scientific center of RAS PY - 2025 SP - 86 EP - 102 VL - 27 IS - 2 PB - mathdoc UR - http://geodesic.mathdoc.fr/item/IZKAB_2025_27_2_a5/ LA - ru ID - IZKAB_2025_27_2_a5 ER -