TY  - JOUR
AU  - Gorodnichev, M. G.
TI  - On the application of reinforcement learning in the task of choosing the optimal trajectory
JO  - News of the Kabardin-Balkar scientific center of RAS
PY  - 2025
SP  - 86
EP  - 102
VL  - 27
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/IZKAB_2025_27_2_a5/
LA  - ru
ID  - IZKAB_2025_27_2_a5
ER  -