Decision-making model under presence of experts as a modified multi-armed bandit problem
Matematičeskaâ teoriâ igr i eë priloženiâ, Volume 9 (2017) no. 4, pp. 69-87

See the article record from the source Math-Net.Ru

This paper formulates a modified multi-armed bandit problem in which the player may use so-called expert hints when making decisions. The player in this problem is understood to be an automated system that follows a certain strategy (algorithm) for making decisions under uncertainty. The approach is developed for the case of $m$ experts. A modification of the well-known UCB1 algorithm is proposed to solve the multi-armed bandit problem. Results of a numerical experiment are presented to show the influence of expert hints on the player's payoff.
Keywords: multi-armed bandit problem, decision making, optimization methods, machine learning algorithms.
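
For orientation, the baseline that the abstract refers to can be sketched as follows. This is a minimal sketch of the standard UCB1 rule only (play each arm once, then pick the arm maximizing the empirical mean plus $\sqrt{2 \ln t / n_j}$); it is not the modification proposed in the paper, since the abstract does not describe how expert hints enter the index. The names `ucb1` and `bernoulli`, the Bernoulli reward model, and the arm probabilities are assumptions made for illustration.

```python
import math
import random


def ucb1(reward_fns, horizon, seed=0):
    """Run standard UCB1 for `horizon` rounds; return (counts, total_reward).

    `reward_fns` is a list of callables, one per arm, each taking a
    random.Random instance and returning a reward in [0, 1].
    """
    rng = random.Random(seed)
    n_arms = len(reward_fns)
    counts = [0] * n_arms    # number of times each arm was played
    means = [0.0] * n_arms   # empirical mean reward of each arm
    total = 0.0

    for t in range(1, horizon + 1):
        if t <= n_arms:
            # Initialization: play every arm once.
            arm = t - 1
        else:
            # UCB1 index: empirical mean plus exploration bonus.
            arm = max(
                range(n_arms),
                key=lambda a: means[a] + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        reward = reward_fns[arm](rng)
        counts[arm] += 1
        # Incremental update of the empirical mean.
        means[arm] += (reward - means[arm]) / counts[arm]
        total += reward

    return counts, total


def bernoulli(p):
    """Hypothetical arm: pays 1 with probability p, else 0."""
    return lambda rng: 1.0 if rng.random() < p else 0.0


# Illustrative arms; the probabilities are made up for this sketch.
arms = [bernoulli(0.2), bernoulli(0.5), bernoulli(0.8)]
counts, total = ucb1(arms, horizon=2000)
```

In a setting with hints, one would expect the expert information to change either the arm-selection step or the index itself; how the paper does this is not stated in the abstract.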
@article{MGTA_2017_9_4_a4,
     author = {Dmitriy S. Smirnov and Ekaterina V. Gromova},
     title = {Decision-making model under presence of experts as a modified multi-armed bandit problem},
     journal = {Matemati\v{c}eska\^a teori\^a igr i e\"e prilo\v{z}eni\^a},
     pages = {69--87},
     publisher = {mathdoc},
     volume = {9},
     number = {4},
     year = {2017},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/MGTA_2017_9_4_a4/}
}
Dmitriy S. Smirnov; Ekaterina V. Gromova. Decision-making model under presence of experts as a modified multi-armed bandit problem. Matematičeskaâ teoriâ igr i eë priloženiâ, Volume 9 (2017) no. 4, pp. 69-87. http://geodesic.mathdoc.fr/item/MGTA_2017_9_4_a4/