TY  - JOUR
AU  - Zhao, Dongfang
AU  - Liu, Jiafeng
AU  - Wu, Rui
AU  - Cheng, Dansong
AU  - Tang, Xianglong
TI  - An active exploration method for data efficient reinforcement learning
JO  - International Journal of Applied Mathematics and Computer Science
PY  - 2019
SP  - 351
EP  - 362
VL  - 29
IS  - 2
UR  - http://geodesic.mathdoc.fr/item/IJAMCS_2019_29_2_a10/
LA  - en
ID  - IJAMCS_2019_29_2_a10
ER  -