Comparing Reinforcement Learning Algorithms for a Trip Building Task: a Multi-objective Approach Using Non-Local Information
Computer Science and Information Systems, Tome 21 (2024) no. 1.

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

Using reinforcement learning (RL) to support agents in making decisions that consider more than one objective poses challenges. We formulate the problem of multiple agents learning how to travel from A to B as a reinforcement learning task modeled as a stochastic game, in which we take into account: (i) more than one objective, (ii) non-stationarity, (iii) communication of local and non-local information among the various actors. We use and compare RL algorithms, both for the single objective (Q-learning), as well as for multiple objectives (Pareto Q-learning), with and without non-local communication. We evaluate these methods in a scenario in which hundreds of agents have to learn how to travel from their origins to their destinations, aiming at minimizing their travel times, as well as the carbon monoxide vehicles emit. Results show that the use of non-local communication reduces both travel time and emissions.
Keywords: reinforcement learning, multi-agent systems, multi-objective reinforcement learning, route choice
@article{CSIS_2024_21_1_a17,
     author = {Henrique U. Gobbi and Guilherme Dytz dos Santos and Ana L. C. Bazzan},
     title = {Comparing {Reinforcement} {Learning} {Algorithms} for a {Trip} {Building} {Task:} a {Multi-objective} {Approach} {Using} {Non-Local} {Information}},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {21},
     number = {1},
     year = {2024},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2024_21_1_a17/}
}
TY  - JOUR
AU  - Henrique U. Gobbi
AU  - Guilherme Dytz dos Santos
AU  - Ana L. C. Bazzan
TI  - Comparing Reinforcement Learning Algorithms for a Trip Building Task: a Multi-objective Approach Using Non-Local Information
JO  - Computer Science and Information Systems
PY  - 2024
VL  - 21
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2024_21_1_a17/
ID  - CSIS_2024_21_1_a17
ER  - 
%0 Journal Article
%A Henrique U. Gobbi
%A Guilherme Dytz dos Santos
%A Ana L. C. Bazzan
%T Comparing Reinforcement Learning Algorithms for a Trip Building Task: a Multi-objective Approach Using Non-Local Information
%J Computer Science and Information Systems
%D 2024
%V 21
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2024_21_1_a17/
%F CSIS_2024_21_1_a17
Henrique U. Gobbi; Guilherme Dytz dos Santos; Ana L. C. Bazzan. Comparing Reinforcement Learning Algorithms for a Trip Building Task: a Multi-objective Approach Using Non-Local Information. Computer Science and Information Systems, Tome 21 (2024) no. 1. http://geodesic.mathdoc.fr/item/CSIS_2024_21_1_a17/