Impact of subsampling and tree depth on random forests
ESAIM: Probability and Statistics, Tome 22 (2018), pp. 96-128

Voir la notice de l'article provenant de la source Numdam

Random forests are ensemble learning methods introduced by Breiman [Mach. Learn. 45 (2001) 5–32] that operate by averaging several decision trees built on a randomly selected subspace of the data set. Despite their widespread use in practice, the respective roles of the different mechanisms at work in Breiman’s forests are not yet fully understood, neither is the tuning of the corresponding parameters. In this paper, we study the influence of two parameters, namely the subsampling rate and the tree depth, on Breiman’s forests performance. More precisely, we prove that quantile forests (a specific type of random forests) based on subsampling and quantile forests whose tree construction is terminated early have similar performances, as long as their respective parameters (subsampling rate and tree depth) are well chosen. Moreover, experiments show that a proper tuning of these parameters leads in most cases to an improvement of Breiman’s original forests in terms of mean squared error.

DOI : 10.1051/ps/2018008
Classification : 62G05, 62G20
Keywords: Random forests, randomization, parameter tuning, subsampling, tree depth

Duroux, Roxane 1 ; Scornet, Erwan 1

1
@article{PS_2018__22__96_0,
     author = {Duroux, Roxane and Scornet, Erwan},
     title = {Impact of subsampling and tree depth on random forests},
     journal = {ESAIM: Probability and Statistics},
     pages = {96--128},
     publisher = {EDP-Sciences},
     volume = {22},
     year = {2018},
     doi = {10.1051/ps/2018008},
     mrnumber = {3891755},
     zbl = {1409.62072},
     language = {en},
     url = {http://geodesic.mathdoc.fr/articles/10.1051/ps/2018008/}
}
TY  - JOUR
AU  - Duroux, Roxane
AU  - Scornet, Erwan
TI  - Impact of subsampling and tree depth on random forests
JO  - ESAIM: Probability and Statistics
PY  - 2018
SP  - 96
EP  - 128
VL  - 22
PB  - EDP-Sciences
UR  - http://geodesic.mathdoc.fr/articles/10.1051/ps/2018008/
DO  - 10.1051/ps/2018008
LA  - en
ID  - PS_2018__22__96_0
ER  - 
%0 Journal Article
%A Duroux, Roxane
%A Scornet, Erwan
%T Impact of subsampling and tree depth on random forests
%J ESAIM: Probability and Statistics
%D 2018
%P 96-128
%V 22
%I EDP-Sciences
%U http://geodesic.mathdoc.fr/articles/10.1051/ps/2018008/
%R 10.1051/ps/2018008
%G en
%F PS_2018__22__96_0
Duroux, Roxane; Scornet, Erwan. Impact of subsampling and tree depth on random forests. ESAIM: Probability and Statistics, Tome 22 (2018), pp. 96-128. doi: 10.1051/ps/2018008

Cité par Sources :