Improving feature selection process resistance to failures caused by curse-of-dimensionality effects
Kybernetika, Volume 47 (2011) no. 3, pp. 401-425.

See the article record from the source Czech Digital Mathematics Library

The purpose of feature selection in machine learning is at least two-fold: saving measurement acquisition costs and reducing the negative effects of the curse of dimensionality, with the aim of improving the accuracy of the models and the classification rate of classifiers with respect to previously unknown data. Yet it has been shown recently that the process of feature selection itself can be negatively affected by the very same curse of dimensionality: feature selection methods may easily over-fit or perform unstably. Such an outcome is unlikely to generalize well, and the resulting recognition system may fail to deliver the expected performance. In many tasks it is therefore crucial to employ additional mechanisms that make the feature selection process more stable and more resistant to curse-of-dimensionality effects. In this paper we discuss three different approaches to reducing this problem. We present an algorithmic extension applicable to various feature selection methods, capable of reducing excessive feature subset dependency not only on specific training data, but also on specific criterion function properties. Further, we discuss the concept of criteria ensembles, where various criteria vote on feature inclusion or removal, and go on to provide a general definition of feature selection hybridization aimed at combining the advantages of dependent and independent criteria. The presented ideas are illustrated through examples, and summarizing recommendations are given.
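The idea of criteria voting mentioned in the abstract can be illustrated with a minimal sketch. It is only one possible reading, not the authors' algorithm. It assumes each criterion ranks the candidate features and the candidate with the lowest summed rank wins. The two criteria, the toy data and all function names are hypothetical, and features are scored individually rather than as subsets to keep the example short.

```python
from statistics import mean, pvariance

def class_gap_score(column, labels):
    """Hypothetical criterion: absolute gap between the two class means."""
    a = [x for x, y in zip(column, labels) if y == 0]
    b = [x for x, y in zip(column, labels) if y == 1]
    return abs(mean(a) - mean(b))

def separation_ratio_score(column, labels):
    """Hypothetical criterion: class-mean gap divided by within-class spread."""
    a = [x for x, y in zip(column, labels) if y == 0]
    b = [x for x, y in zip(column, labels) if y == 1]
    within = pvariance(a) + pvariance(b)
    return abs(mean(a) - mean(b)) / (within + 1e-9)  # small constant avoids division by zero

def ensemble_vote(columns, labels, candidates, criteria):
    """Return the candidate with the lowest summed rank over all criteria."""
    rank_sum = {f: 0 for f in candidates}
    for criterion in criteria:
        # Position 0 is the candidate this criterion likes best.
        ordered = sorted(candidates,
                         key=lambda f: criterion(columns[f], labels),
                         reverse=True)
        for position, f in enumerate(ordered):
            rank_sum[f] += position
    # Ties in summed rank are broken by the smaller feature index.
    return min(candidates, key=lambda f: (rank_sum[f], f))

# Toy data (assumed for illustration): three features, four samples, two classes.
columns = {
    0: [0.0, 0.1, 1.0, 1.1],  # clear class gap, small within-class spread
    1: [0.0, 2.0, 1.5, 3.5],  # larger class gap, large within-class spread
    2: [0.5, 0.5, 0.5, 0.6],  # nearly constant
}
labels = [0, 0, 1, 1]

single = ensemble_vote(columns, labels, [0, 1, 2], [class_gap_score])
winner = ensemble_vote(columns, labels, [0, 1, 2],
                       [class_gap_score, separation_ratio_score])
print(single, winner)
```

On this toy data the class-gap criterion alone prefers feature 1, while the two-criterion vote prefers feature 0. That illustrates how a vote can reduce dependence on the properties of any single criterion.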
Classification: 62G05, 62H30, 68T10
Keywords: feature selection; curse of dimensionality; over-fitting; stability; machine learning; dimensionality reduction
@article{KYB_2011__47_3_a6,
     author = {Somol, Petr and Grim, Ji\v{r}{\'\i} and Novovi\v{c}ov\'a, Jana and Pudil, Pavel},
     title = {Improving feature selection process resistance to failures caused by curse-of-dimensionality effects},
     journal = {Kybernetika},
     pages = {401--425},
     publisher = {mathdoc},
     volume = {47},
     number = {3},
     year = {2011},
     zbl = {1218.62065},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/KYB_2011__47_3_a6/}
}
TY  - JOUR
AU  - Somol, Petr
AU  - Grim, Jiří
AU  - Novovičová, Jana
AU  - Pudil, Pavel
TI  - Improving feature selection process resistance to failures caused by curse-of-dimensionality effects
JO  - Kybernetika
PY  - 2011
SP  - 401
EP  - 425
VL  - 47
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/KYB_2011__47_3_a6/
LA  - en
ID  - KYB_2011__47_3_a6
ER  - 
%0 Journal Article
%A Somol, Petr
%A Grim, Jiří
%A Novovičová, Jana
%A Pudil, Pavel
%T Improving feature selection process resistance to failures caused by curse-of-dimensionality effects
%J Kybernetika
%D 2011
%P 401-425
%V 47
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/KYB_2011__47_3_a6/
%G en
%F KYB_2011__47_3_a6
Somol, Petr; Grim, Jiří; Novovičová, Jana; Pudil, Pavel. Improving feature selection process resistance to failures caused by curse-of-dimensionality effects. Kybernetika, Volume 47 (2011) no. 3, pp. 401-425. http://geodesic.mathdoc.fr/item/KYB_2011__47_3_a6/