On relevant features selection based on information theory
Teoriâ veroâtnostej i ee primeneniâ, Tome 68 (2023) no. 3, pp. 483-508
Voir la notice de l'article provenant de la source Math-Net.Ru
It is shown that widely used suboptimal algorithms of feature selection based
on information theory concepts do not necessarily identify a collection of
features (relevant in a sense) affecting the studied random response. This
can be considered as a reflection of the epistasis phenomenon known in
genetics, when individual features have little effect on increased risk
of complex disease, whereas certain combinations of features have
significant impact on risk. It is demonstrated that a similar effect is also
manifested in inferences employing statistical estimates of mutual
information.
Keywords:
feature selection, mutual information, sequential selection of features, epistasis effect.
Mots-clés : interaction information
Mots-clés : interaction information
@article{TVP_2023_68_3_a3,
author = {A. V. Bulinski},
title = {On relevant features selection based on information theory},
journal = {Teori\^a vero\^atnostej i ee primeneni\^a},
pages = {483--508},
publisher = {mathdoc},
volume = {68},
number = {3},
year = {2023},
language = {ru},
url = {http://geodesic.mathdoc.fr/item/TVP_2023_68_3_a3/}
}
A. V. Bulinski. On relevant features selection based on information theory. Teoriâ veroâtnostej i ee primeneniâ, Tome 68 (2023) no. 3, pp. 483-508. http://geodesic.mathdoc.fr/item/TVP_2023_68_3_a3/