Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution
Trudy Matematicheskogo Instituta imeni V.A. Steklova, Tome 79 (2024) no. 6, pp. 991-1015
Voir la notice de l'article provenant de la source Math-Net.Ru
This work introduces a method aimed at enhancing the reliability of the Bayesian classifier. The method involves augmenting the training dataset, which consists of a mixture of distributions from two original classes, with artificially generated observations from a third, ‘background’ class, uniformly distributed over a compact set that contains the unknown support of the original mixture.
This modification allows the value of the discriminant function outside the support of the training data distribution to approach a prescribed level (in this case, zero). Adding a decision option for ‘Refusal to Classify’, triggered when the discriminant function takes sufficiently small values, results in a localized increase in classifier reliability. Specifically, this approach addresses several issues: it enables the rejection of data that differs significantly from the training data; facilitates the detection of anomalies in input data; and avoids decision-making in ‘boundary’ regions when separating classes.
The paper provides a theoretical justification for the optimality of the proposed classifier. The practical utility of the method is demonstrated through classification tasks involving images and time series.
Additionally, a methodology for identifying trusted regions is proposed. This methodology can be used to detect anomalous data, cases of parameter shifts in class distributions, and areas of overlap between the distributions of the original classes. Based on these trusted regions, quantitative metrics for classifier reliability and efficiency are introduced.
Bibliography: 23 titles.
Keywords:
machine learning, Bayesian classifier, trusted machine learning, interpretability, out-of-distribution (OOD), time series classification, rejection of classification, background class.
Mots-clés : image classification
Mots-clés : image classification
@article{RM_2024_79_6_a3,
author = {K. S. Lukyanov and P. A. Yaskov and A. I. Perminov and A. P. Kovalenko and D. Y. Turdakov},
title = {Extrapolation of the {Bayesian} classifier with an unknown support of the two-class mixture distribution},
journal = {Trudy Matematicheskogo Instituta imeni V.A. Steklova},
pages = {991--1015},
publisher = {mathdoc},
volume = {79},
number = {6},
year = {2024},
language = {en},
url = {http://geodesic.mathdoc.fr/item/RM_2024_79_6_a3/}
}
TY - JOUR AU - K. S. Lukyanov AU - P. A. Yaskov AU - A. I. Perminov AU - A. P. Kovalenko AU - D. Y. Turdakov TI - Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution JO - Trudy Matematicheskogo Instituta imeni V.A. Steklova PY - 2024 SP - 991 EP - 1015 VL - 79 IS - 6 PB - mathdoc UR - http://geodesic.mathdoc.fr/item/RM_2024_79_6_a3/ LA - en ID - RM_2024_79_6_a3 ER -
%0 Journal Article %A K. S. Lukyanov %A P. A. Yaskov %A A. I. Perminov %A A. P. Kovalenko %A D. Y. Turdakov %T Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution %J Trudy Matematicheskogo Instituta imeni V.A. Steklova %D 2024 %P 991-1015 %V 79 %N 6 %I mathdoc %U http://geodesic.mathdoc.fr/item/RM_2024_79_6_a3/ %G en %F RM_2024_79_6_a3
K. S. Lukyanov; P. A. Yaskov; A. I. Perminov; A. P. Kovalenko; D. Y. Turdakov. Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution. Trudy Matematicheskogo Instituta imeni V.A. Steklova, Tome 79 (2024) no. 6, pp. 991-1015. http://geodesic.mathdoc.fr/item/RM_2024_79_6_a3/