Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution
Trudy Matematicheskogo Instituta imeni V.A. Steklova, Tome 79 (2024) no. 6, pp. 991-1015

Voir la notice de l'article provenant de la source Math-Net.Ru

This work introduces a method aimed at enhancing the reliability of the Bayesian classifier. The method involves augmenting the training dataset, which consists of a mixture of distributions from two original classes, with artificially generated observations from a third, ‘background’ class, uniformly distributed over a compact set that contains the unknown support of the original mixture. This modification allows the value of the discriminant function outside the support of the training data distribution to approach a prescribed level (in this case, zero). Adding a decision option for ‘Refusal to Classify’, triggered when the discriminant function takes sufficiently small values, results in a localized increase in classifier reliability. Specifically, this approach addresses several issues: it enables the rejection of data that differs significantly from the training data; facilitates the detection of anomalies in input data; and avoids decision-making in ‘boundary’ regions when separating classes. The paper provides a theoretical justification for the optimality of the proposed classifier. The practical utility of the method is demonstrated through classification tasks involving images and time series. Additionally, a methodology for identifying trusted regions is proposed. This methodology can be used to detect anomalous data, cases of parameter shifts in class distributions, and areas of overlap between the distributions of the original classes. Based on these trusted regions, quantitative metrics for classifier reliability and efficiency are introduced. Bibliography: 23 titles.
Keywords: machine learning, Bayesian classifier, trusted machine learning, interpretability, out-of-distribution (OOD), time series classification, rejection of classification, background class.
Mots-clés : image classification
@article{RM_2024_79_6_a3,
     author = {K. S. Lukyanov and P. A. Yaskov and A. I. Perminov and A. P. Kovalenko and D. Y. Turdakov},
     title = {Extrapolation of the {Bayesian} classifier with an unknown support of the two-class mixture distribution},
     journal = {Trudy Matematicheskogo Instituta imeni V.A. Steklova},
     pages = {991--1015},
     publisher = {mathdoc},
     volume = {79},
     number = {6},
     year = {2024},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/RM_2024_79_6_a3/}
}
TY  - JOUR
AU  - K. S. Lukyanov
AU  - P. A. Yaskov
AU  - A. I. Perminov
AU  - A. P. Kovalenko
AU  - D. Y. Turdakov
TI  - Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution
JO  - Trudy Matematicheskogo Instituta imeni V.A. Steklova
PY  - 2024
SP  - 991
EP  - 1015
VL  - 79
IS  - 6
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/RM_2024_79_6_a3/
LA  - en
ID  - RM_2024_79_6_a3
ER  - 
%0 Journal Article
%A K. S. Lukyanov
%A P. A. Yaskov
%A A. I. Perminov
%A A. P. Kovalenko
%A D. Y. Turdakov
%T Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution
%J Trudy Matematicheskogo Instituta imeni V.A. Steklova
%D 2024
%P 991-1015
%V 79
%N 6
%I mathdoc
%U http://geodesic.mathdoc.fr/item/RM_2024_79_6_a3/
%G en
%F RM_2024_79_6_a3
K. S. Lukyanov; P. A. Yaskov; A. I. Perminov; A. P. Kovalenko; D. Y. Turdakov. Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution. Trudy Matematicheskogo Instituta imeni V.A. Steklova, Tome 79 (2024) no. 6, pp. 991-1015. http://geodesic.mathdoc.fr/item/RM_2024_79_6_a3/