Exact algorithms of searching for the largest size cluster in two integer 2-clustering problems
Sibirskij žurnal vyčislitelʹnoj matematiki, Tome 22 (2019) no. 2, pp. 121-136

Voir la notice de l'article provenant de la source Math-Net.Ru

We consider two related discrete optimization problems of searching for a subset in a finite set of points in the Euclidean space. Both problems are induced by the versions of the fundamental problem in data analysis, namely, by selecting a subset of similar elements in a set of objects. In each problem, an input set and a positive real number are given, and it is required to find a cluster (i.e., a subset) of the largest size under constraints on the value of a quadratic clusterization function. The points in the input set which are outside the sought for subset are treated as the second (complementary) cluster. In the first problem, the function under the constraint is the sum over both clusters of the intracluster sums of the squared distances between the elements of the clusters and their centers. The center of the first (i.e., the sought) cluster is unknown and determined as the centroid, while the center of the second one is fixed at a given point in the Euclidean space (without loss of generality in the origin). In the second problem, the function under the constraint is the sum over both clusters of the weighted intracluster sums of the squared distances between the elements of the clusters and their centers. As in the first problem, the center of the first cluster is unknown and determined as the centroid, while the center of the second one is fixed in the origin. In this paper, we show that both problems are strongly NP-hard. Also, we present the exact algorithms for the cases of these problems in which the input points have integer components. If the space dimension is bounded by some constant, the algorithms are pseudopolynomial.
@article{SJVM_2019_22_2_a1,
     author = {A. V. Kel'manov and A. V. Panasenko and V. I. Khandeev},
     title = {Exact algorithms of searching for the largest size cluster in two integer 2-clustering problems},
     journal = {Sibirskij \v{z}urnal vy\v{c}islitelʹnoj matematiki},
     pages = {121--136},
     publisher = {mathdoc},
     volume = {22},
     number = {2},
     year = {2019},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/SJVM_2019_22_2_a1/}
}
TY  - JOUR
AU  - A. V. Kel'manov
AU  - A. V. Panasenko
AU  - V. I. Khandeev
TI  - Exact algorithms of searching for the largest size cluster in two integer 2-clustering problems
JO  - Sibirskij žurnal vyčislitelʹnoj matematiki
PY  - 2019
SP  - 121
EP  - 136
VL  - 22
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/SJVM_2019_22_2_a1/
LA  - ru
ID  - SJVM_2019_22_2_a1
ER  - 
%0 Journal Article
%A A. V. Kel'manov
%A A. V. Panasenko
%A V. I. Khandeev
%T Exact algorithms of searching for the largest size cluster in two integer 2-clustering problems
%J Sibirskij žurnal vyčislitelʹnoj matematiki
%D 2019
%P 121-136
%V 22
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/item/SJVM_2019_22_2_a1/
%G ru
%F SJVM_2019_22_2_a1
A. V. Kel'manov; A. V. Panasenko; V. I. Khandeev. Exact algorithms of searching for the largest size cluster in two integer 2-clustering problems. Sibirskij žurnal vyčislitelʹnoj matematiki, Tome 22 (2019) no. 2, pp. 121-136. http://geodesic.mathdoc.fr/item/SJVM_2019_22_2_a1/