Density-Based Clustering with Constraints
Computer Science and Information Systems, Tome 16 (2019) no. 2.

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

In this paper we present our ic-NBC and ic-DBSCAN algorithms for data clustering with constraints. The algorithms are based on density-based clustering algorithms NBC and DBSCAN but allow users to incorporate background knowledge into the process of clustering by means of instance constraints. The knowledge about anticipated groups can be applied by specifying the so-called must-link and cannot-link relationships between objects or points. These relationships are then incorporated into the clustering process. In the proposed algorithms this is achieved by properly merging resulting clusters and introducing a new notion of deferred points which are temporarily excluded from clustering and assigned to clusters based on their involvement in cannot-link relationships. To examine the algorithms, we have carried out a number of experiments. We used benchmark data sets and tested the efficiency and quality of the results. We have also measured the efficiency of the algorithms against their original versions. The experiments prove that the introduction of instance constraints improves the quality of both algorithms. The efficiency is only insignificantly reduced and is due to extra computation related to the introduced constraints.
Keywords: data mining, data clustering, semi-supervised clustering, clustering with constraints, instance-level constraints
@article{CSIS_2019_16_2_a7,
     author = {Piotr Lasek and Jarek Gryz},
     title = {Density-Based {Clustering} with {Constraints}},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {16},
     number = {2},
     year = {2019},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2019_16_2_a7/}
}
TY  - JOUR
AU  - Piotr Lasek
AU  - Jarek Gryz
TI  - Density-Based Clustering with Constraints
JO  - Computer Science and Information Systems
PY  - 2019
VL  - 16
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2019_16_2_a7/
ID  - CSIS_2019_16_2_a7
ER  - 
%0 Journal Article
%A Piotr Lasek
%A Jarek Gryz
%T Density-Based Clustering with Constraints
%J Computer Science and Information Systems
%D 2019
%V 16
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2019_16_2_a7/
%F CSIS_2019_16_2_a7
Piotr Lasek; Jarek Gryz. Density-Based Clustering with Constraints. Computer Science and Information Systems, Tome 16 (2019) no. 2. http://geodesic.mathdoc.fr/item/CSIS_2019_16_2_a7/