Graph clustering based on modularity variation estimations
The Bulletin of Irkutsk State University. Series Mathematics, Tome 25 (2018), pp. 63-78

Voir la notice de l'article provenant de la source Math-Net.Ru

Graph clustering is one of the constantly actual data analysis problems. There are various statements of this problem. In this paper we consider the statement of search for a partition of a graph vertices set into disjoint subsets. In such a way, that the density of connections between the vertices of one subset was higher than that between the vertices of different subsets. There is a lot of various approaches, and many of them use such as an a posteriori estimate of clustering quality, as modularity. The modularity functional, taking the value from $[-1, 1]$, allows us to formally evaluate the quality of partitioning into subsets. This paper deals with an approach, instead of calculating the modularity, using a less computationally complex estimate of modularity change in the operation of clusters union. Four theorems for different graph types are formulated, presenting the possibility of application of suggested estimate, instead of direct modularity calculations. A greedy algorithmic scheme and also AMVE (Algorithm based on Modularity Variation Estimation) — simple greedy algorithm based on the scheme are proposed. An attempt of comparative analysis on the test problemet of AMVE with heuristic clustering algorithms implemented in actual data analysis software is described. It is shown the comparative advantage of AMVE, both in terms of speed and quality of clustering. Also, the cases on the use of developed algorithm and its implementation for data analysis in sociology and history and criticism of literature are mentioned. In these cases, investigated graphs based on social networks data (the ratio of "friendship" in the social network between users used as the graph edges). It is demonstrated a slight superiority of AMVE in clustering quality compared to the known algorithms Louvain and Walktrap.
Keywords: graph clustering, modularity, community detection, social network analysis.
@article{IIGUM_2018_25_a4,
     author = {N. N. Martynov and O. V. Khandarova and F. V. Khandarov},
     title = {Graph clustering based on modularity variation estimations},
     journal = {The Bulletin of Irkutsk State University. Series Mathematics},
     pages = {63--78},
     publisher = {mathdoc},
     volume = {25},
     year = {2018},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/IIGUM_2018_25_a4/}
}
TY  - JOUR
AU  - N. N. Martynov
AU  - O. V. Khandarova
AU  - F. V. Khandarov
TI  - Graph clustering based on modularity variation estimations
JO  - The Bulletin of Irkutsk State University. Series Mathematics
PY  - 2018
SP  - 63
EP  - 78
VL  - 25
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/IIGUM_2018_25_a4/
LA  - ru
ID  - IIGUM_2018_25_a4
ER  - 
%0 Journal Article
%A N. N. Martynov
%A O. V. Khandarova
%A F. V. Khandarov
%T Graph clustering based on modularity variation estimations
%J The Bulletin of Irkutsk State University. Series Mathematics
%D 2018
%P 63-78
%V 25
%I mathdoc
%U http://geodesic.mathdoc.fr/item/IIGUM_2018_25_a4/
%G ru
%F IIGUM_2018_25_a4
N. N. Martynov; O. V. Khandarova; F. V. Khandarov. Graph clustering based on modularity variation estimations. The Bulletin of Irkutsk State University. Series Mathematics, Tome 25 (2018), pp. 63-78. http://geodesic.mathdoc.fr/item/IIGUM_2018_25_a4/