SimAndro-Plus: On Computing Similarity of Android Applications
Computer Science and Information Systems, Tome 18 (2021) no. 4
Cet article a éte moissonné depuis la source Computer Science and Information Systems website
In this paper, we propose SimAndro-Plus as an improved variant of the state-of-the-art method, SimAndro, to compute the similarity of Android applications (apps) regarding their functionalities. SimAndro-Plus has two major differences with SimAndro: 1) it exploits two beneficial features to similarity computation, which are totally disregarded by SimAndro; 2) to compute the similarity score of an app-pair based on strings and package name features, SimAndro-Plus considers not only those terms co-appearing in both apps but also considers those terms appearing in one app while missing in the other one. The results of our extensive experiments with three real-world datasets and a dataset constructed by human experts demonstrate that 1) each of the two aforementioned differences is really effective to achieve better accuracy and 2) SimAndro-Plus outperforms SimAndro in similarity computation by 14% in average.
Keywords:
android applications, apps data mining, feature extraction, API calls, manifest information, similarity computation
@article{CSIS_2021_18_4_a6,
author = {Masoud Reyhani Hamedani and Sang-Wook Kim},
title = {SimAndro-Plus: {On} {Computing} {Similarity} of {Android} {Applications}},
journal = {Computer Science and Information Systems},
year = {2021},
volume = {18},
number = {4},
url = {http://geodesic.mathdoc.fr/item/CSIS_2021_18_4_a6/}
}
Masoud Reyhani Hamedani; Sang-Wook Kim. SimAndro-Plus: On Computing Similarity of Android Applications. Computer Science and Information Systems, Tome 18 (2021) no. 4. http://geodesic.mathdoc.fr/item/CSIS_2021_18_4_a6/