Mitigating Out-of-Vocabulary Challenges in Embedded devices Vulnerability Classification: An Ensemble Embedding Approach with Bidirectional Context Modeling
Computer Science and Information Systems, Tome 23 (2026) no. 1
Voir la notice de l'article provenant de la source Computer Science and Information Systems website
Critical infrastructure is increasingly reliant on embedded systems, which are particularly vulnerable to cyberattacks due to their inherent complexity and interconnectivity. Accurate classification of vulnerabilities in these systems is essential for targeted analysis and mitigation strategies. While pre-trained word embeddings such as Word2Vec, GloVe, and FastText are commonly used for this purpose, their effectiveness is limited by reliance on training corpora that lack domainspecific terminology, leading to challenges with Out-of-Vocabulary words and reduced classification performance. To address this limitation, we propose a novel ensemble embedding technique that combines multiple pre-trained embeddings to improve vulnerability classification in embedded systems. Evaluated on benchmark datasets, including the National Vulnerability Database and the China National Vulnerability Database, our method achieves a 91.50% F1-score on unseen data, outperforming traditional single-embedding approaches. This advancement demonstrates significant potential for enhancing cybersecurity in critical infrastructure applications.
Keywords:
ensemble embedding, word embedding, vulnerability classification, critical infrastructure devices security
Aissa Ben Yahya; Hicham El Akhal; Abdelbaki El Belrhiti El Alaoui. Mitigating Out-of-Vocabulary Challenges in Embedded devices Vulnerability Classification: An Ensemble Embedding Approach with Bidirectional Context Modeling. Computer Science and Information Systems, Tome 23 (2026) no. 1. http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a19/
@article{CSIS_2026_23_1_a19,
author = {Aissa Ben Yahya and Hicham El Akhal and Abdelbaki El Belrhiti El Alaoui},
title = {Mitigating {Out-of-Vocabulary} {Challenges} in {Embedded} devices {Vulnerability} {Classification:} {An} {Ensemble} {Embedding} {Approach} with {Bidirectional} {Context} {Modeling}},
journal = {Computer Science and Information Systems},
year = {2026},
volume = {23},
number = {1},
url = {http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a19/}
}
TY - JOUR AU - Aissa Ben Yahya AU - Hicham El Akhal AU - Abdelbaki El Belrhiti El Alaoui TI - Mitigating Out-of-Vocabulary Challenges in Embedded devices Vulnerability Classification: An Ensemble Embedding Approach with Bidirectional Context Modeling JO - Computer Science and Information Systems PY - 2026 VL - 23 IS - 1 UR - http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a19/ ID - CSIS_2026_23_1_a19 ER -
%0 Journal Article %A Aissa Ben Yahya %A Hicham El Akhal %A Abdelbaki El Belrhiti El Alaoui %T Mitigating Out-of-Vocabulary Challenges in Embedded devices Vulnerability Classification: An Ensemble Embedding Approach with Bidirectional Context Modeling %J Computer Science and Information Systems %D 2026 %V 23 %N 1 %U http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a19/ %F CSIS_2026_23_1_a19