Enhancing Network Engineering Capabilities through LLM Fine-Tuning with Automatically Generated Datasets
Computer Science and Information Systems, Tome 23 (2026) no. 1
Claudiu Trăistaru; Florin Pop; Costin Bădică; Cătălina Mancaş; and Ionuţ Murareţu. Enhancing Network Engineering Capabilities through LLM Fine-Tuning with Automatically Generated Datasets. Computer Science and Information Systems, Tome 23 (2026) no. 1. http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a24/
@article{CSIS_2026_23_1_a24,
     author = {Claudiu Tr\u{a}istaru and Florin Pop and Costin B\u{a}dic\u{a} and C\u{a}t\u{a}lina Manca\c{s} and and Ionu\c{t} Murare\c{t}u},
     title = {Enhancing {Network} {Engineering} {Capabilities} through {LLM} {Fine-Tuning} with {Automatically} {Generated} {Datasets}},
     journal = {Computer Science and Information Systems},
     year = {2026},
     volume = {23},
     number = {1},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a24/}
}
TY  - JOUR
AU  - Claudiu Trăistaru
AU  - Florin Pop
AU  - Costin Bădică
AU  - Cătălina Mancaş
AU  - and Ionuţ Murareţu
TI  - Enhancing Network Engineering Capabilities through LLM Fine-Tuning with Automatically Generated Datasets
JO  - Computer Science and Information Systems
PY  - 2026
VL  - 23
IS  - 1
UR  - http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a24/
ID  - CSIS_2026_23_1_a24
ER  - 
%0 Journal Article
%A Claudiu Trăistaru
%A Florin Pop
%A Costin Bădică
%A Cătălina Mancaş
%A and Ionuţ Murareţu
%T Enhancing Network Engineering Capabilities through LLM Fine-Tuning with Automatically Generated Datasets
%J Computer Science and Information Systems
%D 2026
%V 23
%N 1
%U http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a24/
%F CSIS_2026_23_1_a24

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

The paper presents a method for automatically generating domain-specific datasets to fine-tune open-source LLMs in network engineering. Our objective is to address the increasingly complex nature of network configuration and management jobs by supplying LLMs with high-quality training data. We evaluated datasets generated using open-source LLMs, including DeepSeek-R1 671B, LLaMA 3.1 70B, Qwen 2.5 72B, and Mixtral 8x7B, analyzing the quality of unprocessed knowledge data and the efficacy of cleaning and deduplication methods. The resulting dataset addresses various subjects related to routing, security, and network services. After-ward, we fine-tuned smaller LLaMA 3.2 1B, LLaMA 3.2 3B and Qwen 2.5 1.5B models using Low-Rank Adaptation, thereby minimizing computational demands while maintaining the quality of domain knowledge.