Image Semantic Segmentation Based on Multi-layer Feature Information Fusion and Dual Convolutional Attention Mechanism
Computer Science and Information Systems, Tome 22 (2025) no. 3

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

Traditional semantic segmentation methods have problems such as poor multi-scale feature extraction ability, weak lightweight backbone network feature extraction ability, lack of effective fusion of context information, resulting in edge segmentation errors and feature discontinuity. In this paper, a novel semantic segmentation model based on multi-layer information fusion and dual convolutional attention mechanism is proposed. In this method, SegFormer network is used as the backbone network, and multi-scale features of encoder output are fused with overlapping features. The feature extraction subnetwork is optimized by constructing the object region enhancement module, and the intermediate feature map is refined adaptively in each convolutional block of the deep network, so as to strengthen the fine extraction of multi-dimensional feature information of complex images. Dual convolutional attention module is used to fusion high-level semantic information to avoid the loss of feature information caused by up-sampling operation and the influence of introducing noise, and refine the effect of target edge segmentation. At the same time, the feature pyramid grid is proposed to process the overlapping features, obtain the context information of different scales, and enhance the semantic expression of features. Finally, the features processed by the feature pyramid grid module are combined to improve the segmentation effect. The experimental results on the public data set show that the proposed method has better performance than the existing methods, and has better segmentation effect on the object edge in the scene.
Keywords: Semantic segmentation, multi-layer information fusion, dual convolutional attention mechanism, feature pyramid grid
Lin Teng; Yulong Qiao; Jinfeng Wang; Mirjana Ivanović; Shoulin Yin. Image Semantic Segmentation Based on Multi-layer Feature Information Fusion and Dual Convolutional Attention Mechanism. Computer Science and Information Systems, Tome 22 (2025) no. 3. http://geodesic.mathdoc.fr/item/CSIS_2025_22_3_a11/
@article{CSIS_2025_22_3_a11,
     author = {Lin Teng and Yulong Qiao and Jinfeng Wang and Mirjana Ivanovi\'c and Shoulin Yin},
     title = {Image {Semantic} {Segmentation} {Based} on {Multi-layer} {Feature} {Information} {Fusion} and {Dual} {Convolutional} {Attention} {Mechanism}},
     journal = {Computer Science and Information Systems},
     year = {2025},
     volume = {22},
     number = {3},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2025_22_3_a11/}
}
TY  - JOUR
AU  - Lin Teng
AU  - Yulong Qiao
AU  - Jinfeng Wang
AU  - Mirjana Ivanović
AU  - Shoulin Yin
TI  - Image Semantic Segmentation Based on Multi-layer Feature Information Fusion and Dual Convolutional Attention Mechanism
JO  - Computer Science and Information Systems
PY  - 2025
VL  - 22
IS  - 3
UR  - http://geodesic.mathdoc.fr/item/CSIS_2025_22_3_a11/
ID  - CSIS_2025_22_3_a11
ER  - 
%0 Journal Article
%A Lin Teng
%A Yulong Qiao
%A Jinfeng Wang
%A Mirjana Ivanović
%A Shoulin Yin
%T Image Semantic Segmentation Based on Multi-layer Feature Information Fusion and Dual Convolutional Attention Mechanism
%J Computer Science and Information Systems
%D 2025
%V 22
%N 3
%U http://geodesic.mathdoc.fr/item/CSIS_2025_22_3_a11/
%F CSIS_2025_22_3_a11