ECW-EGNet: Exploring Cross-Modal Weighting and Edge-Guided Decoder Network for RGB-D Salient Object Detection

Chenxing Xia; Feng Yang; Songsong Duan; Xiuju Gao; Bin Ge; Kuan-Ching Li; Xianjin Fang; Yan Zhang; Ke Yang

Chenxing Xia ; Feng Yang ; Songsong Duan ; Xiuju Gao ; Bin Ge ; Kuan-Ching Li ; Xianjin Fang ; Yan Zhang ; Ke Yang

Computer Science and Information Systems, Tome 21 (2024) no. 3

Cet article a éte moissonné depuis la source Computer Science and Information Systems website

Voir la notice de l'article

Résumé

Existing RGB-D salient object detection (SOD) techniques concentrate on combining data from multiple modalities (e.g., depth and RGB) and extracting multi-scale data for improved saliency reasoning. However, they frequently perform poorly as a factor of the drawbacks of low-quality depth maps and the lack of correlation between the extracted multi-scale data. In this paper, we propose a Exploring Cross-Modal Weighting and Edge-Guided Decoder Network (ECW-EGNet) for RGB-D SOD, which includes three prominent components. Firstly, we deploy a Cross-Modality Weighting Fusion (CMWF) module that utilizes Channel-Spatial Attention Feature Enhancement (CSAE) mechanism and Depth-Quality Assessment (DQA) mechanism to achieve the cross-modal feature interaction. The former parallels channel attention and spatial attention enhances the features of extracted RGB streams and depth streams while the latter assesses the depth-quality reduces the detrimental influence of the low-quality depth maps during the cross-modal fusion. Then, in order to effectively integrate multi-scale features for high-level and produce salient objects with precise locations, we construct a Bi-directional Scale-Correlation Convolution (BSCC) module in a bi-directional structure. Finally, we construct an Edge-Guided (EG) decoder that uses the edge detection operator to obtain edge masks to guide the enhancement of salient map edge details. The comprehensive experiments on five benchmark RGB-D SOD datasets demonstrate that the proposed ECW-EGNet outperforms 21 state-of-the-art (SOTA) saliency detectors in four widely used evaluation metrics.

Keywords: cross-modality fusion, depth-quality, edge-guided, RGB-D images, salient object detection

@article{CSIS_2024_21_3_a12,
     author = {Chenxing Xia and Feng Yang and Songsong Duan and Xiuju Gao and Bin Ge and Kuan-Ching Li and Xianjin Fang and Yan Zhang and Ke Yang},
     title = {ECW-EGNet: {Exploring} {Cross-Modal} {Weighting} and {Edge-Guided} {Decoder} {Network} for {RGB-D} {Salient} {Object} {Detection}},
     journal = {Computer Science and Information Systems},
     year = {2024},
     volume = {21},
     number = {3},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2024_21_3_a12/}
}

TY  - JOUR
AU  - Chenxing Xia
AU  - Feng Yang
AU  - Songsong Duan
AU  - Xiuju Gao
AU  - Bin Ge
AU  - Kuan-Ching Li
AU  - Xianjin Fang
AU  - Yan Zhang
AU  - Ke Yang
TI  - ECW-EGNet: Exploring Cross-Modal Weighting and Edge-Guided Decoder Network for RGB-D Salient Object Detection
JO  - Computer Science and Information Systems
PY  - 2024
VL  - 21
IS  - 3
UR  - http://geodesic.mathdoc.fr/item/CSIS_2024_21_3_a12/
ID  - CSIS_2024_21_3_a12
ER  -

%0 Journal Article
%A Chenxing Xia
%A Feng Yang
%A Songsong Duan
%A Xiuju Gao
%A Bin Ge
%A Kuan-Ching Li
%A Xianjin Fang
%A Yan Zhang
%A Ke Yang
%T ECW-EGNet: Exploring Cross-Modal Weighting and Edge-Guided Decoder Network for RGB-D Salient Object Detection
%J Computer Science and Information Systems
%D 2024
%V 21
%N 3
%U http://geodesic.mathdoc.fr/item/CSIS_2024_21_3_a12/
%F CSIS_2024_21_3_a12

Chenxing Xia; Feng Yang; Songsong Duan; Xiuju Gao; Bin Ge; Kuan-Ching Li; Xianjin Fang; Yan Zhang; Ke Yang. ECW-EGNet: Exploring Cross-Modal Weighting and Edge-Guided Decoder Network for RGB-D Salient Object Detection. Computer Science and Information Systems, Tome 21 (2024) no. 3. http://geodesic.mathdoc.fr/item/CSIS_2024_21_3_a12/

Parcourir par

Geodesic

Parcourir par