TY - JOUR
T1 - RGB-D salient object detection based on BC2FNet network
AU - Wang, Feng
AU - Cheng, Yongmei
N1 - Publisher Copyright:
©2024 Journal of Northwestern Polytechnical University.
PY - 2024/12
Y1 - 2024/12
AB - For complex scene images, introducing depth information can greatly improve the performance of salient object detection. However, up-sampling and down-sampling operations in neural networks may blur object boundaries in the saliency map, thereby reducing detection performance. To address this problem, a boundary-driven cross-modal and cross-layer fusion network (BC2FNet) for RGB-D salient object detection is proposed in this paper, which preserves object boundaries by adding boundary-information guidance to both the cross-modal and the cross-layer fusion. Firstly, a boundary generation module is designed to extract two kinds of boundary information from the low-level features of the RGB and depth modalities, respectively. Secondly, a boundary-driven feature selection module is designed to simultaneously focus on important feature information and preserve boundary details during the fusion of the RGB and depth modalities. Finally, a boundary-driven cross-layer fusion module is proposed that adds both kinds of boundary information during up-sampling fusion on adjacent layers. By embedding this module into the top-down information fusion flow, the predicted saliency map contains accurate objects with sharp boundaries. Simulation results on five standard RGB-D datasets show that the proposed model achieves better performance.
KW - boundary-driven
KW - cross-layer fusion
KW - cross-modal fusion
KW - salient object detection
UR - http://www.scopus.com/inward/record.url?scp=85214477792&partnerID=8YFLogxK
DO - 10.1051/jnwpu/20244261135
M3 - Article
AN - SCOPUS:85214477792
SN - 1000-2758
VL - 42
SP - 1135
EP - 1143
JO - Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
JF - Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
IS - 6
ER -