TY - GEN
T1 - Depth-guided Deformable Convolutions for RGB-D Saliency Object Detection
AU - Li, Fei
AU - Zheng, Jiangbin
AU - Zhang, Yuan Fang
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Recently, RGB-D salient object detection (SOD) has attracted increasing research interest, and existing methods have achieved great success owing to well-designed feature extraction and fusion. However, in existing methods the depth maps are not fully exploited: RGB and depth are usually concatenated into a single input and fed into the backbone to extract features, which provides no spatial supervision between the two modalities. In this letter, we propose a Depth-guided Deformable 3D Convolution (Guided-Conv) to address this problem. Specifically, Guided-Conv obtains the sampling offsets of the 3D convolution kernel under the guidance of the additional depth input, enabling the convolutional layer to adjust its receptive field and adapt to geometric cross-modal transformations. Besides, Guided-Conv also incorporates geometric cues into the forward propagation by producing spatially adaptive filter weights. Comprehensive experiments on several widely used benchmarks show that Guided-Conv yields strong results against several state-of-the-art RGB-D SOD approaches on four key evaluation metrics.
AB - Recently, RGB-D salient object detection (SOD) has attracted increasing research interest, and existing methods have achieved great success owing to well-designed feature extraction and fusion. However, in existing methods the depth maps are not fully exploited: RGB and depth are usually concatenated into a single input and fed into the backbone to extract features, which provides no spatial supervision between the two modalities. In this letter, we propose a Depth-guided Deformable 3D Convolution (Guided-Conv) to address this problem. Specifically, Guided-Conv obtains the sampling offsets of the 3D convolution kernel under the guidance of the additional depth input, enabling the convolutional layer to adjust its receptive field and adapt to geometric cross-modal transformations. Besides, Guided-Conv also incorporates geometric cues into the forward propagation by producing spatially adaptive filter weights. Comprehensive experiments on several widely used benchmarks show that Guided-Conv yields strong results against several state-of-the-art RGB-D SOD approaches on four key evaluation metrics.
KW - 3D Convolution
KW - Generate Offset
KW - RGB-D
KW - Salient Object Detection
UR - http://www.scopus.com/inward/record.url?scp=85123792205&partnerID=8YFLogxK
U2 - 10.1109/CCISP52774.2021.9639345
DO - 10.1109/CCISP52774.2021.9639345
M3 - Conference contribution
AN - SCOPUS:85123792205
T3 - Proceedings - 2021 6th International Conference on Communication, Image and Signal Processings, CCISP 2021
SP - 234
EP - 239
BT - Proceedings - 2021 6th International Conference on Communication, Image and Signal Processings, CCISP 2021
A2 - Zhang, Jing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th International Conference on Communication, Image and Signal Processings, CCISP 2021
Y2 - 20 November 2021
ER -