Depth-guided Deformable Convolutions for RGB-D Saliency Object Detection

Fei Li, Jiangbin Zheng, Yuan Fang Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Recently, RGB-D salient object detection(SOD) has attracted increasing research interests, and existing methods have achieved huge success owing to well-designed feature extraction and fusion. However, in existing methods, the depth maps cannot be utilized entirely since RGB and depth are usually concatenated together as an entirety and then feed into the backbone to extract features, which cannot achieve the spatial supervision between both modals. In this letter, we propose a Depth-guided Deformable 3D Convolution (Guided-Conv) to solve this problem. Specifically, the Guided-Conv obtains the sampling offset of the 3D convolution kernel guided by the extra depth input, enabling the convolutional layer to change the receptive field and adapt to geometric cross-modal transformations. Besides, the Guided-Conv also incorporates geometric cues into the forward propagation by producing spatially adaptive filter weights. Based on comprehensive experiments on several extensively used benchmarks, the Guided-Conv yields strong results against several state-of-the-art RGB-D SOD approaches based on four key evaluation metrics.

Original languageEnglish
Title of host publicationProceedings - 2021 6th International Conference on Communication, Image and Signal Processings, CCISP 2021
EditorsJing Zhang
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages234-239
Number of pages6
ISBN (Electronic)9781665432795
DOIs
StatePublished - 2021
Event6th International Conference on Communication, Image and Signal Processings, CCISP 2021 - Virtual, Online, China
Duration: 20 Nov 2021 → …

Publication series

NameProceedings - 2021 6th International Conference on Communication, Image and Signal Processings, CCISP 2021

Conference

Conference6th International Conference on Communication, Image and Signal Processings, CCISP 2021
Country/TerritoryChina
CityVirtual, Online
Period20/11/21 → …

Keywords

  • 3D Convolution
  • Generate Offset
  • RGB-D
  • Salient Object Detection

Fingerprint

Dive into the research topics of 'Depth-guided Deformable Convolutions for RGB-D Saliency Object Detection'. Together they form a unique fingerprint.

Cite this