TY - JOUR
T1 - Uncertainty Inspired RGB-D Saliency Detection
AU - Zhang, Jing
AU - Fan, Deng-Ping
AU - Dai, Yuchao
AU - Anwar, Saeed
AU - Saleh, Fatemeh
AU - Aliakbarian, Sadegh
AU - Barnes, Nick
N1 - Publisher Copyright:
© 1979-2012 IEEE.
PY - 2022/9/1
Y1 - 2022/9/1
N2 - We propose the first stochastic framework to employ uncertainty for RGB-D saliency detection by learning from the data labeling process. Existing RGB-D saliency detection models treat this task as a point estimation problem, predicting a single saliency map following a deterministic learning pipeline. However, we argue that this deterministic solution is relatively ill-posed. Inspired by the saliency data labeling process, we propose a generative architecture to achieve probabilistic RGB-D saliency detection, which utilizes a latent variable to model the labeling variations. Our framework includes two main models: 1) a generator model, which maps the input image and latent variable to a stochastic saliency prediction, and 2) an inference model, which gradually updates the latent variable by sampling it from the true or approximate posterior distribution. The generator model is an encoder-decoder saliency network. To infer the latent variable, we introduce two different solutions: i) a Conditional Variational Auto-encoder with an extra encoder to approximate the posterior distribution of the latent variable; and ii) an Alternating Back-Propagation technique, which directly samples the latent variable from the true posterior distribution. Qualitative and quantitative results on six challenging RGB-D benchmark datasets show our approach's superior performance in learning the distribution of saliency maps. The source code is publicly available via our project page: https://github.com/JingZhang617/UCNet.
AB - We propose the first stochastic framework to employ uncertainty for RGB-D saliency detection by learning from the data labeling process. Existing RGB-D saliency detection models treat this task as a point estimation problem, predicting a single saliency map following a deterministic learning pipeline. However, we argue that this deterministic solution is relatively ill-posed. Inspired by the saliency data labeling process, we propose a generative architecture to achieve probabilistic RGB-D saliency detection, which utilizes a latent variable to model the labeling variations. Our framework includes two main models: 1) a generator model, which maps the input image and latent variable to a stochastic saliency prediction, and 2) an inference model, which gradually updates the latent variable by sampling it from the true or approximate posterior distribution. The generator model is an encoder-decoder saliency network. To infer the latent variable, we introduce two different solutions: i) a Conditional Variational Auto-encoder with an extra encoder to approximate the posterior distribution of the latent variable; and ii) an Alternating Back-Propagation technique, which directly samples the latent variable from the true posterior distribution. Qualitative and quantitative results on six challenging RGB-D benchmark datasets show our approach's superior performance in learning the distribution of saliency maps. The source code is publicly available via our project page: https://github.com/JingZhang617/UCNet.
KW - RGB-D saliency detection
KW - Uncertainty
KW - alternating back-propagation
KW - conditional variational autoencoders
UR - http://www.scopus.com/inward/record.url?scp=85104635037&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2021.3073564
DO - 10.1109/TPAMI.2021.3073564
M3 - Article
C2 - 33856982
AN - SCOPUS:85104635037
SN - 0162-8828
VL - 44
SP - 5761
EP - 5779
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 9
ER -