Scene parsing using inference Embedded Deep Networks

Shuhui Bu, Pengcheng Han, Zhenbao Liu, Junwei Han

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

Effective features and graphical model are two key points for realizing high performance scene parsing. Recently, Convolutional Neural Networks (CNNs) have shown great ability of learning features and attained remarkable performance. However, most researches use CNNs and graphical model separately, and do not exploit full advantages of both methods. In order to achieve better performance, this work aims to design a novel neural network architecture called Inference Embedded Deep Networks (IEDNs), which incorporates a novel designed inference layer based on graphical model. Through the IEDNs, the network can learn hybrid features, the advantages of which are that they not only provide a powerful representation capturing hierarchical information, but also encapsulate spatial relationship information among adjacent objects. We apply the proposed networks to scene labeling, and several experiments are conducted on SIFT Flow and PASCAL VOC Dataset. The results demonstrate that the proposed IEDNs can achieve better performance.

Original languageEnglish
Pages (from-to)188-198
Number of pages11
JournalPattern Recognition
Volume59
DOIs
StatePublished - 1 Nov 2016

Keywords

  • Conditional Random Fields (CRFs)
  • Convolutional Neural Networks (CNNs)
  • Hybrid Features
  • Inference Embedded Deep Networks (IEDNs)

Fingerprint

Dive into the research topics of 'Scene parsing using inference Embedded Deep Networks'. Together they form a unique fingerprint.

Cite this