Scene parsing with deep features and spatial structure learning

Hui Yu, Yuecheng Song, Wenyu Ju, Zhenbao Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Conditional Random Field (CRF) is a powerful tool for labeling tasks, and has always played a key role in object recognition and semantic segmentation. However, the quality of CRF labeling depends on selected features, which becomes the bottleneck of the accuracy improvement. In this paper, our semantic segmentation problem is calculated in the same way within the framework of Conditional Random Field. Different from other CRF-based strategies, which use appearance features of image, revealing only little information, we combined our framework together with deep learning strategy, such as Convolutional Neural Networks (CNNs), for feature extraction, which have shown strong ability and remarkable performance. This combination strategy is called deepfeature CRF (dCRF). Through dCRF, the deep informantion of image is illustrated and gets ultilized, and the segmentation accuracy is also increased. The proposed deep CRF strategy is adopted on SIFT-Flow and VOC2007 datasets. The segmentation results reveals that if we use features learned from deep networks into our CRF framework, the performance of our semantic segmentation strategy would increase significantly.

Original languageEnglish
Title of host publicationAdvances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings
EditorsEnqing Chen, Yun Tie, Yihong Gong
PublisherSpringer Verlag
Pages715-722
Number of pages8
ISBN (Print)9783319488950
DOIs
StatePublished - 2016
Event17th Pacific-Rim Conference on Multimedia, PCM 2016 - Xi’an, China
Duration: 15 Sep 201616 Sep 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9917 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th Pacific-Rim Conference on Multimedia, PCM 2016
Country/TerritoryChina
CityXi’an
Period15/09/1616/09/16

Keywords

  • Conditional random fields (CRFs)
  • Convolutional neural networks (CNNs)
  • Deep feature CRF
  • Deep learning
  • Scene parsing

Fingerprint

Dive into the research topics of 'Scene parsing with deep features and spatial structure learning'. Together they form a unique fingerprint.

Cite this