Scene parsing with deep features and spatial structure learning

Hui Yu; Yuecheng Song; Wenyu Ju; Zhenbao Liu

doi:10.1007/978-3-319-48896-7_71

Scene parsing with deep features and spatial structure learning

Hui Yu, Yuecheng Song, Wenyu Ju, Zhenbao Liu

School of Civil Aviation

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Conditional Random Field (CRF) is a powerful tool for labeling tasks, and has always played a key role in object recognition and semantic segmentation. However, the quality of CRF labeling depends on selected features, which becomes the bottleneck of the accuracy improvement. In this paper, our semantic segmentation problem is calculated in the same way within the framework of Conditional Random Field. Different from other CRF-based strategies, which use appearance features of image, revealing only little information, we combined our framework together with deep learning strategy, such as Convolutional Neural Networks (CNNs), for feature extraction, which have shown strong ability and remarkable performance. This combination strategy is called deepfeature CRF (dCRF). Through dCRF, the deep informantion of image is illustrated and gets ultilized, and the segmentation accuracy is also increased. The proposed deep CRF strategy is adopted on SIFT-Flow and VOC2007 datasets. The segmentation results reveals that if we use features learned from deep networks into our CRF framework, the performance of our semantic segmentation strategy would increase significantly.

Original language	English
Title of host publication	Advances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings
Editors	Enqing Chen, Yun Tie, Yihong Gong
Publisher	Springer Verlag
Pages	715-722
Number of pages	8
ISBN (Print)	9783319488950
DOIs	https://doi.org/10.1007/978-3-319-48896-7_71
State	Published - 2016
Event	17th Pacific-Rim Conference on Multimedia, PCM 2016 - Xi’an, China Duration: 15 Sep 2016 → 16 Sep 2016

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	9917 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	17th Pacific-Rim Conference on Multimedia, PCM 2016
Country/Territory	China
City	Xi’an
Period	15/09/16 → 16/09/16

Keywords

Conditional random fields (CRFs)
Convolutional neural networks (CNNs)
Deep feature CRF
Deep learning
Scene parsing

Access to Document

10.1007/978-3-319-48896-7_71

Cite this

Yu, H., Song, Y., Ju, W., & Liu, Z. (2016). Scene parsing with deep features and spatial structure learning. In E. Chen, Y. Tie, & Y. Gong (Eds.), Advances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings (pp. 715-722). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9917 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-48896-7_71

Yu, Hui ; Song, Yuecheng ; Ju, Wenyu et al. / Scene parsing with deep features and spatial structure learning. Advances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings. editor / Enqing Chen ; Yun Tie ; Yihong Gong. Springer Verlag, 2016. pp. 715-722 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{34dd648817a746b69fd732b547ec4aa5,

title = "Scene parsing with deep features and spatial structure learning",

abstract = "Conditional Random Field (CRF) is a powerful tool for labeling tasks, and has always played a key role in object recognition and semantic segmentation. However, the quality of CRF labeling depends on selected features, which becomes the bottleneck of the accuracy improvement. In this paper, our semantic segmentation problem is calculated in the same way within the framework of Conditional Random Field. Different from other CRF-based strategies, which use appearance features of image, revealing only little information, we combined our framework together with deep learning strategy, such as Convolutional Neural Networks (CNNs), for feature extraction, which have shown strong ability and remarkable performance. This combination strategy is called deepfeature CRF (dCRF). Through dCRF, the deep informantion of image is illustrated and gets ultilized, and the segmentation accuracy is also increased. The proposed deep CRF strategy is adopted on SIFT-Flow and VOC2007 datasets. The segmentation results reveals that if we use features learned from deep networks into our CRF framework, the performance of our semantic segmentation strategy would increase significantly.",

keywords = "Conditional random fields (CRFs), Convolutional neural networks (CNNs), Deep feature CRF, Deep learning, Scene parsing",

author = "Hui Yu and Yuecheng Song and Wenyu Ju and Zhenbao Liu",

note = "Publisher Copyright: {\textcopyright} Springer International Publishing AG 2016.; 17th Pacific-Rim Conference on Multimedia, PCM 2016 ; Conference date: 15-09-2016 Through 16-09-2016",

year = "2016",

doi = "10.1007/978-3-319-48896-7_71",

language = "英语",

isbn = "9783319488950",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "715--722",

editor = "Enqing Chen and Yun Tie and Yihong Gong",

booktitle = "Advances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings",

}

Yu, H, Song, Y, Ju, W & Liu, Z 2016, Scene parsing with deep features and spatial structure learning. in E Chen, Y Tie & Y Gong (eds), Advances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9917 LNCS, Springer Verlag, pp. 715-722, 17th Pacific-Rim Conference on Multimedia, PCM 2016, Xi’an, China, 15/09/16. https://doi.org/10.1007/978-3-319-48896-7_71

Scene parsing with deep features and spatial structure learning. / Yu, Hui; Song, Yuecheng; Ju, Wenyu et al.
Advances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings. ed. / Enqing Chen; Yun Tie; Yihong Gong. Springer Verlag, 2016. p. 715-722 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9917 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Scene parsing with deep features and spatial structure learning

AU - Yu, Hui

AU - Song, Yuecheng

AU - Ju, Wenyu

AU - Liu, Zhenbao

N1 - Publisher Copyright: © Springer International Publishing AG 2016.

PY - 2016

Y1 - 2016

N2 - Conditional Random Field (CRF) is a powerful tool for labeling tasks, and has always played a key role in object recognition and semantic segmentation. However, the quality of CRF labeling depends on selected features, which becomes the bottleneck of the accuracy improvement. In this paper, our semantic segmentation problem is calculated in the same way within the framework of Conditional Random Field. Different from other CRF-based strategies, which use appearance features of image, revealing only little information, we combined our framework together with deep learning strategy, such as Convolutional Neural Networks (CNNs), for feature extraction, which have shown strong ability and remarkable performance. This combination strategy is called deepfeature CRF (dCRF). Through dCRF, the deep informantion of image is illustrated and gets ultilized, and the segmentation accuracy is also increased. The proposed deep CRF strategy is adopted on SIFT-Flow and VOC2007 datasets. The segmentation results reveals that if we use features learned from deep networks into our CRF framework, the performance of our semantic segmentation strategy would increase significantly.

AB - Conditional Random Field (CRF) is a powerful tool for labeling tasks, and has always played a key role in object recognition and semantic segmentation. However, the quality of CRF labeling depends on selected features, which becomes the bottleneck of the accuracy improvement. In this paper, our semantic segmentation problem is calculated in the same way within the framework of Conditional Random Field. Different from other CRF-based strategies, which use appearance features of image, revealing only little information, we combined our framework together with deep learning strategy, such as Convolutional Neural Networks (CNNs), for feature extraction, which have shown strong ability and remarkable performance. This combination strategy is called deepfeature CRF (dCRF). Through dCRF, the deep informantion of image is illustrated and gets ultilized, and the segmentation accuracy is also increased. The proposed deep CRF strategy is adopted on SIFT-Flow and VOC2007 datasets. The segmentation results reveals that if we use features learned from deep networks into our CRF framework, the performance of our semantic segmentation strategy would increase significantly.

KW - Conditional random fields (CRFs)

KW - Convolutional neural networks (CNNs)

KW - Deep feature CRF

KW - Deep learning

KW - Scene parsing

UR - http://www.scopus.com/inward/record.url?scp=85006900519&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-48896-7_71

DO - 10.1007/978-3-319-48896-7_71

M3 - 会议稿件

AN - SCOPUS:85006900519

SN - 9783319488950

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 715

EP - 722

BT - Advances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings

A2 - Chen, Enqing

A2 - Tie, Yun

A2 - Gong, Yihong

PB - Springer Verlag

T2 - 17th Pacific-Rim Conference on Multimedia, PCM 2016

Y2 - 15 September 2016 through 16 September 2016

ER -

Yu H, Song Y, Ju W, Liu Z. Scene parsing with deep features and spatial structure learning. In Chen E, Tie Y, Gong Y, editors, Advances in Multimedia Information Processing – 17th Pacific-Rim Conference on Multimedia, PCM 2016, Proceedings. Springer Verlag. 2016. p. 715-722. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-48896-7_71

Scene parsing with deep features and spatial structure learning

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this