A Joint Convolutional Neural Networks and Context Transfer for Street Scenes Labeling

Qi Wang; Junyu Gao; Yuan Yuan

doi:10.1109/TITS.2017.2726546

A Joint Convolutional Neural Networks and Context Transfer for Street Scenes Labeling

Qi Wang, Junyu Gao, Yuan Yuan

School of Artificial Intelligence, OPtics and Electronics

Unmanned System Research Institute

Research output: Contribution to journal › Article › peer-review

123 Scopus citations

Abstract

Street scene understanding is an essential task for autonomous driving. One important step toward this direction is scene labeling, which annotates each pixel in the images with a correct class label. Although many approaches have been developed, there are still some weak points. First, many methods are based on the hand-crafted features whose image representation ability is limited. Second, they cannot label foreground objects accurately due to the data set bias. Third, in the refinement stage, the traditional Markov random filed inference is prone to over smoothness. For improving the above problems, this paper proposes a joint method of priori convolutional neural networks at superpixel level (called as 'priori s-CNNs') and soft restricted context transfer. Our contributions are threefold: 1) a priori s-CNNs model that learns priori location information at superpixel level is proposed to describe various objects discriminatingly; 2) a hierarchical data augmentation method is presented to alleviate data set bias in the priori s-CNNs training stage, which improves foreground objects labeling significantly; and 3) a soft restricted MRF energy function is defined to improve the priori s-CNNs model's labeling performance and reduce the over smoothness at the same time. The proposed approach is verified on CamVid data set (11 classes) and SIFT Flow Street data set (16 classes) and achieves a competitive performance.

Original language	English
Pages (from-to)	1457-1470
Number of pages	14
Journal	IEEE Transactions on Intelligent Transportation Systems
Volume	19
Issue number	5
DOIs	https://doi.org/10.1109/TITS.2017.2726546
State	Published - May 2018

Keywords

convolutional neural networks
data augmentation
deep learning
label transfer
Scene labeling
street scenes

Access to Document

10.1109/TITS.2017.2726546

Cite this

@article{a002f7ea2d134518b4b35a9e35085c7a,

title = "A Joint Convolutional Neural Networks and Context Transfer for Street Scenes Labeling",

abstract = "Street scene understanding is an essential task for autonomous driving. One important step toward this direction is scene labeling, which annotates each pixel in the images with a correct class label. Although many approaches have been developed, there are still some weak points. First, many methods are based on the hand-crafted features whose image representation ability is limited. Second, they cannot label foreground objects accurately due to the data set bias. Third, in the refinement stage, the traditional Markov random filed inference is prone to over smoothness. For improving the above problems, this paper proposes a joint method of priori convolutional neural networks at superpixel level (called as 'priori s-CNNs') and soft restricted context transfer. Our contributions are threefold: 1) a priori s-CNNs model that learns priori location information at superpixel level is proposed to describe various objects discriminatingly; 2) a hierarchical data augmentation method is presented to alleviate data set bias in the priori s-CNNs training stage, which improves foreground objects labeling significantly; and 3) a soft restricted MRF energy function is defined to improve the priori s-CNNs model's labeling performance and reduce the over smoothness at the same time. The proposed approach is verified on CamVid data set (11 classes) and SIFT Flow Street data set (16 classes) and achieves a competitive performance.",

keywords = "convolutional neural networks, data augmentation, deep learning, label transfer, Scene labeling, street scenes",

author = "Qi Wang and Junyu Gao and Yuan Yuan",

note = "Publisher Copyright: {\textcopyright} 2000-2011 IEEE.",

year = "2018",

month = may,

doi = "10.1109/TITS.2017.2726546",

language = "英语",

volume = "19",

pages = "1457--1470",

journal = "IEEE Transactions on Intelligent Transportation Systems",

issn = "1524-9050",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "5",

}

TY - JOUR

T1 - A Joint Convolutional Neural Networks and Context Transfer for Street Scenes Labeling

AU - Wang, Qi

AU - Gao, Junyu

AU - Yuan, Yuan

PY - 2018/5

Y1 - 2018/5

N2 - Street scene understanding is an essential task for autonomous driving. One important step toward this direction is scene labeling, which annotates each pixel in the images with a correct class label. Although many approaches have been developed, there are still some weak points. First, many methods are based on the hand-crafted features whose image representation ability is limited. Second, they cannot label foreground objects accurately due to the data set bias. Third, in the refinement stage, the traditional Markov random filed inference is prone to over smoothness. For improving the above problems, this paper proposes a joint method of priori convolutional neural networks at superpixel level (called as 'priori s-CNNs') and soft restricted context transfer. Our contributions are threefold: 1) a priori s-CNNs model that learns priori location information at superpixel level is proposed to describe various objects discriminatingly; 2) a hierarchical data augmentation method is presented to alleviate data set bias in the priori s-CNNs training stage, which improves foreground objects labeling significantly; and 3) a soft restricted MRF energy function is defined to improve the priori s-CNNs model's labeling performance and reduce the over smoothness at the same time. The proposed approach is verified on CamVid data set (11 classes) and SIFT Flow Street data set (16 classes) and achieves a competitive performance.

AB - Street scene understanding is an essential task for autonomous driving. One important step toward this direction is scene labeling, which annotates each pixel in the images with a correct class label. Although many approaches have been developed, there are still some weak points. First, many methods are based on the hand-crafted features whose image representation ability is limited. Second, they cannot label foreground objects accurately due to the data set bias. Third, in the refinement stage, the traditional Markov random filed inference is prone to over smoothness. For improving the above problems, this paper proposes a joint method of priori convolutional neural networks at superpixel level (called as 'priori s-CNNs') and soft restricted context transfer. Our contributions are threefold: 1) a priori s-CNNs model that learns priori location information at superpixel level is proposed to describe various objects discriminatingly; 2) a hierarchical data augmentation method is presented to alleviate data set bias in the priori s-CNNs training stage, which improves foreground objects labeling significantly; and 3) a soft restricted MRF energy function is defined to improve the priori s-CNNs model's labeling performance and reduce the over smoothness at the same time. The proposed approach is verified on CamVid data set (11 classes) and SIFT Flow Street data set (16 classes) and achieves a competitive performance.

KW - convolutional neural networks

KW - data augmentation

KW - deep learning

KW - label transfer

KW - Scene labeling

KW - street scenes

UR - http://www.scopus.com/inward/record.url?scp=85028507989&partnerID=8YFLogxK

U2 - 10.1109/TITS.2017.2726546

DO - 10.1109/TITS.2017.2726546

M3 - 文章

AN - SCOPUS:85028507989

SN - 1524-9050

VL - 19

SP - 1457

EP - 1470

JO - IEEE Transactions on Intelligent Transportation Systems

JF - IEEE Transactions on Intelligent Transportation Systems

IS - 5

ER -

A Joint Convolutional Neural Networks and Context Transfer for Street Scenes Labeling

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this