A novel multi-loss-based deep adversarial network for handling challenging cases in semi-supervised image semantic segmentation

Wei Huang; Zhanfei Shao; Mingyuan Luo; Peng Zhang; Yufei Zha

doi:10.1016/j.patrec.2021.03.017

A novel multi-loss-based deep adversarial network for handling challenging cases in semi-supervised image semantic segmentation

Wei Huang, Zhanfei Shao, Mingyuan Luo, Peng Zhang, Yufei Zha

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

9 引用（Scopus）

摘要

Image semantic segmentation is popular in computer vision and pattern recognition, since the high-level semantic understanding of images can be effectively realized. Based on whether and to what extent the training data should be labeled, most image semantic segmentation methods can be categorized into fully-supervised learning-based methods, weakly-supervised learning-based methods, and semi-supervised learning-based methods. Among them, semi-supervised image semantic segmentation receives increasing popularity recently, because of its flexibility and convenience in requiring partial training data to be labeled. Although semi-supervised image semantic segmentation is promising, its state-of-the-arts haven't obtained satisfactory performance when handling challenging cases, including poor illumination, small-sized targets, multi-targets with the same semantics, etc. To tackle the above dilemmas, a novel multi-loss-based deep adversarial network is proposed in this paper. Technically, the more robust WGAN-GP model is utilized as the backbone of the novel network, instead of the conventional GAN model. Moreover, multiple losses including the cross entropy loss, the edge detection loss, the adversarial loss, and the semi-supervised loss, are all incorporated during the novel network's training. Experimental analyses based on challenging cases shortlisted from the Pascal VOC 2012 dataset and the Cityscapes dataset suggest that, the novel network is capable to outperform state-of-the-arts in semi-supervised image semantic segmentation.

源语言	英语
页（从-至）	208-214
页数	7
期刊	Pattern Recognition Letters
卷	146
DOI	https://doi.org/10.1016/j.patrec.2021.03.017
出版状态	已出版 - 6月 2021

访问文件

10.1016/j.patrec.2021.03.017

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{6a270862398c4a97921d46e5c8a7d161,

title = "A novel multi-loss-based deep adversarial network for handling challenging cases in semi-supervised image semantic segmentation",

abstract = "Image semantic segmentation is popular in computer vision and pattern recognition, since the high-level semantic understanding of images can be effectively realized. Based on whether and to what extent the training data should be labeled, most image semantic segmentation methods can be categorized into fully-supervised learning-based methods, weakly-supervised learning-based methods, and semi-supervised learning-based methods. Among them, semi-supervised image semantic segmentation receives increasing popularity recently, because of its flexibility and convenience in requiring partial training data to be labeled. Although semi-supervised image semantic segmentation is promising, its state-of-the-arts haven't obtained satisfactory performance when handling challenging cases, including poor illumination, small-sized targets, multi-targets with the same semantics, etc. To tackle the above dilemmas, a novel multi-loss-based deep adversarial network is proposed in this paper. Technically, the more robust WGAN-GP model is utilized as the backbone of the novel network, instead of the conventional GAN model. Moreover, multiple losses including the cross entropy loss, the edge detection loss, the adversarial loss, and the semi-supervised loss, are all incorporated during the novel network's training. Experimental analyses based on challenging cases shortlisted from the Pascal VOC 2012 dataset and the Cityscapes dataset suggest that, the novel network is capable to outperform state-of-the-arts in semi-supervised image semantic segmentation.",

keywords = "Deep adversarial network, Image semantic segmentation, Semi-supervised learning",

author = "Wei Huang and Zhanfei Shao and Mingyuan Luo and Peng Zhang and Yufei Zha",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier B.V.",

year = "2021",

month = jun,

doi = "10.1016/j.patrec.2021.03.017",

language = "英语",

volume = "146",

pages = "208--214",

journal = "Pattern Recognition Letters",

issn = "0167-8655",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - A novel multi-loss-based deep adversarial network for handling challenging cases in semi-supervised image semantic segmentation

AU - Huang, Wei

AU - Shao, Zhanfei

AU - Luo, Mingyuan

AU - Zhang, Peng

AU - Zha, Yufei

PY - 2021/6

Y1 - 2021/6

N2 - Image semantic segmentation is popular in computer vision and pattern recognition, since the high-level semantic understanding of images can be effectively realized. Based on whether and to what extent the training data should be labeled, most image semantic segmentation methods can be categorized into fully-supervised learning-based methods, weakly-supervised learning-based methods, and semi-supervised learning-based methods. Among them, semi-supervised image semantic segmentation receives increasing popularity recently, because of its flexibility and convenience in requiring partial training data to be labeled. Although semi-supervised image semantic segmentation is promising, its state-of-the-arts haven't obtained satisfactory performance when handling challenging cases, including poor illumination, small-sized targets, multi-targets with the same semantics, etc. To tackle the above dilemmas, a novel multi-loss-based deep adversarial network is proposed in this paper. Technically, the more robust WGAN-GP model is utilized as the backbone of the novel network, instead of the conventional GAN model. Moreover, multiple losses including the cross entropy loss, the edge detection loss, the adversarial loss, and the semi-supervised loss, are all incorporated during the novel network's training. Experimental analyses based on challenging cases shortlisted from the Pascal VOC 2012 dataset and the Cityscapes dataset suggest that, the novel network is capable to outperform state-of-the-arts in semi-supervised image semantic segmentation.

AB - Image semantic segmentation is popular in computer vision and pattern recognition, since the high-level semantic understanding of images can be effectively realized. Based on whether and to what extent the training data should be labeled, most image semantic segmentation methods can be categorized into fully-supervised learning-based methods, weakly-supervised learning-based methods, and semi-supervised learning-based methods. Among them, semi-supervised image semantic segmentation receives increasing popularity recently, because of its flexibility and convenience in requiring partial training data to be labeled. Although semi-supervised image semantic segmentation is promising, its state-of-the-arts haven't obtained satisfactory performance when handling challenging cases, including poor illumination, small-sized targets, multi-targets with the same semantics, etc. To tackle the above dilemmas, a novel multi-loss-based deep adversarial network is proposed in this paper. Technically, the more robust WGAN-GP model is utilized as the backbone of the novel network, instead of the conventional GAN model. Moreover, multiple losses including the cross entropy loss, the edge detection loss, the adversarial loss, and the semi-supervised loss, are all incorporated during the novel network's training. Experimental analyses based on challenging cases shortlisted from the Pascal VOC 2012 dataset and the Cityscapes dataset suggest that, the novel network is capable to outperform state-of-the-arts in semi-supervised image semantic segmentation.

KW - Deep adversarial network

KW - Image semantic segmentation

KW - Semi-supervised learning

UR - http://www.scopus.com/inward/record.url?scp=85103654920&partnerID=8YFLogxK

U2 - 10.1016/j.patrec.2021.03.017

DO - 10.1016/j.patrec.2021.03.017

M3 - 文章

AN - SCOPUS:85103654920

SN - 0167-8655

VL - 146

SP - 208

EP - 214

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

ER -

A novel multi-loss-based deep adversarial network for handling challenging cases in semi-supervised image semantic segmentation

摘要

访问文件

其它文件与链接

指纹

引用此