TY - JOUR
T1 - A novel multi-loss-based deep adversarial network for handling challenging cases in semi-supervised image semantic segmentation
AU - Huang, Wei
AU - Shao, Zhanfei
AU - Luo, Mingyuan
AU - Zhang, Peng
AU - Zha, Yufei
N1 - Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2021/6
Y1 - 2021/6
N2 - Image semantic segmentation is popular in computer vision and pattern recognition, since the high-level semantic understanding of images can be effectively realized. Based on whether and to what extent the training data should be labeled, most image semantic segmentation methods can be categorized into fully-supervised learning-based methods, weakly-supervised learning-based methods, and semi-supervised learning-based methods. Among them, semi-supervised image semantic segmentation receives increasing popularity recently, because of its flexibility and convenience in requiring partial training data to be labeled. Although semi-supervised image semantic segmentation is promising, its state-of-the-arts haven't obtained satisfactory performance when handling challenging cases, including poor illumination, small-sized targets, multi-targets with the same semantics, etc. To tackle the above dilemmas, a novel multi-loss-based deep adversarial network is proposed in this paper. Technically, the more robust WGAN-GP model is utilized as the backbone of the novel network, instead of the conventional GAN model. Moreover, multiple losses including the cross entropy loss, the edge detection loss, the adversarial loss, and the semi-supervised loss, are all incorporated during the novel network's training. Experimental analyses based on challenging cases shortlisted from the Pascal VOC 2012 dataset and the Cityscapes dataset suggest that, the novel network is capable to outperform state-of-the-arts in semi-supervised image semantic segmentation.
AB - Image semantic segmentation is popular in computer vision and pattern recognition, since the high-level semantic understanding of images can be effectively realized. Based on whether and to what extent the training data should be labeled, most image semantic segmentation methods can be categorized into fully-supervised learning-based methods, weakly-supervised learning-based methods, and semi-supervised learning-based methods. Among them, semi-supervised image semantic segmentation receives increasing popularity recently, because of its flexibility and convenience in requiring partial training data to be labeled. Although semi-supervised image semantic segmentation is promising, its state-of-the-arts haven't obtained satisfactory performance when handling challenging cases, including poor illumination, small-sized targets, multi-targets with the same semantics, etc. To tackle the above dilemmas, a novel multi-loss-based deep adversarial network is proposed in this paper. Technically, the more robust WGAN-GP model is utilized as the backbone of the novel network, instead of the conventional GAN model. Moreover, multiple losses including the cross entropy loss, the edge detection loss, the adversarial loss, and the semi-supervised loss, are all incorporated during the novel network's training. Experimental analyses based on challenging cases shortlisted from the Pascal VOC 2012 dataset and the Cityscapes dataset suggest that, the novel network is capable to outperform state-of-the-arts in semi-supervised image semantic segmentation.
KW - Deep adversarial network
KW - Image semantic segmentation
KW - Semi-supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85103654920&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2021.03.017
DO - 10.1016/j.patrec.2021.03.017
M3 - 文章
AN - SCOPUS:85103654920
SN - 0167-8655
VL - 146
SP - 208
EP - 214
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
ER -