Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes

Qi Wang; Junyu Gao; Xuelong Li

doi:10.1109/TIP.2019.2910667

School of Artificial Intelligence, OPtics and Electronics

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

211 Scopus citations

Abstract

Semantic segmentation, a pixel-level vision task, has advanced rapidly through the use of convolutional neural networks (CNNs). Training CNNs requires a large amount of labeled data, but manual annotation is laborious. To reduce this manual effort, several synthetic datasets have been released in recent years. However, synthetic images still differ from real scenes, so a model trained on synthetic data (source domain) does not perform well on real urban scenes (target domain). In this paper, we propose a weakly supervised adversarial domain adaptation framework to improve segmentation performance when transferring from synthetic data to real scenes. The framework consists of three deep neural networks: a detection and segmentation (DS) model detects objects and predicts the segmentation map; a pixel-level domain classifier (PDC) tries to distinguish which domain the image features come from; and an object-level domain classifier (ODC) discriminates which domain the objects come from and predicts their classes. PDC and ODC act as the discriminators, and DS acts as the generator. Through adversarial learning, DS is expected to learn domain-invariant features. In experiments, the proposed method achieves a new best mIoU on this problem.
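The generator/discriminator roles described in the abstract can be illustrated with a minimal sketch of a generic adversarial domain adaptation objective. This is not the paper's implementation: the abstract does not give the loss functions, so the binary cross-entropy form, the function names, the domain labels, and the weights `w_pixel` and `w_object` are all assumptions made for illustration. The object-class prediction that ODC also performs is omitted.

```python
import math

SOURCE, TARGET = 0, 1  # hypothetical domain labels: synthetic vs. real


def bce(prob, label):
    """Binary cross-entropy for one predicted probability and a 0/1 label."""
    eps = 1e-12  # keeps log() away from zero
    prob = min(max(prob, eps), 1.0 - eps)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


def discriminator_loss(probs_source, probs_target):
    """Loss for a domain classifier (PDC or ODC in the abstract's terms).

    Each value is the classifier's predicted probability that a feature
    comes from the TARGET domain. The classifier is rewarded for telling
    the two domains apart.
    """
    losses = [bce(p, SOURCE) for p in probs_source]
    losses += [bce(p, TARGET) for p in probs_target]
    return sum(losses) / len(losses)


def generator_adversarial_loss(probs_target):
    """Adversarial term for the generator (DS in the abstract's terms).

    The generator is rewarded when target-domain features are mistaken
    for SOURCE, which pushes it toward domain-invariant features.
    """
    return sum(bce(p, SOURCE) for p in probs_target) / len(probs_target)


def ds_total_loss(seg_loss, pdc_probs_target, odc_probs_target,
                  w_pixel=0.1, w_object=0.1):
    """Segmentation loss plus weighted pixel- and object-level adversarial
    terms. The weights are hypothetical, not values from the paper."""
    return (seg_loss
            + w_pixel * generator_adversarial_loss(pdc_probs_target)
            + w_object * generator_adversarial_loss(odc_probs_target))
```

In a training loop of this kind, the discriminators and the generator are typically updated in alternation: the discriminators minimize `discriminator_loss` with the generator fixed, then the generator minimizes `ds_total_loss` with the discriminators fixed.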

Original language	English
Article number	8693661
Pages (from-to)	4376-4386
Number of pages	11
Journal	IEEE Transactions on Image Processing
Volume	28
Issue number	9
DOIs	https://doi.org/10.1109/TIP.2019.2910667
State	Published - Sep 2019

Keywords

adversarial learning
domain adaptation
Semantic segmentation
weakly supervision

Access to Document

10.1109/TIP.2019.2910667

Cite this

@article{2b453ac3c4d94515b1c645aa2ab4b8cf,

title = "Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes",

abstract = "Semantic segmentation, a pixel-level vision task, is rapidly developed by using convolutional neural networks (CNNs). Training CNNs requires a large amount of labeled data, but manually annotating data is difficult. For emancipating manpower, in recent years, some synthetic datasets are released. However, they are still different from real scenes, which causes that training a model on the synthetic data (source domain) cannot achieve a good performance on real urban scenes (target domain). In this paper, we propose a weakly supervised adversarial domain adaptation to improve the segmentation performance from synthetic data to real scenes, which consists of three deep neural networks. A detection and segmentation (DS) model focuses on detecting objects and predicting segmentation map; a pixel-level domain classifier (PDC) tries to distinguish the image features from which domains; and an object-level domain classifier (ODC) discriminates the objects from which domains and predicts object classes. PDC and ODC are treated as the discriminators, and DS is considered as the generator. By the adversarial learning, DS is supposed to learn domain-invariant features. In experiments, our proposed method yields the new record of mIoU metric in the same problem.",

keywords = "adversarial learning, domain adaptation, Semantic segmentation, weakly supervision",

author = "Qi Wang and Junyu Gao and Xuelong Li",

note = "Publisher Copyright: {\textcopyright} 1992-2012 IEEE.",

year = "2019",

month = sep,

doi = "10.1109/TIP.2019.2910667",

language = "English",

volume = "28",

pages = "4376--4386",

journal = "IEEE Transactions on Image Processing",

issn = "1057-7149",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "9",

}

TY - JOUR

T1 - Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes

AU - Wang, Qi

AU - Gao, Junyu

AU - Li, Xuelong

PY - 2019/9

Y1 - 2019/9

AB - Semantic segmentation, a pixel-level vision task, is rapidly developed by using convolutional neural networks (CNNs). Training CNNs requires a large amount of labeled data, but manually annotating data is difficult. For emancipating manpower, in recent years, some synthetic datasets are released. However, they are still different from real scenes, which causes that training a model on the synthetic data (source domain) cannot achieve a good performance on real urban scenes (target domain). In this paper, we propose a weakly supervised adversarial domain adaptation to improve the segmentation performance from synthetic data to real scenes, which consists of three deep neural networks. A detection and segmentation (DS) model focuses on detecting objects and predicting segmentation map; a pixel-level domain classifier (PDC) tries to distinguish the image features from which domains; and an object-level domain classifier (ODC) discriminates the objects from which domains and predicts object classes. PDC and ODC are treated as the discriminators, and DS is considered as the generator. By the adversarial learning, DS is supposed to learn domain-invariant features. In experiments, our proposed method yields the new record of mIoU metric in the same problem.

KW - adversarial learning

KW - domain adaptation

KW - Semantic segmentation

KW - weakly supervision

UR - http://www.scopus.com/inward/record.url?scp=85068384610&partnerID=8YFLogxK

U2 - 10.1109/TIP.2019.2910667

DO - 10.1109/TIP.2019.2910667

M3 - Article

C2 - 30998470

AN - SCOPUS:85068384610

SN - 1057-7149

VL - 28

SP - 4376

EP - 4386

JO - IEEE Transactions on Image Processing

JF - IEEE Transactions on Image Processing

IS - 9

M1 - 8693661

ER -
