TY - JOUR
T1 - Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes
AU - Wang, Qi
AU - Gao, Junyu
AU - Li, Xuelong
N1 - Publisher Copyright:
© 1992-2012 IEEE.
PY - 2019/9
Y1 - 2019/9
N2 - Semantic segmentation, a pixel-level vision task, is rapidly developed by using convolutional neural networks (CNNs). Training CNNs requires a large amount of labeled data, but manually annotating data is difficult. For emancipating manpower, in recent years, some synthetic datasets are released. However, they are still different from real scenes, which causes that training a model on the synthetic data (source domain) cannot achieve a good performance on real urban scenes (target domain). In this paper, we propose a weakly supervised adversarial domain adaptation to improve the segmentation performance from synthetic data to real scenes, which consists of three deep neural networks. A detection and segmentation (DS) model focuses on detecting objects and predicting segmentation map; a pixel-level domain classifier (PDC) tries to distinguish the image features from which domains; and an object-level domain classifier (ODC) discriminates the objects from which domains and predicts object classes. PDC and ODC are treated as the discriminators, and DS is considered as the generator. By the adversarial learning, DS is supposed to learn domain-invariant features. In experiments, our proposed method yields the new record of mIoU metric in the same problem.
AB - Semantic segmentation, a pixel-level vision task, is rapidly developed by using convolutional neural networks (CNNs). Training CNNs requires a large amount of labeled data, but manually annotating data is difficult. For emancipating manpower, in recent years, some synthetic datasets are released. However, they are still different from real scenes, which causes that training a model on the synthetic data (source domain) cannot achieve a good performance on real urban scenes (target domain). In this paper, we propose a weakly supervised adversarial domain adaptation to improve the segmentation performance from synthetic data to real scenes, which consists of three deep neural networks. A detection and segmentation (DS) model focuses on detecting objects and predicting segmentation map; a pixel-level domain classifier (PDC) tries to distinguish the image features from which domains; and an object-level domain classifier (ODC) discriminates the objects from which domains and predicts object classes. PDC and ODC are treated as the discriminators, and DS is considered as the generator. By the adversarial learning, DS is supposed to learn domain-invariant features. In experiments, our proposed method yields the new record of mIoU metric in the same problem.
KW - adversarial learning
KW - domain adaptation
KW - Semantic segmentation
KW - weakly supervision
UR - http://www.scopus.com/inward/record.url?scp=85068384610&partnerID=8YFLogxK
U2 - 10.1109/TIP.2019.2910667
DO - 10.1109/TIP.2019.2910667
M3 - 文章
C2 - 30998470
AN - SCOPUS:85068384610
SN - 1057-7149
VL - 28
SP - 4376
EP - 4386
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 9
M1 - 8693661
ER -