TY - GEN
T1 - Semantic Segmentation of Remote Sensing Images with Multi-scale Features and Attention Mechanism
AU - Xiaoyu, Huang
AU - He, Renjie
AU - Dai, Yuchao
AU - He, Mingyi
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - The performance of existing semantic segmentation algorithms for optical remote sensing images is often affected by the issues of real-world datasets, such as the complexity of ground object categories, the dimensional changes between different classes, and the unbalanced proportion of foreground and background. To address this problem, DRCNet, a semantic segmentation model based on attention mechanism and multi-scale fusion is proposed in this paper. The DRCNet is built upon the DeepLab v3+, which introduces a dual attention module to enhance useful features, thereby alleviating the imbalance of foreground-background labels in the datasets. Meanwhile, a residual dense atrous spatial pyramid pooling model based on residual connection is designed to address the checkerboard effect caused by continuous stacking of dilated convolutions. Furthermore, the accuracy of segmentation for different-sized targets is also improved through the cross-layer interaction of multi-scale feature maps. Extensive experimental results demonstrate that the proposed DRCNet achieves an MIoU of 70.176% and a MPA of 88.203% on the Postdam dataset, which represents improvements of 1.676% and 0.339% compared to the DeepLab v3+ model, respectively.
AB - The performance of existing semantic segmentation algorithms for optical remote sensing images is often affected by the issues of real-world datasets, such as the complexity of ground object categories, the dimensional changes between different classes, and the unbalanced proportion of foreground and background. To address this problem, DRCNet, a semantic segmentation model based on attention mechanism and multi-scale fusion is proposed in this paper. The DRCNet is built upon the DeepLab v3+, which introduces a dual attention module to enhance useful features, thereby alleviating the imbalance of foreground-background labels in the datasets. Meanwhile, a residual dense atrous spatial pyramid pooling model based on residual connection is designed to address the checkerboard effect caused by continuous stacking of dilated convolutions. Furthermore, the accuracy of segmentation for different-sized targets is also improved through the cross-layer interaction of multi-scale feature maps. Extensive experimental results demonstrate that the proposed DRCNet achieves an MIoU of 70.176% and a MPA of 88.203% on the Postdam dataset, which represents improvements of 1.676% and 0.339% compared to the DeepLab v3+ model, respectively.
KW - Attention mechanism
KW - Multi-scale fusion
KW - Semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85173630403&partnerID=8YFLogxK
U2 - 10.1109/ICIEA58696.2023.10241417
DO - 10.1109/ICIEA58696.2023.10241417
M3 - 会议稿件
AN - SCOPUS:85173630403
T3 - Proceedings of the 18th IEEE Conference on Industrial Electronics and Applications, ICIEA 2023
SP - 1448
EP - 1453
BT - Proceedings of the 18th IEEE Conference on Industrial Electronics and Applications, ICIEA 2023
A2 - Cai, Wenjian
A2 - Yang, Guilin
A2 - Qiu, Jun
A2 - Gao, Tingting
A2 - Jiang, Lijun
A2 - Zheng, Tianjiang
A2 - Wang, Xinli
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 18th IEEE Conference on Industrial Electronics and Applications, ICIEA 2023
Y2 - 18 August 2023 through 22 August 2023
ER -