TY - JOUR
T1 - Dual-Path Feature Aware Network for Remote Sensing Image Semantic Segmentation
AU - Geng, Jie
AU - Song, Shuai
AU - Jiang, Wen
N1 - Publisher Copyright:
© 2024 Institute of Electrical and Electronics Engineers Inc.. All rights reserved.
PY - 2024/5/1
Y1 - 2024/5/1
N2 - Semantic segmentation is a significant task for remote sensing interpretation, which takes advantage of contextual semantic information to classify each pixel into a specific category. Most current methods apply convolutional neural networks (CNN) to learn feature representation from remote sensing images, which may ignore the global dependencies due to the limitation of convolutional kernels. Inspired by the global feature learning ability of Transformer, we propose a novel deep model called dual-path feature aware network (DPFANet), which combines the structure of CNN and Transformer for semantic segmentation of remote sensing images. DPFANet aims to learn effective modeling ability from local to global features of images. Simultaneously, an adaptive feature fusion network is developed to fuse features from dual-path networks. Moreover, an edge optimization block is applied to constrain the edge features, whose purpose is to obtain more representative features for segmentation. Experimental results on three public remote sensing datasets verify that our proposed network yields better segmentation performance compared to other related methods.
AB - Semantic segmentation is a significant task for remote sensing interpretation, which takes advantage of contextual semantic information to classify each pixel into a specific category. Most current methods apply convolutional neural networks (CNN) to learn feature representation from remote sensing images, which may ignore the global dependencies due to the limitation of convolutional kernels. Inspired by the global feature learning ability of Transformer, we propose a novel deep model called dual-path feature aware network (DPFANet), which combines the structure of CNN and Transformer for semantic segmentation of remote sensing images. DPFANet aims to learn effective modeling ability from local to global features of images. Simultaneously, an adaptive feature fusion network is developed to fuse features from dual-path networks. Moreover, an edge optimization block is applied to constrain the edge features, whose purpose is to obtain more representative features for segmentation. Experimental results on three public remote sensing datasets verify that our proposed network yields better segmentation performance compared to other related methods.
KW - Semantic segmentation
KW - attention mechanism
KW - feature fusion
KW - remote sensing image
KW - transformer
UR - http://www.scopus.com/inward/record.url?scp=85181565487&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2023.3317937
DO - 10.1109/TCSVT.2023.3317937
M3 - 文章
AN - SCOPUS:85181565487
SN - 1051-8215
VL - 34
SP - 3674
EP - 3686
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 5
ER -