TY - JOUR
T1 - FDENet
T2 - Frequency-Guided Dual-Encoder Network for Building Footprint Extraction From Remote Sensing Images
AU - Yuan, Shenao
AU - Wang, Zhen
AU - Li, Jiayuan
AU - Xu, Nan
AU - You, Zhuhong
AU - Huang, De Shuang
N1 - Publisher Copyright:
© 2008-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Building footprint extraction is a fundamental task in remote sensing image analysis, with wide-ranging applications in smart city development, urban planning, and disaster response. Despite recent advances, existing methods still struggle to simultaneously capture fine-grained boundary details, corresponding to high-frequency information such as sharp edges and textures, and maintain the integrity of overall building shapes, which depends on low-frequency global structural information. These limitations often lead to blurred or inaccurate boundaries and incomplete building representations, especially in complex urban environments with densely packed or irregularly shaped buildings. This article presents a novel Frequency-Guided Dual-Encoder Network (FDENet) for building footprint extraction, which combines a graph-aware wavelet decomposition with a frequency-controlled visual state-space model within a dual-encoder attention framework. Specifically, the proposed framework incorporates a Graph-Constrained Wavelet Transformer Block to establish a dual-encoder structure that operates across both spatial and frequency domains. The high-frequency branch utilizes residual convolution blocks to enhance texture and boundary feature representation, while the low-frequency branch integrates a Frequency-Selective Visual State-Space Model to capture global structural information. In addition, a Mixed-Frequency Attention Mechanism is introduced to facilitate the deep fusion of high- and low-frequency features, improving their interaction and complementarity. To address the persistent issue of blurred boundaries, a cross-stage adaptive fusion loss function is designed, incorporating an Edge-Aware Loss to further enhance boundary precision. Comprehensive experiments on multiple public building extraction datasets demonstrate that FDENet achieves state-of-the-art performance, significantly improving both boundary sharpness and structural consistency.
AB - Building footprint extraction is a fundamental task in remote sensing image analysis, with wide-ranging applications in smart city development, urban planning, and disaster response. Despite recent advances, existing methods still struggle to simultaneously capture fine-grained boundary details, corresponding to high-frequency information such as sharp edges and textures, and maintain the integrity of overall building shapes, which depends on low-frequency global structural information. These limitations often lead to blurred or inaccurate boundaries and incomplete building representations, especially in complex urban environments with densely packed or irregularly shaped buildings. This article presents a novel Frequency-Guided Dual-Encoder Network (FDENet) for building footprint extraction, which combines a graph-aware wavelet decomposition with a frequency-controlled visual state-space model within a dual-encoder attention framework. Specifically, the proposed framework incorporates a Graph-Constrained Wavelet Transformer Block to establish a dual-encoder structure that operates across both spatial and frequency domains. The high-frequency branch utilizes residual convolution blocks to enhance texture and boundary feature representation, while the low-frequency branch integrates a Frequency-Selective Visual State-Space Model to capture global structural information. In addition, a Mixed-Frequency Attention Mechanism is introduced to facilitate the deep fusion of high- and low-frequency features, improving their interaction and complementarity. To address the persistent issue of blurred boundaries, a cross-stage adaptive fusion loss function is designed, incorporating an Edge-Aware Loss to further enhance boundary precision. Comprehensive experiments on multiple public building extraction datasets demonstrate that FDENet achieves state-of-the-art performance, significantly improving both boundary sharpness and structural consistency.
KW - Building footprint extraction
KW - edge-aware loss
KW - frequency-guided dual-encoder framework
KW - remote sensing images (RSIs)
KW - wavelet decomposition
UR - https://www.scopus.com/pages/publications/105013644710
U2 - 10.1109/JSTARS.2025.3601023
DO - 10.1109/JSTARS.2025.3601023
M3 - 文章
AN - SCOPUS:105013644710
SN - 1939-1404
VL - 18
SP - 22403
EP - 22420
JO - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
JF - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
ER -