TY - GEN
T1 - Non-Local Proposal Dynamic Enhancement Learning for Few-Shot Object Detection in Remote Sensing Images
AU - Wang, Haoyu
AU - Zhang, Lei
AU - Wei, Wei
AU - Ding, Chen
AU - Zhang, Yanning
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Deep neural networks have underpinned much of recent progress in few-shot object detection (FSOD) in remote sensing images. The key lies in accurately inferring the object categories and bounding boxes depending on the feature of each proposal region. However, due to lack of sufficient labeled samples for training model well-fitting, the feature of each proposal fails to be discriminative and informative enough for accurate inference, thus limiting the generalization capacity. To mitigate this problem, we propose a non-local proposal dynamic enhancement learning (NPDEL) methods for FSOD in remote sensing images. In contrast to directly utilizing the proposal features extracted from the backbone, we propose to enhance them before inference using a non-local dynamic enhancement module which first carries out a non-local graph convolution on all proposal features and then dynamically fuses the convolved results with the original features for enhancement. By doing this, the enhanced proposal features can adaptively aggregate the related semantic information from the whole image, thus improving their discriminability as well as the generalization capacity in FSOD. Experiments results on different FSOD tasks demonstrate the efficacy of the proposed method.
AB - Deep neural networks have underpinned much of recent progress in few-shot object detection (FSOD) in remote sensing images. The key lies in accurately inferring the object categories and bounding boxes depending on the feature of each proposal region. However, due to lack of sufficient labeled samples for training model well-fitting, the feature of each proposal fails to be discriminative and informative enough for accurate inference, thus limiting the generalization capacity. To mitigate this problem, we propose a non-local proposal dynamic enhancement learning (NPDEL) methods for FSOD in remote sensing images. In contrast to directly utilizing the proposal features extracted from the backbone, we propose to enhance them before inference using a non-local dynamic enhancement module which first carries out a non-local graph convolution on all proposal features and then dynamically fuses the convolved results with the original features for enhancement. By doing this, the enhanced proposal features can adaptively aggregate the related semantic information from the whole image, thus improving their discriminability as well as the generalization capacity in FSOD. Experiments results on different FSOD tasks demonstrate the efficacy of the proposed method.
KW - Deep learning
KW - Few shot Object detection
KW - Non-local graph convolution
KW - Remote sensing images
UR - http://www.scopus.com/inward/record.url?scp=85140378110&partnerID=8YFLogxK
U2 - 10.1109/IGARSS46834.2022.9883058
DO - 10.1109/IGARSS46834.2022.9883058
M3 - 会议稿件
AN - SCOPUS:85140378110
T3 - International Geoscience and Remote Sensing Symposium (IGARSS)
SP - 1888
EP - 1891
BT - IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2022
Y2 - 17 July 2022 through 22 July 2022
ER -