TY - JOUR
T1 - Retentive Compensation and Personality Filtering for Few-Shot Remote Sensing Object Detection
AU - Wu, Jiashan
AU - Lang, Chunbo
AU - Cheng, Gong
AU - Xie, Xingxing
AU - Han, Junwei
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - In recent years, few-shot object detection (FSOD) in remote sensing images has attracted increasing attention. Numerous studies address the challenges posed by both intra-class and inter-class variance through strategies such as augmenting sample diversity and incorporating multi-scale features. However, these features still encompass a considerable amount of noise attributes due to the complex characteristic of satellite images, persistently and adversely affecting classification. In contrast, we advocate for the belief that a limited yet refined set of features surpasses a multitude of coarse features. Accordingly, we tackle above issues through the meticulous refinement of representative category features, enhancing performance by eliminating irrelevant attributes that interfere with classification. Specifically, two pivotal modules: retentive compensation module (RCM) and personality filtering module (PFM), are introduced. The former module RCM systematically scrutinizes features proximate to the category center, yielding prototypes that exhibit both intra-class compactness and inter-class distinctiveness. Furthermore, the latter module PFM utilizes previous obtained prototypes to supervise the filtering process, diminishing the intra-class variance by excluding personality features which could impede the classification task. The integration of the above two modules enables a holistic feature representation, capturing inherent similarities within individual classes while accentuating distinctions between classes. Experiments have been conducted on the DIOR and NWPU VHR-10.v2 datasets, and the results demonstrate that our proposed approach exceeds several state-of-the-art methods. Code is available at https://github.com/yomik-js/RP-FSOD.
AB - In recent years, few-shot object detection (FSOD) in remote sensing images has attracted increasing attention. Numerous studies address the challenges posed by both intra-class and inter-class variance through strategies such as augmenting sample diversity and incorporating multi-scale features. However, these features still encompass a considerable amount of noise attributes due to the complex characteristic of satellite images, persistently and adversely affecting classification. In contrast, we advocate for the belief that a limited yet refined set of features surpasses a multitude of coarse features. Accordingly, we tackle above issues through the meticulous refinement of representative category features, enhancing performance by eliminating irrelevant attributes that interfere with classification. Specifically, two pivotal modules: retentive compensation module (RCM) and personality filtering module (PFM), are introduced. The former module RCM systematically scrutinizes features proximate to the category center, yielding prototypes that exhibit both intra-class compactness and inter-class distinctiveness. Furthermore, the latter module PFM utilizes previous obtained prototypes to supervise the filtering process, diminishing the intra-class variance by excluding personality features which could impede the classification task. The integration of the above two modules enables a holistic feature representation, capturing inherent similarities within individual classes while accentuating distinctions between classes. Experiments have been conducted on the DIOR and NWPU VHR-10.v2 datasets, and the results demonstrate that our proposed approach exceeds several state-of-the-art methods. Code is available at https://github.com/yomik-js/RP-FSOD.
KW - Few-shot object detection
KW - fine-tuning
KW - metric learning
KW - remote sensing
UR - http://www.scopus.com/inward/record.url?scp=85186110368&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2024.3367168
DO - 10.1109/TCSVT.2024.3367168
M3 - 文章
AN - SCOPUS:85186110368
SN - 1051-8215
VL - 34
SP - 5805
EP - 5817
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 7
ER -