Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition

Xi Yang; Shaoyi Li; Saisai Niu; Binbin Yan; Zhongjie Meng

doi:10.1109/TMM.2024.3408051

Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition

Xi Yang, Shaoyi Li, Saisai Niu, Binbin Yan, Zhongjie Meng

School of Astronautics

Research output: Contribution to journal › Article › peer-review

1 Scopus citations

Abstract

Infrared target recognition and anti-interference incomplex battlefields is one of the key technologies enablingthe precise strike capability of aircraft. Currently, infrared-guided aircraft face complex interference such as naturalbackgrounds and artificial decoys, leading to a decrease in theperformance of infrared target recognition. A particular challengeto infrared target recognition and anti-interference capabilitiesis the strong interference situation caused by the combination oftarget maneuvering and the dense, continuous, and coordinateddeployment of infrared decoys. To address extreme issues such ascomplete loss of target feature information and inability to identifydue to target occlusion, we develop an anti-interference recognitionmethod based on a visually inspired Spatio-Temporal SemanticReasoning Model (STSRM). Firstly, inspired by the functionalcharacteristics of visual semantic reasoning, the STSRM isproposed to simplify the reasoning of relationships among multipleregions into modeling relationships between corresponding regionnode features in a graph-based module. Secondly, an anti-occlusiontarget recognition model based on STSRM is constructed, whichintroduces a reasoning graph module connecting node regions toinfer semantic information and predict targets between regions.The test results on the infrared dataset established in this paperindicate that the proposed anti-interference recognition model canmake accurate target predictions in large-scale or full-occlusionconditions, and we achieve 13.9% and 3.1% improvement on mAPscores and mIoU scores, compared to current advanced method onour simulated infrared dataset.

Original language	English
Pages (from-to)	10530-10544
Number of pages	15
Journal	IEEE Transactions on Multimedia
Volume	26
DOIs	https://doi.org/10.1109/TMM.2024.3408051
State	Published - 2024

Keywords

Anti-occlusion recognition
graph convolution network
infrared aerial targets
semantic reasoning model
vision-inspired

Access to Document

10.1109/TMM.2024.3408051

Cite this

@article{47aeb502c2ac435ba63c20243d488dbb,

title = "Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition",

abstract = "Infrared target recognition and anti-interference incomplex battlefields is one of the key technologies enablingthe precise strike capability of aircraft. Currently, infrared-guided aircraft face complex interference such as naturalbackgrounds and artificial decoys, leading to a decrease in theperformance of infrared target recognition. A particular challengeto infrared target recognition and anti-interference capabilitiesis the strong interference situation caused by the combination oftarget maneuvering and the dense, continuous, and coordinateddeployment of infrared decoys. To address extreme issues such ascomplete loss of target feature information and inability to identifydue to target occlusion, we develop an anti-interference recognitionmethod based on a visually inspired Spatio-Temporal SemanticReasoning Model (STSRM). Firstly, inspired by the functionalcharacteristics of visual semantic reasoning, the STSRM isproposed to simplify the reasoning of relationships among multipleregions into modeling relationships between corresponding regionnode features in a graph-based module. Secondly, an anti-occlusiontarget recognition model based on STSRM is constructed, whichintroduces a reasoning graph module connecting node regions toinfer semantic information and predict targets between regions.The test results on the infrared dataset established in this paperindicate that the proposed anti-interference recognition model canmake accurate target predictions in large-scale or full-occlusionconditions, and we achieve 13.9% and 3.1% improvement on mAPscores and mIoU scores, compared to current advanced method onour simulated infrared dataset.",

keywords = "Anti-occlusion recognition, graph convolution network, infrared aerial targets, semantic reasoning model, vision-inspired",

author = "Xi Yang and Shaoyi Li and Saisai Niu and Binbin Yan and Zhongjie Meng",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.",

year = "2024",

doi = "10.1109/TMM.2024.3408051",

language = "英语",

volume = "26",

pages = "10530--10544",

journal = "IEEE Transactions on Multimedia",

issn = "1520-9210",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition

AU - Yang, Xi

AU - Li, Shaoyi

AU - Niu, Saisai

AU - Yan, Binbin

AU - Meng, Zhongjie

PY - 2024

Y1 - 2024

N2 - Infrared target recognition and anti-interference incomplex battlefields is one of the key technologies enablingthe precise strike capability of aircraft. Currently, infrared-guided aircraft face complex interference such as naturalbackgrounds and artificial decoys, leading to a decrease in theperformance of infrared target recognition. A particular challengeto infrared target recognition and anti-interference capabilitiesis the strong interference situation caused by the combination oftarget maneuvering and the dense, continuous, and coordinateddeployment of infrared decoys. To address extreme issues such ascomplete loss of target feature information and inability to identifydue to target occlusion, we develop an anti-interference recognitionmethod based on a visually inspired Spatio-Temporal SemanticReasoning Model (STSRM). Firstly, inspired by the functionalcharacteristics of visual semantic reasoning, the STSRM isproposed to simplify the reasoning of relationships among multipleregions into modeling relationships between corresponding regionnode features in a graph-based module. Secondly, an anti-occlusiontarget recognition model based on STSRM is constructed, whichintroduces a reasoning graph module connecting node regions toinfer semantic information and predict targets between regions.The test results on the infrared dataset established in this paperindicate that the proposed anti-interference recognition model canmake accurate target predictions in large-scale or full-occlusionconditions, and we achieve 13.9% and 3.1% improvement on mAPscores and mIoU scores, compared to current advanced method onour simulated infrared dataset.

AB - Infrared target recognition and anti-interference incomplex battlefields is one of the key technologies enablingthe precise strike capability of aircraft. Currently, infrared-guided aircraft face complex interference such as naturalbackgrounds and artificial decoys, leading to a decrease in theperformance of infrared target recognition. A particular challengeto infrared target recognition and anti-interference capabilitiesis the strong interference situation caused by the combination oftarget maneuvering and the dense, continuous, and coordinateddeployment of infrared decoys. To address extreme issues such ascomplete loss of target feature information and inability to identifydue to target occlusion, we develop an anti-interference recognitionmethod based on a visually inspired Spatio-Temporal SemanticReasoning Model (STSRM). Firstly, inspired by the functionalcharacteristics of visual semantic reasoning, the STSRM isproposed to simplify the reasoning of relationships among multipleregions into modeling relationships between corresponding regionnode features in a graph-based module. Secondly, an anti-occlusiontarget recognition model based on STSRM is constructed, whichintroduces a reasoning graph module connecting node regions toinfer semantic information and predict targets between regions.The test results on the infrared dataset established in this paperindicate that the proposed anti-interference recognition model canmake accurate target predictions in large-scale or full-occlusionconditions, and we achieve 13.9% and 3.1% improvement on mAPscores and mIoU scores, compared to current advanced method onour simulated infrared dataset.

KW - Anti-occlusion recognition

KW - graph convolution network

KW - infrared aerial targets

KW - semantic reasoning model

KW - vision-inspired

UR - http://www.scopus.com/inward/record.url?scp=85194830664&partnerID=8YFLogxK

U2 - 10.1109/TMM.2024.3408051

DO - 10.1109/TMM.2024.3408051

M3 - 文章

AN - SCOPUS:85194830664

SN - 1520-9210

VL - 26

SP - 10530

EP - 10544

JO - IEEE Transactions on Multimedia

JF - IEEE Transactions on Multimedia

ER -

Graph-Based Spatio-Temporal Semantic Reasoning Model for Anti-Occlusion Infrared Aerial Target Recognition

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this