Remote Sensing Imagery Object Detection Model Compression via Tucker Decomposition

  • Lang Huyan
  • , Ying Li
  • , Dongmei Jiang
  • , Yanning Zhang
  • , Quan Zhou
  • , Bo Li
  • , Jiayuan Wei
  • , Juanni Liu
  • , Yi Zhang
  • , Peng Wang
  • , Hai Fang

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Although convolutional neural networks (CNNs) have made significant progress, their deployment onboard is still challenging because of their complexity and high processing cost. Tensors provide a natural and compact representation of CNN weights via suitable low-rank approximations. A novel decomposed module called DecomResnet based on Tucker decomposition was proposed to deploy a CNN object detection model on a satellite. We proposed a remote sensing image object detection model compression framework based on low-rank decomposition which consisted of four steps, namely (1) model initialization, (2) initial training, (3) decomposition of the trained model and reconstruction of the decomposed model, and (4) fine-tuning. To validate the performance of the decomposed model in our real mission, we constructed a dataset containing only two classes of objects based on the DOTA and HRSC2016. The proposed method was comprehensively evaluated on the NWPU VHR-10 dataset and the CAST-RS2 dataset created in this work. The experimental results demonstrated that the proposed method, which was based on Resnet-50, could achieve up to 4.44 times the compression ratio and 5.71 times the speedup ratio with merely a 1.9% decrease in the mAP (mean average precision) of the CAST-RS2 dataset and a 5.3% decrease the mAP of the NWPU VHR-10 dataset.

Original languageEnglish
Article number856
JournalMathematics
Volume11
Issue number4
DOIs
StatePublished - Feb 2023

Keywords

  • Tucker decomposition
  • model compression
  • onboard object detection
  • rank selection
  • remote sensing imagery
  • tensor decomposition

Fingerprint

Dive into the research topics of 'Remote Sensing Imagery Object Detection Model Compression via Tucker Decomposition'. Together they form a unique fingerprint.

Cite this