Ssd7-ffam: A real-time object detection network friendly to embedded devices from scratch

Qing Li, Yingcheng Lin, Wei He

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

The high requirements for computing and memory are the biggest challenges in deploying existing object detection networks to embedded devices. Living lightweight object detectors directly use lightweight neural network architectures such as MobileNet or ShuffleNet pre-trained on large-scale classification datasets, which results in poor network structure flexibility and is not suitable for some specific scenarios. In this paper, we propose a lightweight object detection network Single-Shot MultiBox Detector (SSD)7-Feature Fusion and Attention Mechanism (FFAM), which saves storage space and reduces the amount of calculation by reducing the number of convolutional layers. We offer a novel Feature Fusion and Attention Mechanism (FFAM) method to improve detection accuracy. Firstly, the FFAM method fuses high-level semantic information-rich feature maps with low-level feature maps to improve small objects’ detection accuracy. The lightweight attention mechanism cascaded by channels and spatial attention modules is employed to enhance the target’s contextual information and guide the network to focus on its easy-to-recognize features. The SSD7-FFAM achieves 83.7% mean Average Precision (mAP), 1.66 MB parameters, and 0.033 s average running time on the NWPU VHR-10 dataset. The results indicate that the proposed SSD7-FFAM is more suitable for deployment to embedded devices for real-time object detection.

Original languageEnglish
Article number1096
Pages (from-to)1-17
Number of pages17
JournalApplied Sciences (Switzerland)
Volume11
Issue number3
DOIs
StatePublished - 1 Feb 2021
Externally publishedYes

Keywords

  • Attention mechanism
  • Embedded devices
  • Feature fusion
  • Lightweight network
  • Object detection

Fingerprint

Dive into the research topics of 'Ssd7-ffam: A real-time object detection network friendly to embedded devices from scratch'. Together they form a unique fingerprint.

Cite this