在轨高效目标检测加速技术

Lang Huyan; Ying Li; Dongmei Jiang; Yanning Zhang; Quan Zhou; Jiayuan Wei; Juanni Liu

doi:10.3873/j.issn.1000-1328.2022.11.011

在轨高效目标检测加速技术

Translated title of the contribution: Efficient Acceleration Technology for On board Object Detection

Lang Huyan, Ying Li, Dongmei Jiang, Yanning Zhang, Quan Zhou, Jiayuan Wei, Juanni Liu

School of Computer Science

Research output: Contribution to journal › Article › peer-review

Abstract

To solve the problem that deep convolutional neural network object detection algorithms are difficult to deploy on board due to their large number of parameters, large computation, limitations of onboard computing resources, storage resources, and power consumption, an efficient on board object detection algorithm acceleration framework and implementation method are proposed. First of all, a computing engine that can be compatible with three convolutional operators is designed, which effectively improves resource utilization. Secondly, the object detection algorithm model is expanded from the two dimensions of channel and convolution kernel, which realizes the high parallelization and scalability of the accelerator. Finally, the accelerator was implemented on multiple FPGA platforms and its performance was evaluated. Experimental results show that the proposed FPGA based accelerator can achieve up to 1843.2 GFLOPs throughput, and the inference time is 0.22 ms. Compared with accelerators proposed in related literature, the accelerator proposed in this paper has great advantages in terms of performance, power consumption, energy efficiency ratio, and inference time. It is suitable for deployment in resource constrained environments and has good application prospects and values on satellites.

Translated title of the contribution	Efficient Acceleration Technology for On board Object Detection
Original language	Chinese (Traditional)
Pages (from-to)	1544-1556
Number of pages	13
Journal	Yuhang Xuebao/Journal of Astronautics
Volume	43
Issue number	11
DOIs	https://doi.org/10.3873/j.issn.1000-1328.2022.11.011
State	Published - Nov 2022

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.3873/j.issn.1000-1328.2022.11.011

Cite this

@article{7a8984fb1949438a8fc3bd9f8668df4f,

title = "在轨高效目标检测加速技术",

abstract = "To solve the problem that deep convolutional neural network object detection algorithms are difficult to deploy on board due to their large number of parameters, large computation, limitations of onboard computing resources, storage resources, and power consumption, an efficient on board object detection algorithm acceleration framework and implementation method are proposed. First of all, a computing engine that can be compatible with three convolutional operators is designed, which effectively improves resource utilization. Secondly, the object detection algorithm model is expanded from the two dimensions of channel and convolution kernel, which realizes the high parallelization and scalability of the accelerator. Finally, the accelerator was implemented on multiple FPGA platforms and its performance was evaluated. Experimental results show that the proposed FPGA based accelerator can achieve up to 1843.2 GFLOPs throughput, and the inference time is 0.22 ms. Compared with accelerators proposed in related literature, the accelerator proposed in this paper has great advantages in terms of performance, power consumption, energy efficiency ratio, and inference time. It is suitable for deployment in resource constrained environments and has good application prospects and values on satellites.",

keywords = "Computational intensity, Convolutional neural networks, Model acceleration, Model quantization, Object detection",

author = "Lang Huyan and Ying Li and Dongmei Jiang and Yanning Zhang and Quan Zhou and Jiayuan Wei and Juanni Liu",

year = "2022",

month = nov,

doi = "10.3873/j.issn.1000-1328.2022.11.011",

language = "繁体中文",

volume = "43",

pages = "1544--1556",

journal = "Yuhang Xuebao/Journal of Astronautics",

issn = "1000-1328",

publisher = "Chinese Society of Astronautics",

number = "11",

}

TY - JOUR

T1 - 在轨高效目标检测加速技术

AU - Huyan, Lang

AU - Li, Ying

AU - Jiang, Dongmei

AU - Zhang, Yanning

AU - Zhou, Quan

AU - Wei, Jiayuan

AU - Liu, Juanni

PY - 2022/11

Y1 - 2022/11

N2 - To solve the problem that deep convolutional neural network object detection algorithms are difficult to deploy on board due to their large number of parameters, large computation, limitations of onboard computing resources, storage resources, and power consumption, an efficient on board object detection algorithm acceleration framework and implementation method are proposed. First of all, a computing engine that can be compatible with three convolutional operators is designed, which effectively improves resource utilization. Secondly, the object detection algorithm model is expanded from the two dimensions of channel and convolution kernel, which realizes the high parallelization and scalability of the accelerator. Finally, the accelerator was implemented on multiple FPGA platforms and its performance was evaluated. Experimental results show that the proposed FPGA based accelerator can achieve up to 1843.2 GFLOPs throughput, and the inference time is 0.22 ms. Compared with accelerators proposed in related literature, the accelerator proposed in this paper has great advantages in terms of performance, power consumption, energy efficiency ratio, and inference time. It is suitable for deployment in resource constrained environments and has good application prospects and values on satellites.

AB - To solve the problem that deep convolutional neural network object detection algorithms are difficult to deploy on board due to their large number of parameters, large computation, limitations of onboard computing resources, storage resources, and power consumption, an efficient on board object detection algorithm acceleration framework and implementation method are proposed. First of all, a computing engine that can be compatible with three convolutional operators is designed, which effectively improves resource utilization. Secondly, the object detection algorithm model is expanded from the two dimensions of channel and convolution kernel, which realizes the high parallelization and scalability of the accelerator. Finally, the accelerator was implemented on multiple FPGA platforms and its performance was evaluated. Experimental results show that the proposed FPGA based accelerator can achieve up to 1843.2 GFLOPs throughput, and the inference time is 0.22 ms. Compared with accelerators proposed in related literature, the accelerator proposed in this paper has great advantages in terms of performance, power consumption, energy efficiency ratio, and inference time. It is suitable for deployment in resource constrained environments and has good application prospects and values on satellites.

KW - Computational intensity

KW - Convolutional neural networks

KW - Model acceleration

KW - Model quantization

KW - Object detection

UR - http://www.scopus.com/inward/record.url?scp=85146762487&partnerID=8YFLogxK

U2 - 10.3873/j.issn.1000-1328.2022.11.011

DO - 10.3873/j.issn.1000-1328.2022.11.011

M3 - 文章

AN - SCOPUS:85146762487

SN - 1000-1328

VL - 43

SP - 1544

EP - 1556

JO - Yuhang Xuebao/Journal of Astronautics

JF - Yuhang Xuebao/Journal of Astronautics

IS - 11

ER -

在轨高效目标检测加速技术

Abstract

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this