Scale-Balanced Real-Time Object Detection With Varying Input-Image Resolution

Longbin Yan, Yunxiao Qin, Jie Chen

Research output: Contribution to journal › Article › peer-review

13 Scopus citations

Abstract

Current object-detection methods often perform poorly on small-scale objects. Using relatively high-resolution input images can remedy this, but it usually degrades performance on large-scale objects. We define this problem as the imbalance of detection performance for multi-scale objects when the resolution of input images varies. In addition, high-resolution images consume significant computational resources and reduce inference speed. In this paper, we propose an object-detection method for multi-scale objects that is friendly to varying input resolution. We analyze in detail why the detection of large-scale objects degrades as input-image resolution increases, and propose a novel lightweight bidirectional feature-flow module to enhance multi-scale object detection in high-resolution images, especially for large-scale objects. The proposed approach also eases the computational cost and inference-speed loss caused by high-resolution images. Additionally, a decoupled detection head is designed to further improve performance by separating the classification and regression sub-tasks, and an adaptive feature-fusion module is designed to better fuse different feature levels. The proposed scheme alleviates the negative effects of using high-resolution input images and achieves an excellent balance between inference speed and precision. Experiments on the MS COCO dataset show that the scheme achieves 44.6 AP at 42.6 FPS and 47 AP at 26.7 FPS, showing significant advantages over the methods to which it is compared.
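
To make the reported speed/precision trade-off concrete, the two operating points in the abstract can be converted from frames per second to per-image latency. The short Python sketch below is illustrative only: the function name `latency_ms` is hypothetical, and the conversion assumes one image is processed at a time, so that latency is simply the reciprocal of FPS. The abstract does not state the hardware, batch size, or which input resolution corresponds to each point, so none of those are modeled here.

```python
# Operating points reported in the abstract: (AP on MS COCO, frames per second).
operating_points = [(44.6, 42.6), (47.0, 26.7)]


def latency_ms(fps: float) -> float:
    """Average per-image inference time in milliseconds for a given FPS.

    Assumption: images are processed one at a time, so latency = 1 / FPS.
    """
    return 1000.0 / fps


# Print each operating point alongside its implied per-image latency.
for ap, fps in operating_points:
    print(f"{ap:.1f} AP at {fps:.1f} FPS = {latency_ms(fps):.1f} ms per image")

# Compare the two points: how much extra time the higher-AP setting costs.
(ap_fast, fps_fast), (ap_slow, fps_slow) = operating_points
extra_ms = latency_ms(fps_slow) - latency_ms(fps_fast)
ap_gain = ap_slow - ap_fast
print(f"+{ap_gain:.1f} AP costs about {extra_ms:.1f} ms more per image")
```

Under that assumption, the faster setting corresponds to roughly 23.5 ms per image and the more accurate one to roughly 37.5 ms, so the additional 2.4 AP costs about 14 ms per image.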

Original language: English
Pages (from-to): 242-256
Number of pages: 15
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Volume: 33
Issue number: 1
State: Published - 1 Jan 2023

Keywords

  • Deep convolutional neural network (CNN)
  • multi-scale feature fusion
  • object detection
