TY - GEN
T1 - Vehicle Detection Based on YOLOv7 for Drone Aerial Visible and Infrared Images
AU - Zhou, Tao
AU - Xin, Biqiao
AU - Zheng, Jiangbin
AU - Zhang, Guanghui
AU - Wang, Bingshu
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/1/12
Y1 - 2024/1/12
N2 - Object detection using drone-captured aerial images holds great significance for real-time traffic detection and control by traffic management authorities. This paper introduces a novel algorithm based on the widely-used YOLOv7 model. The algorithm presents an enhanced evaluation metric, the Normalized Gaussian Wasserstein Distance (NWD), utilizing normalized Wasserstein distance for small object detection. NWD measures object similarity by considering associated Gaussian distributions. To address challenges from complex backgrounds and redundant features due to irrelevant noise in aerial images, the paper introduces the CBAM attention mechanism. This mechanism improves feature expression for vehicle detection, enabling selective focus on target regions. The proposed algorithm is evaluated on the Drone Vehicle dataset and compared with state-of-the-art algorithms. Experimental results demonstrate favorable average precision values in both visible light and infrared images. The improved network model leads to a 1.2% increase in mean average precision (mAP) for visible light image detection and a 0.9% increase for infrared image detection.
AB - Object detection using drone-captured aerial images holds great significance for real-time traffic detection and control by traffic management authorities. This paper introduces a novel algorithm based on the widely-used YOLOv7 model. The algorithm presents an enhanced evaluation metric, the Normalized Gaussian Wasserstein Distance (NWD), utilizing normalized Wasserstein distance for small object detection. NWD measures object similarity by considering associated Gaussian distributions. To address challenges from complex backgrounds and redundant features due to irrelevant noise in aerial images, the paper introduces the CBAM attention mechanism. This mechanism improves feature expression for vehicle detection, enabling selective focus on target regions. The proposed algorithm is evaluated on the Drone Vehicle dataset and compared with state-of-the-art algorithms. Experimental results demonstrate favorable average precision values in both visible light and infrared images. The improved network model leads to a 1.2% increase in mean average precision (mAP) for visible light image detection and a 0.9% increase for infrared image detection.
KW - Loss function
KW - Remote sensing images
KW - Small object detection
KW - Visible light and infrared images
KW - YOLOv7
UR - http://www.scopus.com/inward/record.url?scp=85192812378&partnerID=8YFLogxK
U2 - 10.1145/3645259.3645265
DO - 10.1145/3645259.3645265
M3 - 会议稿件
AN - SCOPUS:85192812378
T3 - ACM International Conference Proceeding Series
SP - 30
EP - 35
BT - IPMV 2024 - Proceedings of 2024 6th International Conference on Image Processing and Machine Vision
PB - Association for Computing Machinery
T2 - 6th International Conference on Image Processing and Machine Vision, IPMV 2024
Y2 - 12 January 2024 through 14 January 2024
ER -