PEFNet: Position Enhancement Faster Network for Object Detection in Roadside Perception System

Lei Huang; Wenzhun Huang; Hai Gong; Changqing Yu; Zhuhong You

doi:10.1109/ACCESS.2023.3292881

PEFNet: Position Enhancement Faster Network for Object Detection in Roadside Perception System

Lei Huang, Wenzhun Huang, Hai Gong, Changqing Yu, Zhuhong You

计算机学院

Xijing University

科研成果: 期刊稿件 › 文章 › 同行评审

8 引用（Scopus）

摘要

Roadside perception is a challenging research area that presents even greater difficulties than vehicle perception. Due to the different locations and angles of cameras, roadside objects exhibit violent multi-scale variations, while the vast sensing field introduces more small-scale targets and complex backgrounds, making target recognition more challenging. To address these problems, we focus on position information encoding to achieve accurate roadside object detection by proposing the position enhancement faster network (PEFNet). Based on YOLOv6, the FasterNet Block is introduced into Backbone and Neck networks to provide efficient feature extraction while achieving model lightweight transformation. To improve small target detection performance, a position-Aware feature pyramid network (PA-PAN) is proposed to enhance position information encoding, and the SPD-Conv is applied in the PA-PAN to further enhance effective feature extraction. Finally, the TSCODE is integrated into the detection head to achieve accurate target recognition and suppress background noise interference. Experiments on the Rope3D and UA-DETRAC datasets show that our model outperforms advanced YOLOv6, YOLOX, and FCOS in roadside object detection. Compared with YOLOv6, our method improves the mAP0.50 on the Rope3D dataset from 78.18% to 82.39%, with the AP of small objects such as pedestrians increasing by 7.01%. Furthermore, PEFNet reduces the weight of the network by 43.1% while maintaining detection speed at 75fps and achieving higher accuracy than previous algorithms for the same number of frames.

源语言	英语
页（从-至）	73007-73023
页数	17
期刊	IEEE Access
卷	11
DOI	https://doi.org/10.1109/ACCESS.2023.3292881
出版状态	已出版 - 2023

访问文件

10.1109/ACCESS.2023.3292881

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{f4dadc8e21a0480dbfca4768298fe655,

title = "PEFNet: Position Enhancement Faster Network for Object Detection in Roadside Perception System",

abstract = "Roadside perception is a challenging research area that presents even greater difficulties than vehicle perception. Due to the different locations and angles of cameras, roadside objects exhibit violent multi-scale variations, while the vast sensing field introduces more small-scale targets and complex backgrounds, making target recognition more challenging. To address these problems, we focus on position information encoding to achieve accurate roadside object detection by proposing the position enhancement faster network (PEFNet). Based on YOLOv6, the FasterNet Block is introduced into Backbone and Neck networks to provide efficient feature extraction while achieving model lightweight transformation. To improve small target detection performance, a position-Aware feature pyramid network (PA-PAN) is proposed to enhance position information encoding, and the SPD-Conv is applied in the PA-PAN to further enhance effective feature extraction. Finally, the TSCODE is integrated into the detection head to achieve accurate target recognition and suppress background noise interference. Experiments on the Rope3D and UA-DETRAC datasets show that our model outperforms advanced YOLOv6, YOLOX, and FCOS in roadside object detection. Compared with YOLOv6, our method improves the mAP0.50 on the Rope3D dataset from 78.18% to 82.39%, with the AP of small objects such as pedestrians increasing by 7.01%. Furthermore, PEFNet reduces the weight of the network by 43.1% while maintaining detection speed at 75fps and achieving higher accuracy than previous algorithms for the same number of frames.",

keywords = "decoupled head, feature aggregation, Feature extraction, object detection, position enhancement, roadside images",

author = "Lei Huang and Wenzhun Huang and Hai Gong and Changqing Yu and Zhuhong You",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2023",

doi = "10.1109/ACCESS.2023.3292881",

language = "英语",

volume = "11",

pages = "73007--73023",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - PEFNet

T2 - Position Enhancement Faster Network for Object Detection in Roadside Perception System

AU - Huang, Lei

AU - Huang, Wenzhun

AU - Gong, Hai

AU - Yu, Changqing

AU - You, Zhuhong

PY - 2023

Y1 - 2023

N2 - Roadside perception is a challenging research area that presents even greater difficulties than vehicle perception. Due to the different locations and angles of cameras, roadside objects exhibit violent multi-scale variations, while the vast sensing field introduces more small-scale targets and complex backgrounds, making target recognition more challenging. To address these problems, we focus on position information encoding to achieve accurate roadside object detection by proposing the position enhancement faster network (PEFNet). Based on YOLOv6, the FasterNet Block is introduced into Backbone and Neck networks to provide efficient feature extraction while achieving model lightweight transformation. To improve small target detection performance, a position-Aware feature pyramid network (PA-PAN) is proposed to enhance position information encoding, and the SPD-Conv is applied in the PA-PAN to further enhance effective feature extraction. Finally, the TSCODE is integrated into the detection head to achieve accurate target recognition and suppress background noise interference. Experiments on the Rope3D and UA-DETRAC datasets show that our model outperforms advanced YOLOv6, YOLOX, and FCOS in roadside object detection. Compared with YOLOv6, our method improves the mAP0.50 on the Rope3D dataset from 78.18% to 82.39%, with the AP of small objects such as pedestrians increasing by 7.01%. Furthermore, PEFNet reduces the weight of the network by 43.1% while maintaining detection speed at 75fps and achieving higher accuracy than previous algorithms for the same number of frames.

AB - Roadside perception is a challenging research area that presents even greater difficulties than vehicle perception. Due to the different locations and angles of cameras, roadside objects exhibit violent multi-scale variations, while the vast sensing field introduces more small-scale targets and complex backgrounds, making target recognition more challenging. To address these problems, we focus on position information encoding to achieve accurate roadside object detection by proposing the position enhancement faster network (PEFNet). Based on YOLOv6, the FasterNet Block is introduced into Backbone and Neck networks to provide efficient feature extraction while achieving model lightweight transformation. To improve small target detection performance, a position-Aware feature pyramid network (PA-PAN) is proposed to enhance position information encoding, and the SPD-Conv is applied in the PA-PAN to further enhance effective feature extraction. Finally, the TSCODE is integrated into the detection head to achieve accurate target recognition and suppress background noise interference. Experiments on the Rope3D and UA-DETRAC datasets show that our model outperforms advanced YOLOv6, YOLOX, and FCOS in roadside object detection. Compared with YOLOv6, our method improves the mAP0.50 on the Rope3D dataset from 78.18% to 82.39%, with the AP of small objects such as pedestrians increasing by 7.01%. Furthermore, PEFNet reduces the weight of the network by 43.1% while maintaining detection speed at 75fps and achieving higher accuracy than previous algorithms for the same number of frames.

KW - decoupled head

KW - feature aggregation

KW - Feature extraction

KW - object detection

KW - position enhancement

KW - roadside images

UR - http://www.scopus.com/inward/record.url?scp=85164421528&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2023.3292881

DO - 10.1109/ACCESS.2023.3292881

M3 - 文章

AN - SCOPUS:85164421528

SN - 2169-3536

VL - 11

SP - 73007

EP - 73023

JO - IEEE Access

JF - IEEE Access

ER -

PEFNet: Position Enhancement Faster Network for Object Detection in Roadside Perception System

摘要

访问文件

其它文件与链接

指纹

引用此