TY - GEN
T1 - A Shape-Based Quadrangle Detector for Aerial Images
AU - Rao, Chaofan
AU - Li, Wenbo
AU - Xie, Xingxing
AU - Cheng, Gong
N1 - Publisher Copyright:
© 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2024
Y1 - 2024
N2 - The performance of oriented object detectors has been adversely impacted by the substantial variations in object orientation. In this paper, we propose a simple but efficient object detection framework for oriented objects in aerial images, termed QuadDet. Instead of adopting oriented bounding box to represent the object, we directly predict the four vertices of the object’s quadrilateral. Specially, we introduce a fast sorting method for four vertexes of quadrangles, called the Vertex Sorting Function. The function confirms that the vertexes can compose a valid quadrangle by sorting tangents of the vertexes. Furthermore, we employ an efficient polygon IoU loss function, named the PolyIoU Loss Function, to progressively align the predicted quadrangle’s shape with the ground truth. Under these strategies, our model achieves competitive performance. Without bells and whistles, our method with ResNet50 achieves 73.63% mAP on the DOTA-v1.0 dataset running at 23.4 FPS, which surpasses all recent one-stage oriented object detectors by a significant margin. Moreover, on the largest dataset DOTA-v2.0, our QuadDet with ResNet50 obtains 51.54% mAP. The code and models are available at https://github.com/DDGRCF/QuadDet.
AB - The performance of oriented object detectors has been adversely impacted by the substantial variations in object orientation. In this paper, we propose a simple but efficient object detection framework for oriented objects in aerial images, termed QuadDet. Instead of adopting oriented bounding box to represent the object, we directly predict the four vertices of the object’s quadrilateral. Specially, we introduce a fast sorting method for four vertexes of quadrangles, called the Vertex Sorting Function. The function confirms that the vertexes can compose a valid quadrangle by sorting tangents of the vertexes. Furthermore, we employ an efficient polygon IoU loss function, named the PolyIoU Loss Function, to progressively align the predicted quadrangle’s shape with the ground truth. Under these strategies, our model achieves competitive performance. Without bells and whistles, our method with ResNet50 achieves 73.63% mAP on the DOTA-v1.0 dataset running at 23.4 FPS, which surpasses all recent one-stage oriented object detectors by a significant margin. Moreover, on the largest dataset DOTA-v2.0, our QuadDet with ResNet50 obtains 51.54% mAP. The code and models are available at https://github.com/DDGRCF/QuadDet.
KW - Aerial Images
KW - Oriented Object Detection
KW - Quadrangle Representation
KW - Vertex Sorting
UR - http://www.scopus.com/inward/record.url?scp=85181771990&partnerID=8YFLogxK
U2 - 10.1007/978-981-99-8462-6_30
DO - 10.1007/978-981-99-8462-6_30
M3 - 会议稿件
AN - SCOPUS:85181771990
SN - 9789819984619
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 368
EP - 379
BT - Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Proceedings
A2 - Liu, Qingshan
A2 - Wang, Hanzi
A2 - Ji, Rongrong
A2 - Ma, Zhanyu
A2 - Zheng, Weishi
A2 - Zha, Hongbin
A2 - Chen, Xilin
A2 - Wang, Liang
PB - Springer Science and Business Media Deutschland GmbH
T2 - 6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023
Y2 - 13 October 2023 through 15 October 2023
ER -