NAS-FCOS: Efficient Search for Object Detection Architectures

Ning Wang; Yang Gao; Hao Chen; Peng Wang; Zhi Tian; Chunhua Shen; Yanning Zhang

doi:10.1007/s11263-021-01523-2

School of Computer Science

Research output: Contribution to journal › Article › peer-review

19 Citations (Scopus)

Abstract

Neural Architecture Search (NAS) has shown great potential in reducing manual effort in network design by automatically discovering optimal architectures. Notably, object detection has so far received little attention from NAS algorithms despite its importance in computer vision. To the best of our knowledge, most recent NAS studies on object detection fail to strike a satisfactory balance between the performance and efficiency of the resulting models, let alone the excessive computational resources those algorithms consume. Here we propose an efficient method to obtain better object detectors by searching for the feature pyramid network as well as the prediction head of a simple anchor-free object detector, namely, FCOS (Tian et al. in FCOS: Fully convolutional one-stage object detection, 2019), using a tailored reinforcement learning paradigm. With a carefully designed search space, search algorithms, and strategies for evaluating network quality, we are able to find top-performing detection architectures within 4 days using 8 V100 GPUs. The discovered architectures surpass state-of-the-art object detection models (such as Faster R-CNN, RetinaNet, and FCOS) by 1.0 to 5.4 percentage points in AP on the COCO dataset, with comparable computational complexity and memory footprint, demonstrating the efficacy of the proposed NAS method for object detection. Code is available at https://github.com/Lausannen/NAS-FCOS.

Original language	English
Pages (from-to)	3299-3312
Number of pages	14
Journal	International Journal of Computer Vision
Volume	129
Issue number	12
DOI	https://doi.org/10.1007/s11263-021-01523-2
Publication status	Published - Dec 2021

Cite this

@article{04e19ef4edae4b4f9e94dc45a2f374a6,

title = "NAS-FCOS: Efficient Search for Object Detection Architectures",

abstract = "Neural Architecture Search (NAS) has shown great potential in reducing manual effort in network design by automatically discovering optimal architectures. Notably, object detection has so far received little attention from NAS algorithms despite its importance in computer vision. To the best of our knowledge, most recent NAS studies on object detection fail to strike a satisfactory balance between the performance and efficiency of the resulting models, let alone the excessive computational resources those algorithms consume. Here we propose an efficient method to obtain better object detectors by searching for the feature pyramid network as well as the prediction head of a simple anchor-free object detector, namely, FCOS (Tian et al. in FCOS: Fully convolutional one-stage object detection, 2019), using a tailored reinforcement learning paradigm. With a carefully designed search space, search algorithms, and strategies for evaluating network quality, we are able to find top-performing detection architectures within 4 days using 8 V100 GPUs. The discovered architectures surpass state-of-the-art object detection models (such as Faster R-CNN, RetinaNet, and FCOS) by 1.0 to 5.4 percentage points in AP on the COCO dataset, with comparable computational complexity and memory footprint, demonstrating the efficacy of the proposed NAS method for object detection. Code is available at https://github.com/Lausannen/NAS-FCOS.",

keywords = "Deep learning, Neural architecture search, Object detection, Reinforcement learning",

author = "Ning Wang and Yang Gao and Hao Chen and Peng Wang and Zhi Tian and Chunhua Shen and Yanning Zhang",

note = "Publisher Copyright: {\textcopyright} 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.",

year = "2021",

month = dec,

doi = "10.1007/s11263-021-01523-2",

language = "English",

volume = "129",

pages = "3299--3312",

journal = "International Journal of Computer Vision",

issn = "0920-5691",

publisher = "Springer Netherlands",

number = "12",

}

TY - JOUR

T1 - NAS-FCOS: Efficient Search for Object Detection Architectures

AU - Wang, Ning

AU - Gao, Yang

AU - Chen, Hao

AU - Wang, Peng

AU - Tian, Zhi

AU - Shen, Chunhua

AU - Zhang, Yanning

PY - 2021/12

Y1 - 2021/12

AB - Neural Architecture Search (NAS) has shown great potential in reducing manual effort in network design by automatically discovering optimal architectures. Notably, object detection has so far received little attention from NAS algorithms despite its importance in computer vision. To the best of our knowledge, most recent NAS studies on object detection fail to strike a satisfactory balance between the performance and efficiency of the resulting models, let alone the excessive computational resources those algorithms consume. Here we propose an efficient method to obtain better object detectors by searching for the feature pyramid network as well as the prediction head of a simple anchor-free object detector, namely, FCOS (Tian et al. in FCOS: Fully convolutional one-stage object detection, 2019), using a tailored reinforcement learning paradigm. With a carefully designed search space, search algorithms, and strategies for evaluating network quality, we are able to find top-performing detection architectures within 4 days using 8 V100 GPUs. The discovered architectures surpass state-of-the-art object detection models (such as Faster R-CNN, RetinaNet, and FCOS) by 1.0 to 5.4 percentage points in AP on the COCO dataset, with comparable computational complexity and memory footprint, demonstrating the efficacy of the proposed NAS method for object detection. Code is available at https://github.com/Lausannen/NAS-FCOS.

KW - Deep learning

KW - Neural architecture search

KW - Object detection

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85117146917&partnerID=8YFLogxK

U2 - 10.1007/s11263-021-01523-2

DO - 10.1007/s11263-021-01523-2

M3 - Article

AN - SCOPUS:85117146917

SN - 0920-5691

VL - 129

SP - 3299

EP - 3312

JO - International Journal of Computer Vision

JF - International Journal of Computer Vision

IS - 12

ER -
