TY - JOUR
T1 - Autonomous deep learning
T2 - A genetic DCNN designer for image classification
AU - Ma, Benteng
AU - Li, Xiang
AU - Xia, Yong
AU - Zhang, Yanning
N1 - Publisher Copyright:
© 2019 Elsevier B.V.
PY - 2020/2/28
Y1 - 2020/2/28
N2 - Recent years have witnessed the breakthrough success of deep convolutional neural networks (DCNNs) in image classification and other vision applications. DCNNs have a distinct advantage over traditional solutions: they provide a uniform feature extraction-classification framework that frees users from troublesome handcrafted feature extraction. However, DCNNs are far from autonomous, since their performance relies heavily on handcrafted architectures, which require considerable expertise and experience to design and cannot be continuously improved once the tuning of hyper-parameters converges. In this paper, we propose an autonomous and continuous learning (ACL) algorithm that automatically generates a DCNN architecture for each given vision task. We first partition a DCNN into multiple stacked meta convolutional blocks and fully connected blocks, each of which may contain convolution, pooling, fully connected, batch normalization, activation, and dropout operations, and thus convert the architecture into an integer code. Then, we use genetic evolutionary operations, including selection, mutation, and crossover, to evolve a population of DCNN architectures. We have evaluated this algorithm on six image classification tasks: MNIST, Fashion-MNIST, EMNIST-Letters, EMNIST-Digits, CIFAR10, and CIFAR100. Our results indicate that the proposed ACL algorithm is able to evolve the DCNN architecture continuously if more computation time is allowed and can find a suboptimal DCNN architecture whose performance is comparable to the state of the art.
AB - Recent years have witnessed the breakthrough success of deep convolutional neural networks (DCNNs) in image classification and other vision applications. DCNNs have a distinct advantage over traditional solutions: they provide a uniform feature extraction-classification framework that frees users from troublesome handcrafted feature extraction. However, DCNNs are far from autonomous, since their performance relies heavily on handcrafted architectures, which require considerable expertise and experience to design and cannot be continuously improved once the tuning of hyper-parameters converges. In this paper, we propose an autonomous and continuous learning (ACL) algorithm that automatically generates a DCNN architecture for each given vision task. We first partition a DCNN into multiple stacked meta convolutional blocks and fully connected blocks, each of which may contain convolution, pooling, fully connected, batch normalization, activation, and dropout operations, and thus convert the architecture into an integer code. Then, we use genetic evolutionary operations, including selection, mutation, and crossover, to evolve a population of DCNN architectures. We have evaluated this algorithm on six image classification tasks: MNIST, Fashion-MNIST, EMNIST-Letters, EMNIST-Digits, CIFAR10, and CIFAR100. Our results indicate that the proposed ACL algorithm is able to evolve the DCNN architecture continuously if more computation time is allowed and can find a suboptimal DCNN architecture whose performance is comparable to the state of the art.
KW - Deep convolutional neural networks (DCNNs)
KW - Genetic algorithm (GA)
KW - Image classification
KW - Neural architecture search
UR - http://www.scopus.com/inward/record.url?scp=85075525112&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2019.10.007
DO - 10.1016/j.neucom.2019.10.007
M3 - Article
AN - SCOPUS:85075525112
SN - 0925-2312
VL - 379
SP - 152
EP - 161
JO - Neurocomputing
JF - Neurocomputing
ER -