TY - JOUR
T1 - VNAS: Variational Neural Architecture Search
T2 - International Journal of Computer Vision
AU - Ma, Benteng
AU - Zhang, Jing
AU - Xia, Yong
AU - Tao, Dacheng
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
PY - 2024/9
Y1 - 2024/9
AB - Differentiable neural architecture search delivers a point estimate of the optimal architecture, which assigns arbitrarily high confidence to the learned architecture and thus suffers in calibration and robustness compared with the maximum a posteriori estimation scheme. In this paper, we propose a novel Variational Neural Architecture Search (VNAS) method that estimates and exploits weight variability in three steps. VNAS first learns the weight distribution through variational inference, maximizing the evidence lower bound on the marginal likelihood of the architecture with unbiased Monte Carlo gradient estimates. A group of optimal architecture candidates is then drawn from the learned weight distribution under a complexity constraint. The optimal architecture is finally inferred with a novel training-free architecture-performance estimator that scores network architectures at initialization, without training, which significantly reduces the computational cost of architecture selection. Extensive experiments show that VNAS significantly outperforms state-of-the-art methods in classification performance and adversarial robustness.
KW - Image classification
KW - Neural architecture search
KW - Neural network
UR - http://www.scopus.com/inward/record.url?scp=85191090122&partnerID=8YFLogxK
DO - 10.1007/s11263-024-02014-w
M3 - Article
AN - SCOPUS:85191090122
SN - 0920-5691
VL - 132
SP - 3689
EP - 3713
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
IS - 9
ER -
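Note: the abstract's first step, learning a weight distribution by variational inference with unbiased Monte Carlo gradient estimates, corresponds to standard reparameterization-trick training of a Gaussian variational posterior. The sketch below is illustrative only, not the authors' code: the single linear layer, the standard-normal prior, the names mu/rho, and the KL scaling factor are all assumptions made for demonstration.

```python
# Minimal sketch: one optimization step of variational weight learning with
# an unbiased Monte Carlo gradient via the reparameterization trick.
# Illustrative only; not the VNAS implementation.
import torch
import torch.nn.functional as F

# Hypothetical shapes: a single linear layer scoring 10 classes.
in_dim, n_classes = 64, 10
mu = torch.zeros(n_classes, in_dim, requires_grad=True)          # variational mean
rho = torch.full((n_classes, in_dim), -3.0, requires_grad=True)  # pre-softplus std

opt = torch.optim.Adam([mu, rho], lr=1e-2)

def elbo_loss(x, y):
    sigma = F.softplus(rho)          # ensure a positive standard deviation
    eps = torch.randn_like(sigma)    # reparameterization noise
    w = mu + sigma * eps             # sampled weights, differentiable in (mu, rho)
    nll = F.cross_entropy(x @ w.t(), y)  # 1-sample MC estimate of expected NLL
    # Closed-form KL(q(w) || N(0, I)) for a diagonal Gaussian posterior.
    kl = 0.5 * (sigma.pow(2) + mu.pow(2) - 1.0 - 2.0 * sigma.log()).sum()
    return nll + kl / 1000.0         # KL weighted by an assumed dataset size

x, y = torch.randn(32, in_dim), torch.randint(0, n_classes, (32,))
opt.zero_grad()
loss = elbo_loss(x, y)
loss.backward()                      # unbiased gradients w.r.t. mu and rho
opt.step()
```

Because the noise eps is drawn independently of (mu, rho), the gradient of this single-sample loss is an unbiased estimate of the gradient of the (negative) evidence lower bound, which is the property the abstract refers to.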