VNAS: Variational Neural Architecture Search

Benteng Ma, Jing Zhang, Yong Xia, Dacheng Tao

Research output: Contribution to journal › Article › peer-review


Abstract

Differentiable neural architecture search delivers a point estimate of the optimal architecture, which assigns arbitrarily high confidence to the learned architecture. This approach therefore suffers from poor calibration and robustness, in contrast with the maximum a posteriori estimation scheme. In this paper, we propose a novel Variational Neural Architecture Search (VNAS) method that estimates and exploits the weight variability in the following three steps. VNAS first learns the weight distribution through variational inference, maximizing the evidence lower bound on the marginal likelihood of the architecture with an unbiased Monte Carlo gradient estimator. A group of candidate architectures is then drawn from the learned weight distribution subject to a complexity constraint. The optimal architecture is finally selected by a novel training-free architecture-performance estimator, which scores network architectures at initialization without any training and thus significantly reduces the computational cost of architecture selection. Extensive experiments show that VNAS significantly outperforms state-of-the-art methods in classification performance and adversarial robustness.
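The abstract compresses three technical steps, so the sketches below may help make them concrete. First, the objective: the abstract says the weight distribution is learned by variational inference on a lower bound of the marginal likelihood, but does not spell the bound out. The standard evidence lower bound (ELBO) for a variational posterior q_φ(w) over weights w given data D takes the following textbook form; this is an assumption about the exact objective, not a quotation from the paper:

```latex
\mathcal{L}(\phi)
  = \mathbb{E}_{q_\phi(w)}\!\left[\log p(\mathcal{D}\mid w)\right]
  - \mathrm{KL}\!\left(q_\phi(w)\,\|\,p(w)\right)
  \;\le\; \log p(\mathcal{D})
```

The following minimal Python sketch walks through the three steps on a toy problem: (1) fitting a diagonal Gaussian posterior over architecture weights with the reparameterization trick, which yields the kind of unbiased Monte Carlo gradient estimate the abstract mentions; (2) drawing candidates from the learned distribution under a complexity budget; (3) ranking them with a training-free score at initialization. Every name here (`log_likelihood`, `complexity`, `zero_shot_score`, the standard-normal prior) is an illustrative stand-in, not the authors' search space or estimator.

```python
# Toy sketch of the three VNAS steps, assuming a diagonal Gaussian
# variational posterior over architecture weights. Hypothetical code,
# not the paper's implementation.
import torch

torch.manual_seed(0)
n_ops = 4                                            # candidate operations
mu = torch.zeros(n_ops, requires_grad=True)          # variational mean
log_sigma = torch.zeros(n_ops, requires_grad=True)   # variational log-std
opt = torch.optim.Adam([mu, log_sigma], lr=0.05)

def log_likelihood(alpha):
    # Stand-in for the task likelihood of architecture weights alpha;
    # a real implementation would evaluate a supernet on training data.
    target = torch.tensor([2.0, 0.0, -1.0, 0.5])
    return -((alpha - target) ** 2).sum()

# Step 1: variational inference with the reparameterization trick,
# giving an unbiased Monte Carlo estimate of the ELBO gradient.
for step in range(200):
    eps = torch.randn(n_ops)
    alpha = mu + log_sigma.exp() * eps               # reparameterized sample
    # KL(q || N(0, I)) for a diagonal Gaussian, in closed form.
    kl = 0.5 * (mu ** 2 + (2 * log_sigma).exp() - 2 * log_sigma - 1).sum()
    loss = -(log_likelihood(alpha) - kl)             # negative ELBO
    opt.zero_grad()
    loss.backward()
    opt.step()

# Step 2: draw candidate architectures from the learned distribution,
# keeping only those within a complexity budget.
def complexity(alpha):
    return int((alpha > 0).sum())                    # toy proxy: active ops

candidates = []
with torch.no_grad():
    while len(candidates) < 8:
        sample = mu + log_sigma.exp() * torch.randn(n_ops)
        if complexity(sample) <= 2:                  # hypothetical budget
            candidates.append(sample)

# Step 3: rank candidates with a training-free score at initialization;
# a placeholder here, where the paper uses its own estimator.
def zero_shot_score(alpha):
    return alpha.abs().sum().item()

best = max(candidates, key=zero_shot_score)
print("selected architecture weights:", best)
```

The key design point the sketch illustrates is that the expensive inner training loop of differentiable NAS is replaced by cheap sampling plus an at-initialization score, so only the variational parameters are ever optimized.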

Original language: English
Pages (from-to): 3689-3713
Number of pages: 25
Journal: International Journal of Computer Vision
Volume: 132
Issue number: 9
DOIs
State: Published - Sep 2024

Keywords

  • Image classification
  • Neural architecture search
  • Neural network
