TY - GEN
T1 - Performance Prediction Based on Neural Architecture Features
AU - Long, Duo
AU - Zhang, Shizhou
AU - Zhang, Yanning
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/9
Y1 - 2019/9
N2 - Nerual Architecture Search (NAS) usually requires to train quantities of candidate neural networks on dataset for choosing high performance network architecture and optimizing hyperparameters, which is very time consuming and computationally expensive. In order to resolve the issue we try to use performance prediction method to predict the model performance with little or even no training steps. We assume that the performance is determined once the architecture and other hyperparameters are chosen. So we firstly extract the sequence features of the chain-structured neural architecture by introducing N-grams model to process architecture textual description. Subsequently, based on the extracted neural architecture features, we use appropriate regression model to predict validation accuracies for modeling learning curve. Through a series of experimental comparisons, we verify the effectiveness of our proposed performance prediction method and the effective acceleration of the NAS process.
AB - Nerual Architecture Search (NAS) usually requires to train quantities of candidate neural networks on dataset for choosing high performance network architecture and optimizing hyperparameters, which is very time consuming and computationally expensive. In order to resolve the issue we try to use performance prediction method to predict the model performance with little or even no training steps. We assume that the performance is determined once the architecture and other hyperparameters are chosen. So we firstly extract the sequence features of the chain-structured neural architecture by introducing N-grams model to process architecture textual description. Subsequently, based on the extracted neural architecture features, we use appropriate regression model to predict validation accuracies for modeling learning curve. Through a series of experimental comparisons, we verify the effectiveness of our proposed performance prediction method and the effective acceleration of the NAS process.
KW - architecture feature
KW - NAS
KW - performance prediction
KW - regression model
UR - http://www.scopus.com/inward/record.url?scp=85075718671&partnerID=8YFLogxK
U2 - 10.1109/CCHI.2019.8901943
DO - 10.1109/CCHI.2019.8901943
M3 - 会议稿件
AN - SCOPUS:85075718671
T3 - Proceedings - 2nd China Symposium on Cognitive Computing and Hybrid Intelligence, CCHI 2019
SP - 77
EP - 80
BT - Proceedings - 2nd China Symposium on Cognitive Computing and Hybrid Intelligence, CCHI 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd China Symposium on Cognitive Computing and Hybrid Intelligence, CCHI 2019
Y2 - 21 September 2019 through 22 September 2019
ER -