학술논문

Single-Domain Generalized Predictor for Neural Architecture Search System

Document Type

Periodical

Author

Ma, L.; Kang, H.; Yu, G.; Li, Q.; He, Q.

Source

IEEE Transactions on Computers IEEE Trans. Comput. Computers, IEEE Transactions on. 73(5):1400-1413 May, 2024

Subject

Computing and Processing
Computer architecture
Feature extraction
Metalearning
Training
Task analysis
Costs
Search problems
Neural architecture search
performance predictor
domain generalization

Language

ISSN

0018-9340
1557-9956
2326-3814

Abstract

Performance predictors are used to reduce architecture evaluation costs in neural architecture search, which however suffers from a large amount of budget consumption in annotating substantial architectures trained from scratch. Hence, how to leverage existing annotated architectures to train a generalized predictor to find the optimal architecture on unseen target search spaces becomes a new research topic. To solve this issue, we propose a Single-Domain Generalized Predictor (SDGP), which aims to make the predictor only trained on a single source search space but perform well on target search spaces. In meta-learning, we firstly adopt feature extractor in learning the domain-invariant features of the architectures. Then, a neural predictor is trained to map the architectures to the accuracy of the candidate architectures over the target domain simulated on the source search space. Moreover, a novel multi-head attention driven regularizer is designed to regulate the predictor to further improve the generalization ability of the predictor for the feature extractor. A series of experimental results have shown that the proposed predictor outperforms the state-of-the-art predictors in generalization and achieves significant performance gains in finding the optimal architectures with test error 2.40% on CIFAR-10 and 23.20% on ImageNet1k within 0.01 GPU days.

Online Access

Full Text (IEEE) Web of Science JCR 저널정보 Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송