학술논문

BHE-DARTS: Bilevel Optimization Based on Hypergradient Estimation for Differentiable Architecture Search
Document Type
Conference
Source
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2023 - 2023 IEEE International Conference on. :1-5 Jun, 2023
Subject
Bioengineering
Communication, Networking and Broadcast Technologies
Computing and Processing
Signal Processing and Analysis
Jacobian matrices
Stochastic processes
Estimation
Signal processing
Acoustics
Speech processing
Optimization
Neural Architecture Search
Bilevel Optimization
Jacobian- and Hessian-vector product
Language
ISSN
2379-190X
Abstract
In this paper, we propose a stochastic bilevel optimization approach based on a hypergradient estimator, called BHE- DARTS, as a remedy for this issue that it is easy to search for locally optimal structures rather than globally optimal ones in Differentiable Architecture Search (DARTS) bilevel optimization model. To be specific, we apply a stochastic gradient for updating the lower level variable ω and design a hypergradient estimator, which is built by the Jacobian- and Hessianvector product, to assist in updating the upper level variable α. This operation can more fully apply the gradient information to escape the trap of local optimal in the NAS bilevel model. Compared to state-of-the-art DARTS methods, experimental studies have shown the competitive performance of the proposed BHE-DARTS in the DARTS search space (CIFAR-100: a test accuracy rate of 82.69 % ) and NAS-Bench-201 search space (ImageNet16-120: a test accuracy rate of 42.44%).