학술논문

De-Biased Sparse PCA: Inference for Eigenstructure of Large Covariance Matrices
Document Type
Periodical
Source
IEEE Transactions on Information Theory IEEE Trans. Inform. Theory Information Theory, IEEE Transactions on. 67(4):2507-2527 Apr, 2021
Subject
Communication, Networking and Broadcast Technologies
Signal Processing and Analysis
Eigenvalues and eigenfunctions
Statistics
Sociology
Covariance matrices
Principal component analysis
Estimation
Loading
Covariance matrix
eigenvectors
eigenvalues
PCA
high-dimensional model
sparsity
Lasso
asymptotic normality
confidence intervals
Language
ISSN
0018-9448
1557-9654
Abstract
Sparse principal component analysis has become one of the most widely used techniques for dimensionality reduction in high-dimensional datasets. While many methods are available for point estimation of eigenstructure in high-dimensional settings, in this paper we propose methodology for uncertainty quantification, such as construction of confidence intervals and tests for the principal eigenvector and the corresponding largest eigenvalue. We base our methodology on an M-estimator with Lasso penalty which achieves minimax optimal rates and is used to construct a de-biased sparse PCA estimator. The novel estimator has a Gaussian limiting distribution and can be used for hypothesis testing or support recovery of the first eigenvector. The empirical performance of the new estimator is demonstrated on synthetic data and we also show that the estimator compares favourably with the classical PCA in moderately high-dimensional regimes.