Academic article

Information-Corrected Estimation: A Generalization Error Reducing Parameter Estimation Method

Document Type

article

Author

Matthew Dixon; Tyler Ward

Source

Entropy, Vol 23, Iss 11, p 1419 (2021)

Subject

generalization error
overfitting
information criteria
entropy
Science
Astrophysics
QB460-466
Physics
QC1-999

Language

English

ISSN

1099-4300

Abstract

Modern computational models in supervised machine learning are often highly parameterized universal approximators. As such, the value of the parameters is unimportant, and only the out-of-sample performance is considered. On the other hand, much of the literature on model estimation assumes that the parameters themselves have intrinsic value, and thus is concerned with the bias and variance of parameter estimates, which may not have any simple relationship to out-of-sample model performance. Therefore, within supervised machine learning, heavy use is made of ridge regression (i.e., L2 regularization), which requires the estimation of hyperparameters and can be rendered ineffective by certain model parameterizations. We introduce an objective function, which we refer to as Information-Corrected Estimation (ICE), that reduces KL-divergence-based generalization error for supervised machine learning. ICE attempts to directly maximize a corrected likelihood function as an estimator of the KL divergence. Such an approach is proven, theoretically, to be effective for a wide class of models, with only mild regularity restrictions. Under finite sample sizes, this corrected estimation procedure is shown experimentally to lead to a significant reduction in generalization error compared to maximum likelihood estimation and L2 regularization.
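
The abstract's remark that L2 regularization "can be rendered ineffective by certain model parameterizations" can be illustrated with a small sketch. This is not the ICE method and is not taken from the article; it is one plausible reading of the remark, using ordinary closed-form ridge regression on synthetic data. The helper `ridge_fit`, the data, the rescaling factor, and the penalty `lam` are all assumptions made for illustration.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge solution: (X^T X + lam * I)^{-1} X^T y (hypothetical helper)."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(n_features), X.T @ y)

# Synthetic regression data (assumed for illustration only).
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=50)

# Reparameterize the same model by rescaling one input column.
# The family of representable functions is unchanged.
scale = np.array([1.0, 1000.0, 1.0])
X_scaled = X * scale

lam = 10.0  # assumed penalty strength

# Unpenalized least squares (lam = 0): fitted values do not depend on the scaling.
pred_ols = X @ ridge_fit(X, y, 0.0)
pred_ols_scaled = X_scaled @ ridge_fit(X_scaled, y, 0.0)

# Ridge: the penalty on the rescaled coefficient becomes negligible,
# so the fitted values change with the parameterization.
pred_ridge = X @ ridge_fit(X, y, lam)
pred_ridge_scaled = X_scaled @ ridge_fit(X_scaled, y, lam)

ols_gap = np.max(np.abs(pred_ols - pred_ols_scaled))
ridge_gap = np.max(np.abs(pred_ridge - pred_ridge_scaled))
print(f"max prediction change, least squares: {ols_gap:.2e}")
print(f"max prediction change, ridge:         {ridge_gap:.2e}")
```

In this sketch the least-squares fit is unaffected by the rescaling, while the ridge fit changes because the penalty no longer constrains the rescaled coefficient. That sensitivity to parameterization is the kind of limitation the abstract appears to refer to.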
