학술논문

The Local Learning Coefficient: A Singularity-Aware Complexity Measure

Document Type

Working Paper

Author

Lau, Edmund; Furman, Zach; Wang, George; Murfet, Daniel; Wei, Susan

Source

Subject

Statistics - Machine Learning
Computer Science - Artificial Intelligence
Computer Science - Machine Learning
62F15, 68T07, 14B05

Language

Abstract

The Local Learning Coefficient (LLC) is introduced as a novel complexity measure for deep neural networks (DNNs). Recognizing the limitations of traditional complexity measures, the LLC leverages Singular Learning Theory (SLT), which has long recognized the significance of singularities in the loss landscape geometry. This paper provides an extensive exploration of the LLC's theoretical underpinnings, offering both a clear definition and intuitive insights into its application. Moreover, we propose a new scalable estimator for the LLC, which is then effectively applied across diverse architectures including deep linear networks up to 100M parameters, ResNet image models, and transformer language models. Empirical evidence suggests that the LLC provides valuable insights into how training heuristics might influence the effective complexity of DNNs. Ultimately, the LLC emerges as a crucial tool for reconciling the apparent contradiction between deep learning's complexity and the principle of parsimony.
Comment: This version contains new empirical results and merged content from a related paper (arXiv:2402.03698) to provide a more comprehensive study

Online Access

Open Access (Arxiv) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송