Academic Article

Improving Log-Based Anomaly Detection by Pre-Training Hierarchical Transformers
Document Type
Periodical
Source
IEEE Transactions on Computers, 72(9):2656-2667, Sep. 2023
Subject
Computing and Processing
Task analysis
Bit error rate
Anomaly detection
Transformers
Feature extraction
Computational modeling
Semantics
Log-based anomaly detection
pre-training
hierarchical transformers
robustness
Language
English
ISSN
0018-9340
1557-9956
2326-3814
Abstract
Pre-trained models, such as BERT, have resulted in significant improvements in many natural language processing (NLP) applications. However, due to differences in word distribution and domain data distribution, applying NLP advancements directly to log analysis faces performance challenges. This paper studies how to adapt the recently introduced pre-trained language model BERT for log analysis. In this work, we propose a pre-trained log representation model with hierarchical bidirectional encoder transformers (namely, HilBERT). Unlike previous work, which used raw text as pre-training data, we parse logs into templates before using the log templates to pre-train HilBERT. We also design a hierarchical transformer model to capture log template sequence-level information. We use log-based anomaly detection as the downstream task and fine-tune our model with different log data. Our experiments demonstrate that HilBERT outperforms other baseline techniques on unstable log data. While BERT obtains performance comparable to that of previous state-of-the-art models, HilBERT can significantly address the problem of log instability and achieve accurate and robust results.
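The abstract describes a two-level design: a lower encoder over the tokens of each parsed log template and an upper encoder over the sequence of templates, with an anomaly-detection head for fine-tuning. The PyTorch sketch below illustrates that general idea only; the class name, layer sizes, and mean pooling are illustrative assumptions, not the paper's actual HilBERT implementation.

```python
# Minimal sketch of a two-level (hierarchical) transformer over parsed log
# templates, assuming PyTorch. Names and hyperparameters are hypothetical.
import torch
import torch.nn as nn

class HierarchicalLogEncoder(nn.Module):
    def __init__(self, vocab_size=1000, dim=128):
        super().__init__()
        self.token_emb = nn.Embedding(vocab_size, dim)
        # Lower level: encodes the tokens of one log template into a vector.
        token_layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.token_encoder = nn.TransformerEncoder(token_layer, num_layers=2)
        # Upper level: encodes the sequence of template vectors (session context).
        seq_layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.seq_encoder = nn.TransformerEncoder(seq_layer, num_layers=2)
        # Binary head for the downstream anomaly-detection fine-tuning.
        self.classifier = nn.Linear(dim, 2)

    def forward(self, token_ids):
        # token_ids: (batch, templates, tokens) integer IDs of template tokens.
        b, t, s = token_ids.shape
        x = self.token_emb(token_ids).view(b * t, s, -1)
        x = self.token_encoder(x).mean(dim=1)            # one vector per template
        x = self.seq_encoder(x.view(b, t, -1)).mean(1)   # one vector per session
        return self.classifier(x)                        # anomaly logits

# Usage: a batch of 2 log sessions, each with 8 templates of 16 tokens.
model = HierarchicalLogEncoder()
logits = model(torch.randint(0, 1000, (2, 8, 16)))
print(logits.shape)  # torch.Size([2, 2])
```

Operating on parsed templates rather than raw log text, as the abstract notes, keeps the lower-level vocabulary small and stable, which is what the sequence-level encoder relies on when log formats drift.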