학술논문

A PHP and JSP Web Shell Detection System with Text Processing Based on Machine Learning
Document Type
Conference
Source
2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom) TRUSTCOM Trust, Security and Privacy in Computing and Communications (TrustCom), 2020 IEEE 19th International Conference on. :1584-1591 Dec, 2020
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Signal Processing and Analysis
Privacy
Machine learning algorithms
Intrusion detection
Feature extraction
Numerical models
Security
Text processing
Web Shell
Opcode
Bytecode
AST
Machine Learning
XGBoost
Language
ISSN
2324-9013
Abstract
Web shell is one of the most common network attack methods, and traditional detection methods may not detect complex and flexible variants of web shell attacks. In this paper, we present a comprehensive detection system that can detect both PHP and JSP web shells. After file classification, we use different feature extraction methods, i.e. AST for PHP files and bytecode for JSP files. We present a detection model based on text processing methods including TF-IDF and Word2vec algorithms. We combine different kinds of machine learning algorithms and perform a comprehensively controlled experiment. After the experiment and evaluation, we choose the detection machine learning model of the best performance, which can achieve a high detection accuracy above 98%.