학술논문

Ensemble Model for Enhancing Classification Accuracy in Intrusion Detection Systems
Document Type
Conference
Source
2021 International Conference on Electrical, Computer and Energy Technologies (ICECET) Electrical, Computer and Energy Technologies (ICECET), 2021 International Conference on. :1-7 Dec, 2021
Subject
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Fields, Waves and Electromagnetics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Solid modeling
Machine learning algorithms
Computational modeling
Stacking
Network intrusion detection
Machine learning
Predictive models
machine learning
classification
ensemble model
intrusion detection system
supervised learning
network security
Language
Abstract
Currently, cases of data breaches are prevalent, partly due to the inability of the Intrusion Detection Systems (IDS) used to offer on-demand network protection through real-time intrusion detection. This study sought to develop an ideal machine learning model for enhancing accuracy in network intrusion detection to address this problem. Naïve Bayes, Artificial Neural Network, K nearest Neighbor, Support Vector Machine, and C 4.5 algorithms were trained and tested on the CIC-IDS2017 dataset using the k-folds cross-validation approach. AdaBoost, Bootstrap Aggregation, and Stacking ensemble models, using each of the five algorithms as base models, were also trained and tested on the same dataset. A comparison of the performance of the individual models and the ensemble models was done, and the best performing model was selected and tuned with respect to the number of iterations, batch size, and weight threshold to further enhance its quality and accuracy in prediction. AdaBoost ensemble model with C 4.5 as the base algorithm was found to give an effective model that could be implemented on IDS to enhance precision and recall, which translates to increased accuracy and efficiency in the classification of new instances. Depending on the volume of the data packets being transmitted on the organizational network, the model may require tuning on the batch size and the number of iterations in order to increase its accuracy, efficiency, and consistency in light of the available computational resources.