Welcome to Scholar Publishing Group

Machine Learning Theory and Practice, 2022, 3(3); doi: 10.38007/ML.2022.030302.

Intrusion Detection Classification Based on Random Forest

Author(s)

Baiming Liu

Corresponding Author:
Baiming Liu
Affiliation(s)

Beijing Potential Big Data Research Institute (PRI), Beijing 10095, China

Abstract

In recent years, network security has become increasingly severe and network intrusions are frequent. On the one hand, it is difficult to capture intrusions because of the increasingly large volume of computer networks and complex network topology, and on the other hand, it is more difficult to capture intrusions because the characteristics of intrusions are becoming more diverse and complex. The purpose of this paper is to study intrusion detection classification based on random forest. After constructing a high-precision decision tree, the double failure metric is used as the distance between two decision trees, and the k-means++ algorithm is used to select high-precision decision trees with a certain degree of independence to form the final random forest. The improved random forest algorithm for setting up clustering and the random forest algorithm were experimented as algorithm comparisons. The KDD-NSL dataset was selected and the experimental results were compared to demonstrate the effectiveness of the random forest algorithm based on gradient boosting. The experimental results show that the improved random forest model has a significant reduction in decision tree size and a significant increase in diversity due to the k-means++ algorithm for clustering and extracting cluster centers.

Keywords

Random Forest, Intrusion Detection, Detection Classification, K-means++ Clustering

Cite This Paper

Baiming Liu. Intrusion Detection Classification Based on Random Forest. Machine Learning Theory and Practice (2022), Vol. 3, Issue 3: 10-17. https://doi.org/10.38007/ML.2022.030302.

References

[1] Ankit Rajeshkumar Kharwar, Devendra V. Thakor: An Ensemble Approach for Feature Selection and Classification in Intrusion Detection Using Extra-Tree Algorithm. Int. J. Inf. Secur. Priv. 16(1): 1-21 (2022) https://doi.org/10.4018/IJISP.2022010113

[2] Kapil Kumar, Arvind Kumar, Vimal Kumar, Sunil Kumar: A Hybrid Classification Technique for Enhancing the Effectiveness of Intrusion Detection Systems Using Machine Learning. Int. J. Organ. Collect. Intell. 12(1): 1-18 (2022) https://doi.org/10.4018/IJOCI.2022010102

[3] P. Manjula, S. Baghavathi Priya: An effective network intrusion detection and classification system for securing WSN using VGG-19 and hybrid deep neural network techniques. J. Intell. Fuzzy Syst. 43(5): 6419-6432 (2022) https://doi.org/10.3233/JIFS-220444

[4] Padideh Choobdar, Marjan Naderan, Mahmood Naderan: Detection and Multi-Class Classification of Intrusion in Software Defined Networks Using Stacked Auto-Encoders and CICIDS2017 Dataset. Wirel. Pers. Commun. 123(1): 437-471 (2022) https://doi.org/10.1007/s11277-021-09139-y

[5] Sivamohan Krishnaveni, Sivanandam Sivamohan, S. S. Sridhar, S. Prabakaran: Efficient feature selection and classification through ensemble method for network intrusion detection on cloud computing. Clust. Comput. 24(3): 1761-1779 (2021) https://doi.org/10.1007/s10586-020-03222-y

[6] Zahra Asghari Varzaneh, Marjan Kuchaki Rafsanjani: Intrusion detection system using a new fuzzy rule-based classification system based on genetic algorithm. Intell. Decis. Technol. 15(2): 231-237 (2021) https://doi.org/10.3233/IDT-200036

[7] V. Sandeep, Saravanan Kondappan, A. Amir Anton Jone, S. Raj Barath: Anomaly Intrusion Detection Using SVM and C4.5 Classification with an Improved Particle Swarm Optimization (I-PSO). Int. J. Inf. Secur. Priv. 15(2): 113-130 (2021) https://doi.org/10.4018/IJISP.2021040106

[8] Dzelila Mehanovic, Dino Keco, Jasmin Kevric, Samed Jukic, Adnan Miljkovic, Zerina Masetic: Feature selection using cloud-based parallel genetic algorithm for intrusion detection data classification. Neural Comput. Appl. 33(18): 11861-11873 (2021) https://doi.org/10.1007/s00521-021-05871-5

[9] Maryam Yousefnezhad, Javad Hamidzadeh, Mohammad Aliannejadi: Ensemble classification for intrusion detection via feature extraction based on deep Learning. Soft Comput. 25(20): 12667-12683 (2021) https://doi.org/10.1007/s00500-021-06067-8

[10] Allen Yang, Boxiang Dong, Dawei Li, Weifeng Sun, Bharath K. Samanthula: DeepICU: imbalanced classification by using deep neural networks for network intrusion detection. Int. J. Big Data Intell. 7(3): 137-147 (2020) https://doi.org/10.1504/IJBDI.2020.10031966

[11] Imad Bouteraa, Makhlouf Derdour, Ahmed Ahmim: Intrusion detection using classification techniques: a comparative study. Int. J. Data Min. Model. Manag. 12(1): 65-86 (2020) https://doi.org/10.1504/IJDMMM.2020.105596

[12] Shen Kejia, Hamid Parvin, Sultan Noman Qasem, Bui Anh Tuan, Kim-Hung Pho: A classification model based on svm and fuzzy rough set for network intrusion detection. J. Intell. Fuzzy Syst. 39(5): 6801-6817 (2020) https://doi.org/10.3233/JIFS-191621

[13] Preethi Devan, Neelu Khare: An efficient XGBoost-DNN-based classification model for network intrusion detection system. Neural Comput. Appl. 32(16): 12499-12514 (2020) https://doi.org/10.1007/s00521-020-04708-x

[14] Julio Lamas Piñeiro, Lenis Wong Portillo: Web architecture for URL-based phishing detection based on Random Forest, Classification Trees, and Support Vector Machine. Inteligencia Artif. 25(69): 107-121 (2022) https://doi.org/10.4114/intartif.vol25iss69pp107-121

[15] María Guadalupe Bedolla-Ibarra, María del Cármen Cabrera-Hernández, Marco Antonio Aceves-Fernández, Saúl Tovar-Arriaga: Classification of attention levels using a Random Forest algorithm optimized with Particle Swarm Optimization. Evol. Syst. 13(5): 687-702 (2022) https://doi.org/10.1007/s12530-022-09444-2

[16] Tiebo Sun, Jinhao Liu, Jiangming Kan, Tingting Sui: Research on target classification method for dense matching point cloud based on improved random forest algorithm. Int. J. Inf. Commun. Technol. 21(3): 290-303 (2022) https://doi.org/10.1504/IJICT.2022.125541

[17] Nasrin Amini, Ahmad Shalbaf: Automatic classification of severity of COVID-19 patients using texture feature and random forest based on computed tomography images. Int. J. Imaging Syst. Technol. 32(1): 102-110 (2022) https://doi.org/10.1002/ima.22679

[18] C. Venkata Narasimhulu: An automatic feature selection and classification framework for analyzing ultrasound kidney images using dragonfly algorithm and random forest classifier. IET Image Process. 15(9): 2080-2096 (2021) https://doi.org/10.1049/ipr2.12179