Hybrid machine learning model for phishing detection

Perceval Maturure*, Asim Ali, Alexander Gegov

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Downloads (Pure)

Abstract

Phishing threats have remained a long-standing information security issue for many years, causing billions of pounds in losses both in the United Kingdom and worldwide [1]. The aim of the study was to develop and evaluate the performance of the Machine Learning models that would detect and monitor phishing attacks more accurately. A single dataset with 42 features, a total of 247950 phishing and non-phishing emails was used to develop eight supervised machine learning models. The metrics used in evaluating the models show that the enhanced hybrid algorithm developed from combining two models (decision trees and the random forests) from the trained models generated the best classifier with an accuracy of 96%, precision 98%, f-measure 96%, sensitivity 94%, MCC 92% and ROC 96%. The enhanced hybrid voting model developed was integrated with a Django web application using 13 important features to build an accurate phishing detection and monitoring application. The model is proposed as a novel hybrid model because it demonstrated higher classification capabilities due to its inherent design to deal with complex patterns, overfitting issue and the presence of many features when compared to the other single analysis models in the experiments.
Original languageEnglish
Title of host publicationProceedings of 2024 IEEE 12th International Conference on Intelligent Systems (IS)
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350350982
ISBN (Print)9798350350999
DOIs
Publication statusPublished - 9 Oct 2024
Event12th IEEE International Conference on Intelligent Systems - Varna, Bulgaria
Duration: 29 Aug 202431 Aug 2024

Publication series

Name2024 IEEE 12th International Conference on Intelligent Systems (IS)
PublisherIEEE
ISSN (Print)2832-4145
ISSN (Electronic)2767-9802

Conference

Conference12th IEEE International Conference on Intelligent Systems
Country/TerritoryBulgaria
CityVarna
Period29/08/2431/08/24

Keywords

  • phishing detection
  • supervised machine learning
  • classification algorithms

Fingerprint

Dive into the research topics of 'Hybrid machine learning model for phishing detection'. Together they form a unique fingerprint.

Cite this