Research Article Open Access

Phishing Website Detection Using Improved Multilayered Convolutional Neural Networks

Hadia Bibi1, Syed Rehan Shah2, Mirza Murad Baig3, Muhammad Imran Sharif4, Mehwish Mehmood5, Zahid Akhtar6 and Kamran Siddique7
  • 1 Department of Computer Science, Bahauddin Zakariya University, Multan, Pakistan
  • 2 Department of Computer Science, Muhammad Nawaz Shareef University of Agriculture, Multan, Pakistan
  • 3 Department of Computer Science, National University of Modern Languages, Islamabad, Pakistan
  • 4 Department of Computer Science, Kansas State University, Manhattan, KS, United States
  • 5 Department of Electrical and Computer Engineering, Comsats University Islamabad, Islamabad Campus, Pakistan
  • 6 Department of Network and Computer Security, University of New York Polytechnic Institute, Utica, United States
  • 7 Department of Computer Science and Engineering, University of Alaska Anchorage, Anchorage, United States

Abstract

The internet has become an essential part of many fields: Communication, entertainment, commerce, industrial production, agriculture, etc. Unfortunately, online users are vulnerable to various attacks; this could lead to financial damages and loss of personal information. Phishing is seen as an internet threat and a cybercrime where anyone can capture personal information and data by posing as a reliable source. Data may include passwords to access confidential private or industrial repositories, emails, banks, financial information, etc. The prediction task is one of the crucial aspects of modern security systems, including anti-virus, firewall, and anti-spyware software. Currently, there is no availability of a single technique that can effectively detect every phishing attack. This study proposes a novel intelligent approach, phishing Prediction, using machine learning and deep learning to accurately predict phishing websites. We apply a pre-processing pipeline and develop the model using four machine learning models namely decision tree, Naive Bayes, support vector machine random forest, and Convolutional Neural Network (CNN) as a deep learning model. The UCI machine learning repository dataset comprised 11,055 websites, including lists of 4898 phishing and 6157 legitimate websites. The multilayered CNN has achieved the highest accuracy of 99.1% among all the listed algorithms, showcasing a precision of 97, a recall of 96%, and an F1-score of 96%.

Journal of Computer Science
Volume 20 No. 9, 2024, 1069-1079

DOI: https://doi.org/10.3844/jcssp.2024.1069.1079

Submitted On: 26 February 2024 Published On: 6 July 2024

How to Cite: Bibi, H., Shah, S. R., Baig, M. M., Sharif, M. I., Mehmood, M., Akhtar, Z. & Siddique, K. (2024). Phishing Website Detection Using Improved Multilayered Convolutional Neural Networks. Journal of Computer Science, 20(9), 1069-1079. https://doi.org/10.3844/jcssp.2024.1069.1079

  • 777 Views
  • 362 Downloads
  • 0 Citations

Download

Keywords

  • Machine Learning
  • Deep Learning
  • Phishing Website Prediction
  • Classification
  • CNN