Phishing Website Detection Using Machine Learning Algorithms
Phishing Website Detection Using Machine Learning Algorithms
net/publication/328541785
CITATIONS READS
5 12,401
2 authors:
Some of the authors of this publication are also working on these related projects:
Reinforcement Learning for anomalous path detection using wearable View project
All content following this page was uploaded by Rishikesh Mahajan on 14 June 2019.
45
International Journal of Computer Applications (0975 – 8887)
Volume 181 – No. 23, October 2018
46
International Journal of Computer Applications (0975 – 8887)
Volume 181 – No. 23, October 2018
6. CONCLUSION
This paper aims to enhance detection method to detect
phishing websites using machine learning technology. We
Fig. 1 Detection accuracy comparison
achieved 97.14% detection accuracy using random forest
5. IMPLEMENTATION AND RESULT algorithm with lowest false positive rate. Also result shows
Scikit-learn tool has been used to import Machine learning that classifiers give better performance when we used more
algorithms. Dataset is divided into training set and testing set data as training data.
in 50:50, 70:30 and 90:10 ratios respectively. Each classifier In future hybrid technology will be implemented to detect
is trained using training set and testing set is used to evaluate phishing websites more accurately, for which random forest
performance of classifiers. Performance of classifiers has been algorithm of machine learning technology and blacklist
evaluated by calculating classifier's accuracy score, false method will be used.
negative rate and false positive rate.
Table 1: Classifier's performance
7. REFERENCES
[1] Gunter Ollmann, “The Phishing Guide Understanding &
Dataset False False Preventing Phishing Attacks”, IBMInternet Security
Accuracy Systems, 2007.
Split Classifiers Negative Positive
Score
ratio Rate Rate
[2] https://resources.infosecinstitute.com/category/enterprise
Decision Tree 96.71 3.69 2.93 /phishing/the-phishing-landscape/phishing-data-attack-
statistics/#gref
Random [3] Mahmoud Khonji, Youssef Iraqi, "Phishing Detection: A
50:50 96.72 3.69 2.91 Literature Survey IEEE, and Andrew Jones, 2013
Forest
Support [4] Mohammad R., Thabtah F. McCluskey L., (2015)
vector 96.40 5.26 2.08 Phishing websites dataset. Available:
machine https://archive.ics.uci.edu/ml/datasets/Phishing+Websites
Accessed January 2016
Decision Tree 96.80 3.43 2.99
[5] http://dataaspirant.com/2017/01/30/how-decision-tree-
algorithm-works/
Random
70:30 96.84 3.35 2.98
Forest [6] http://dataaspirant.com/2017/05/22/random-forest-
Support algorithm-machine-learing/
vector 96.40 5.13 2.17 [7] https://www.kdnuggets.com/2016/07/support-vector-
machine machines-simple-explanation.html
IJCATM : www.ijcaonline.org
47