Comparative Analysis of Supervised Learning Techniques of Machine Learning For Software Defect Prediction
Comparative Analysis of Supervised Learning Techniques of Machine Learning For Software Defect Prediction
10th International Conference on System Modeling & Advancement in Research Trends, 10th–11th December, 2021
Faculty of Engineering & Computing Sciences, Teerthanker Mahaveer University, Moradabad, India
Anurag Gupta1, Ratnesh Kumar Shukla2, Dr. Abhishek Bhola3, Alok Singh Sengar4
Computer Science & Engineering, College of Computing Science & Information Technology,Moradabad, India
1,2
3
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation,
Vaddeswaram, AP, India
4
Computer Science & Engineering, College of Computing Science & Information Technology, Moradabad, India
E-mail: [email protected], [email protected], [email protected], [email protected]
Abstract—Software Bug prediction or defect prediction is every tree present in Random Forest divides out the class
very important for the organizations to detect the bugs in the prediction and the class with maximum votes becomes
early stage of Software development process because software model’s prediction.
developers can know the vulnerable areas where the defects
may be present. 4) Artificial Neural Networks
In this research paper we have compared different Artificial neural networks is a supervised machine
statistical techniques like Linear Regression, Naïve learning Technique based on biological concept of working
Bayes, Random Forest, Decision Tree, Artificial Neural of brain which is based on small cells called neurons.
Networks etc. and come up with the best among them 5) Decision Tree (DT)
for the Bug prediction. Comparison is made using Decision tree is based on hierarchical concept in which
Performance Measures like Accuracy, precision, recall top down approach is used. Decisions are taken based on
and F-measure. the condition. if the condition is true then one child node
Keywords: Software Bug Prediction, Vulnerable, Statistical
is created and if false then other child node is created. This
Techniques
Process will continue unless we reach to leaf node.
I. Introduction II. Literature Review
In this paper study has been done on different statistical Different studies already made and references present
techniques and comparative Analysis has been done. in the references section.
There are many existing techniques through which we Malhotra, Ruchika [1] in her research paper shown
can do the software defect prediction based on the different comparative Analysis for software bug prediction
factors. Some of them are mentioned below which are used techniques. reviewed the techniques for software bug
in our Analysis prediction models and assessed their performance,
1) Linear Regression (LR) D’Ambros, Marco, Michele Lanza and others [2]
Linear regression is the supervised machine learning in their Research paper did useful comparison between
Technique which is using the concept of independent and different bug prediction methods. The study showed
dependent variables. When dependent variable is dependent comparison between different bug prediction methods, also
on a single variable then it becomes Single Linear introduced new approach and calculated its performance
Regression and when it is dependent on multiple variables by doing a good comparison with other approaches their
then it becomes Multiple Linear Regression. approach.
2) Naive Bayes Gupta and Saxena [3] generated a model for Object
Oriented Software Defect Forecasting System, the study
Naive Bayes is a Supervised machine learning technique
used many similar types of defect datasets which were
based on famous Mathematical Bayes Theorem to predict class
present in Promise Software Database. They proposed a
of datasets and it uses Posterior Probability and Pre Probabilities.
model with average model accuracy of 76.27%
It can be used for binary and multiclass classifications.
T. Gyimothy and others [4] analyzed various object
3) Random Forest oriented metrics. Results shown the Coupling between
Random forest is a Supervised learning technique is Objects metric (CBO) is the best metric to predict the bugs
nothing but a group of trees that works as an ensemble. in the class and the Lines of code as also good, but the
406 Copyright © IEEE–2021 ISBN: 978-1-6654-3970-1
Authorized licensed use limited to: UNIV ESTADUAL PAULISTA JULIO DE MESQUITA FILHO. Downloaded on May 16,2023 at 16:45:47 UTC from IEEE Xplore. Restrictions apply.
Comparative Analysis of Supervised Learning Techniques of Machine Learning for Software Defect Prediction
Authorized licensed use limited to: UNIV ESTADUAL PAULISTA JULIO DE MESQUITA FILHO. Downloaded on May 16,2023 at 16:45:47 UTC from IEEE Xplore. Restrictions apply.
10th International Conference on System Modeling & Advancement in Research Trends, 10th–11th December, 2021
Faculty of Engineering & Computing Sciences, Teerthanker Mahaveer University, Moradabad, India
Authorized licensed use limited to: UNIV ESTADUAL PAULISTA JULIO DE MESQUITA FILHO. Downloaded on May 16,2023 at 16:45:47 UTC from IEEE Xplore. Restrictions apply.
Comparative Analysis of Supervised Learning Techniques of Machine Learning for Software Defect Prediction
References [11] Tiwari, A. K., & Shukla, R. K. (2019, March). Machine Learning
Approaches for Face Identification Feed Forward Algorithms. In
[1] R. Malhotra, “Comparative analysis of statistical and machine
Proceedings of 2nd International Conference on Advanced Computing
learning methods for predicting faulty modules,” Applied Soft and Software Engineering (ICACSE).
Computing 21, (2014): 286-297 [12] Kumar Shukla, R., & Kumar Tiwari, A. (2021). Comparative
[2] D’Ambros, Marco, Michele Lanza, and Romain Robbes. “An Analysis of Machine Learning Based Approaches for Face Detection
extensive comparison of bug prediction approaches.” Mining and Recognition. Journal of Information Technology Management,
Software Repositories (MSR), 2010 7th IEEE Working Conference 13(1), 1-21.
on. IEEE, 2010 [13] Singhal, P., Srivastava, P. K., Tiwari, A. K., & Shukla, R. K. (2022).
[3] Gupta, Dharmendra Lal, and Kavita Saxena. “Software bug A Survey: Approaches to Facial Detection and Recognition with
prediction using object-oriented metrics.” Sādhanā (2017): 1-15. Machine Learning Techniques. In Proceedings of Second Doctoral
[4] T. Gyimothy, R. Ferenc and I. Siket, “Empirical Validation of Object Symposium on Computational Intelligence (pp. 103-125). Springer,
Oriented Metrics on Open Source Software for Fault Prediction,” Singapore.
IEEE Transactions On Software Engineering, 2005. [14] Shukla, R. K., Prakash, V., & Pandey, S. (2020, December). A
[5] Y. Tohman, K. Tokunaga, S. Nagase, and M. Y., “Structural Perspective on Internet of Things: Challenges & Applications.
approach to the estimation of the number of residual software faults In 2020 9th International Conference System Modeling and
based on the hyper-geometric districution model,” IEEE Trans. on Advancement in Research Trends (SMART) (pp. 184-189). IEEE.
Software Engineering, pp. 345–355, 1989. [15] Jain, A., Kumar, A., & Sharma, S. (2015). Comparative Design and
Analysis of Mesh, Torus and Ring NoC. Procedia Computer Science,
[6] A. Sheta and D. Rine, “Modeling Incremental Faults of
48, 330-337.
Software Testing Process Using AR Models ”, the Proceeding
[16] Ghai, D., Gianey, H. K., Jain, A., & Uppal, R. S. (2020). Quantum
of 4th International Multi-Conferences on Computer Science and and dual-tree complex wavelet transform-based image watermarking.
Information Technology (CSIT 2006), Amman, Jordan. Vol. 3. 2006. International Journal of Modern Physics B, 34(04), 2050009.
[7] D. Sharma and P. Chandra, “Software Fault Prediction Using [17] Agrawal, N., Jain, A., & Agarwal, A. (2019). Simulation of Network
Machine-Learning Techniques,” Smart Computing and Informatics. on Chip for 3D Router Architecture. International Journal of Recent
Springer, Singapore, 2018. 541-549. Technology and Engineering, 8, 58-62.
[8] M. M. Rosli, N. H. I. Teo, N. S. M. Yusop and N. S. Moham, “The [18] Shukla, R. K., Agarwal, A., & Malviya, A. K. (2018). An Introduction
Design of a Software Fault Prone Application Using Evolutionary of Face Recognition and Face Detection for Blurred and Noisy
Algorithm,” IEEE Conference on Open Systems, 2011. Images. International Journal of Scientific Research in Computer
[9] Singh, Praman Deep, and Anuradha Chug. “Software defect Science and Engineering, 6(3), 39-43.
prediction analysis using machine learning algorithms.” 7th [19] Agarwal, A. K., & Jain, A. (2019). Synthesis of 2D and 3D NoC
International Conference on Cloud Computing, Data Science & mesh router architecture in HDL environment. J Adv Res Dyn
Engineering-Confluence, IEEE, 2017. Control Syst, 11(4), 2573-2581.
[10] Shukla, R. K., & Tiwari, A. K. (2020). A Machine Learning [20] Jain, A., Gahlot, A. K., Dwivedi, R., Kumar, A., & Sharma, S.
Approaches on Face Detection and Recognition. Solid State K. (2018). Fat Tree NoC Design and Synthesis. In Intelligent
Technology, 63(5), 7619-7627. Communication, Control and Devices (pp. 1749-1756). Springer,
Authorized licensed use limited to: UNIV ESTADUAL PAULISTA JULIO DE MESQUITA FILHO. Downloaded on May 16,2023 at 16:45:47 UTC from IEEE Xplore. Restrictions apply.