Literature survey paper on Comparative Analysis of Diabetics Prediction Systems using Machine Learning Algorithms
Abstract— Machine learning algorithms are used in various
domains, such as health care, banking, and education, because
they can extract useful information from databases and predict
outcomes such as disease. Diabetes is a prevalent chronic
condition characterized by elevated blood sugar levels, which,
if not managed properly, can lead to severe complications such
as cardiovascular disease, neuropathy, and retinopathy. Early
and accurate prediction of diabetes can significantly enhance
patient outcomes by enabling timely intervention and treatment.
This study conducts a comparative analysis of various machine
learning algorithms to determine the most effective model for
predicting diabetes.

Keywords— Diabetes, Support Vector Machine, Machine Learning,
Decision Trees, K-NN, Deep Learning.

I. INTRODUCTION

Diabetes mellitus, a chronic metabolic disorder characterized
by elevated levels of blood glucose, has become a global health
crisis, affecting millions of individuals worldwide. The early
detection and accurate prediction of diabetes are vital for
effective management and prevention of the associated
complications, such as cardiovascular diseases, neuropathy, and
nephropathy. Traditional diagnostic methods often rely on
clinical assessments and laboratory tests, which, while
effective, can be time-consuming and may miss early signs in
at-risk populations.

This study aims to conduct a comprehensive comparative analysis
of various machine learning algorithms to identify the most
effective model for predicting diabetes. The algorithms
evaluated in this research include Logistic Regression,
Decision Trees, Random Forests, Support Vector Machines, and
Neural Networks. Each algorithm is assessed based on a range of
performance metrics, including accuracy, precision, recall, F1
score, and AUC-ROC, using a publicly available diabetes dataset.

The primary objective of this research is to provide healthcare
professionals with insights into the strengths and limitations
of different machine learning models, guiding them in selecting
the optimal algorithm for clinical applications. By comparing
the performance of these models, this study seeks to enhance
the predictive accuracy of diabetes diagnosis and contribute to
the growing body of knowledge in the field of medical data
science.
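
To make the performance metrics named in this study (accuracy, precision, recall, F1 score, and AUC-ROC) concrete, the following minimal Python sketch shows one way they could be computed for a binary diabetes classifier. It is an illustration under stated assumptions, not the evaluation code used in this survey: the function names, the 0.5 decision threshold, and the toy labels and scores are all hypothetical, and the label 1 is assumed to mean "diabetic".

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels, assuming 1 = diabetic."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall, and F1 from hard 0/1 predictions."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


def auc_roc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUC-ROC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC-ROC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data for illustration only (not from any real dataset).
toy_true = [1, 0, 1, 1, 0, 0, 1, 0]
toy_score = [0.9, 0.4, 0.6, 0.3, 0.2, 0.7, 0.8, 0.1]
toy_pred = [1 if s >= 0.5 else 0 for s in toy_score]  # assumed 0.5 threshold

print(classification_metrics(toy_true, toy_pred))
print(auc_roc(toy_true, toy_score))
```

In a clinical screening setting, recall is often weighted more heavily than precision, because a missed diabetic case is usually costlier than a false alarm. Reporting all five metrics side by side, as this study does, makes that trade-off visible when comparing models.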