0% found this document useful (0 votes)

33 views

Deep Learning Project Report

Uploaded by

KINJAL PARMAR

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views

Deep Learning Project Report

Uploaded by

KINJAL PARMAR

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Deep Learning Project Report

Main Objective of the Analysis

The main objective of this analysis is to develop a deep learning model to predict the
presence of heart disease in patients. By accurately predicting heart disease, healthcare
providers can prioritize patients for further diagnostic tests and treatment, potentially
improving patient outcomes. This analysis focuses on supervised learning using classification
algorithms to achieve high accuracy and provide actionable insights to healthcare
professionals.

Description of the Data Set

Data Set Overview

The data set used in this analysis is the Heart Disease Dataset from the UCI Machine
Learning Repository. It includes information on 303 patients with 14 features related to their
medical history and diagnostic test results. The dataset is sourced from four different
hospitals and is commonly used for benchmarking heart disease prediction models.

Summary of Attributes

The key attributes in the data set include:

 Age: Age of the patient

 Sex: Gender of the patient (1 = male; 0 = female)
 CP: Chest pain type (0 = typical angina, 1 = atypical angina, 2 = non-anginal pain, 3
= asymptomatic)
 Trestbps: Resting blood pressure (in mm Hg on admission to the hospital)
 Chol: Serum cholesterol in mg/dl
 FBS: Fasting blood sugar > 120 mg/dl (1 = true; 0 = false)
 Restecg: Resting electrocardiographic results (0 = normal, 1 = having ST-T wave
abnormality, 2 = showing probable or definite left ventricular hypertrophy)
 Thalach: Maximum heart rate achieved
 Exang: Exercise-induced angina (1 = yes; 0 = no)
 Oldpeak: ST depression induced by exercise relative to rest
 Slope: The slope of the peak exercise ST segment (0 = upsloping, 1 = flat, 2 =
downsloping)
 Ca: Number of major vessels (0-3) colored by fluoroscopy
 Thal: Thalassemia (1 = normal; 2 = fixed defect; 3 = reversible defect)
 Target: Diagnosis of heart disease (1 = presence; 0 = absence)

Data Exploration and Cleaning

Data Exploration

Initial exploration of the data set revealed:

 No missing values in the dataset
 A balanced distribution of patients across various categories such as age, gender, and
chest pain type
 Outliers in the Chol and Thalach columns

Data Cleaning and Feature Engineering

The following actions were taken to prepare the data for modeling:

 Normalized the Age, Trestbps, Chol, Thalach, and Oldpeak columns to standardize
the range of values
 One-hot encoded categorical variables such as CP, Restecg, Slope, and Thal
 Split the data into training and testing sets with a 70:30 ratio

Model Training
Model Variations

Three variations of deep learning models were trained:

 Model 1: A basic neural network with one hidden layer

 Model 2: A neural network with two hidden layers and dropout regularization
 Model 3: A convolutional neural network (CNN) designed to capture local patterns in
the data

Hyperparameter Tuning

For each model, hyperparameters such as learning rate, batch size, and the number of epochs
were tuned using grid search and cross-validation to find the optimal settings.

Recommended Model
After evaluating the performance of all models, Model 2 (neural network with two hidden
layers and dropout regularization) was selected as the final model. It achieved the highest
accuracy of 85% on the test set while maintaining good generalization performance.

Key Findings and Insights

The key findings from the analysis are as follows:

 Age, Chest pain type (CP), Maximum heart rate achieved (Thalach), and
Exercise-induced angina (Exang) were the most significant predictors of heart
disease.
 The deep learning model effectively captured non-linear relationships in the data,
leading to improved prediction accuracy.
 Regularization techniques such as dropout helped prevent overfitting and improved
the model's generalization to unseen data.
Python Code:

# Import necessary libraries

import pandas as pd

import numpy as np

from sklearn.model_selection import train_test_split

from sklearn.preprocessing import StandardScaler, OneHotEncoder

from sklearn.compose import ColumnTransformer

from sklearn.pipeline import Pipeline

import tensorflow as tf

from tensorflow.keras.models import Sequential

from tensorflow.keras.layers import Dense, Dropout

from tensorflow.keras.callbacks import EarlyStopping

from sklearn.metrics import accuracy_score, classification_report

# Load the dataset

url = "https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/
processed.cleveland.data"

column_names = [

"age", "sex", "cp", "trestbps", "chol", "fbs", "restecg",

"thalach", "exang", "oldpeak", "slope", "ca", "thal", "target"

df = pd.read_csv(url, names=column_names)

# Replace missing values represented by '?' with NaN

df.replace('?', np.nan, inplace=True)

# Convert columns to numeric, forcing errors to NaN

df = df.apply(pd.to_numeric, errors='coerce')

# Fill missing values with column mean

df.fillna(df.mean(), inplace=True)

# Split the data into features and target

X = df.drop("target", axis=1)

y = df["target"].apply(lambda x: 1 if x > 0 else 0) # Binarize the target variable

# Define preprocessing steps for numerical and categorical features

numeric_features = ["age", "trestbps", "chol", "thalach", "oldpeak"]

numeric_transformer = Pipeline(steps=[

("scaler", StandardScaler())

])

categorical_features = ["sex", "cp", "fbs", "restecg", "exang", "slope", "ca", "thal"]

categorical_transformer = Pipeline(steps=[

("onehot", OneHotEncoder(handle_unknown="ignore"))

])

preprocessor = ColumnTransformer(

transformers=[

("num", numeric_transformer, numeric_features),

("cat", categorical_transformer, categorical_features)

]

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Preprocess the data

X_train = preprocessor.fit_transform(X_train)

X_test = preprocessor.transform(X_test)

# Build the deep learning model

model = Sequential()

model.add(Dense(64, input_dim=X_train.shape[1], activation="relu"))

model.add(Dropout(0.5))

model.add(Dense(32, activation="relu"))

model.add(Dropout(0.5))

model.add(Dense(1, activation="sigmoid"))

model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Define early stopping

early_stopping = EarlyStopping(monitor="val_loss", patience=10,

restore_best_weights=True)

# Train the model

history = model.fit(
X_train, y_train,

validation_split=0.2,

epochs=100,

batch_size=32,

callbacks=[early_stopping],

verbose=2

# Evaluate the model

y_pred_train = (model.predict(X_train) > 0.5).astype("int32")

y_pred_test = (model.predict(X_test) > 0.5).astype("int32")

print("Training Accuracy:", accuracy_score(y_train, y_pred_train))

print("Testing Accuracy:", accuracy_score(y_test, y_pred_test))

# Print classification report

print("Classification Report:\n", classification_report(y_test, y_pred_test))

# Plotting training & validation accuracy values

import matplotlib.pyplot as plt

plt.plot(history.history['accuracy'])

plt.plot(history.history['val_accuracy'])

plt.title('Model accuracy')

plt.ylabel('Accuracy')
plt.xlabel('Epoch')

plt.legend(['Train', 'Validation'], loc='upper left')

plt.show()

# Plotting training & validation loss values

plt.plot(history.history['loss'])

plt.plot(history.history['val_loss'])

plt.title('Model loss')

plt.ylabel('Loss')

plt.xlabel('Epoch')

plt.legend(['Train', 'Validation'], loc='upper left')

plt.show()

Next Steps
To further improve the model, the following steps are recommended:

 Collect additional data to increase the training set size and improve model robustness.
 Explore feature engineering techniques to create new features that may enhance
predictive performance.
 Investigate other deep learning architectures such as recurrent neural networks
(RNNs) or ensemble methods to potentially achieve better results.

Malaria MCQ
100% (6)
Malaria MCQ
7 pages
Heart Diseases Prediction Using Deep Learning Neural Network Model
No ratings yet
Heart Diseases Prediction Using Deep Learning Neural Network Model
5 pages
Disease Diagnosis
No ratings yet
Disease Diagnosis
4 pages
TWS-Assign-2024
No ratings yet
TWS-Assign-2024
5 pages
Project_Report
No ratings yet
Project_Report
18 pages
Heart Disease Prediction Using Machine Learning
No ratings yet
Heart Disease Prediction Using Machine Learning
4 pages
Heart Disease Prediction
No ratings yet
Heart Disease Prediction
8 pages
Ml Cep FAisal
No ratings yet
Ml Cep FAisal
18 pages
Lab Report Content - 15marks(1) (2)
No ratings yet
Lab Report Content - 15marks(1) (2)
10 pages
Final Report
No ratings yet
Final Report
43 pages
A.I Lab Report
No ratings yet
A.I Lab Report
24 pages
BIBA Enhancing Heart Disease Prediction With A Hybrid Model Combining Decision Tree, Logistic Regres
No ratings yet
BIBA Enhancing Heart Disease Prediction With A Hybrid Model Combining Decision Tree, Logistic Regres
12 pages
Heart Disease Prediction System Using Machine Learning[3][2]
No ratings yet
Heart Disease Prediction System Using Machine Learning[3][2]
19 pages
Web Application
No ratings yet
Web Application
13 pages
SUMMARY
No ratings yet
SUMMARY
16 pages
Efficient Medical Diagnosis of Human Heart Diseases
No ratings yet
Efficient Medical Diagnosis of Human Heart Diseases
27 pages
Research Paper-TWS-Assign- 2-with mendeley software
No ratings yet
Research Paper-TWS-Assign- 2-with mendeley software
6 pages
Heart Disease Prediction
No ratings yet
Heart Disease Prediction
6 pages
Borella_Elisa
No ratings yet
Borella_Elisa
49 pages
INFX 499 Milestone 1
No ratings yet
INFX 499 Milestone 1
8 pages
Pavani
No ratings yet
Pavani
4 pages
Diagnostics 14 00239 v2
No ratings yet
Diagnostics 14 00239 v2
19 pages
Final Heart Disease Project Proposal
No ratings yet
Final Heart Disease Project Proposal
12 pages
Review Paper Heart Disease Prediction
No ratings yet
Review Paper Heart Disease Prediction
5 pages
SECOND REVIEW
No ratings yet
SECOND REVIEW
23 pages
review 2
No ratings yet
review 2
23 pages
PythonHeartDisease FirstReview
No ratings yet
PythonHeartDisease FirstReview
20 pages
Chapter 3 Old
No ratings yet
Chapter 3 Old
45 pages
Heart Disease Prediction With Machine Learning Approaches
No ratings yet
Heart Disease Prediction With Machine Learning Approaches
5 pages
Artificial Neural Network - Project Report RA1811032010005
No ratings yet
Artificial Neural Network - Project Report RA1811032010005
4 pages
first review
No ratings yet
first review
24 pages
03-Supervised Machine Learning Classification
No ratings yet
03-Supervised Machine Learning Classification
33 pages
HDD New Report
No ratings yet
HDD New Report
95 pages
Heart Disease Prediction PPT
No ratings yet
Heart Disease Prediction PPT
11 pages
Final Year Project Report
No ratings yet
Final Year Project Report
20 pages
The Prediction and Analysis of Heart Disease Using 240511 181237
No ratings yet
The Prediction and Analysis of Heart Disease Using 240511 181237
8 pages
Risk Prediction of Cardiovascular Disease Using
No ratings yet
Risk Prediction of Cardiovascular Disease Using
14 pages
Research Article: Prediction of Heart Disease Using A Combination of Machine Learning and Deep Learning
No ratings yet
Research Article: Prediction of Heart Disease Using A Combination of Machine Learning and Deep Learning
11 pages
Machine Learning
No ratings yet
Machine Learning
30 pages
Heart Disease Prediction With Machine Learning Approaches
No ratings yet
Heart Disease Prediction With Machine Learning Approaches
6 pages
Prediction of Heart Diseases Using Machine Learning
No ratings yet
Prediction of Heart Diseases Using Machine Learning
49 pages
A Clinical Decision Support System For Heart Disease Prediction Using Deep Learning
No ratings yet
A Clinical Decision Support System For Heart Disease Prediction Using Deep Learning
14 pages
4183 (2)
No ratings yet
4183 (2)
4 pages
Synopsis (Heart Disease Prediction)
No ratings yet
Synopsis (Heart Disease Prediction)
7 pages
Heart Disease Risk Prediction Using Deep Learning Techniques With Feature Augmentation
No ratings yet
Heart Disease Risk Prediction Using Deep Learning Techniques With Feature Augmentation
15 pages
Heart Disease Ppt-6
No ratings yet
Heart Disease Ppt-6
19 pages
Heart Disease Predictor
No ratings yet
Heart Disease Predictor
3 pages
Second Progres Report
No ratings yet
Second Progres Report
10 pages
Heart Disease Report
No ratings yet
Heart Disease Report
8 pages
Heart Disease Pre
No ratings yet
Heart Disease Pre
23 pages
W42 Final Machine Learning
No ratings yet
W42 Final Machine Learning
7 pages
coursera DL and RL project
No ratings yet
coursera DL and RL project
3 pages
Heart Diseases New
No ratings yet
Heart Diseases New
30 pages
Heart Disease Prediction Model: Dissertation
No ratings yet
Heart Disease Prediction Model: Dissertation
4 pages
Review 1
No ratings yet
Review 1
18 pages
Heart Disease Detection - Newreport
No ratings yet
Heart Disease Detection - Newreport
57 pages
Comparative Study For Classification
No ratings yet
Comparative Study For Classification
6 pages
Gokaraju Rangaraju Institute of Engineering and Technology
No ratings yet
Gokaraju Rangaraju Institute of Engineering and Technology
19 pages
An Automated Diagnostic System For Heart Disease Prediction Based On - Chi - 2 - Statistical Model and Optimally Configured Deep Neural Network
No ratings yet
An Automated Diagnostic System For Heart Disease Prediction Based On - Chi - 2 - Statistical Model and Optimally Configured Deep Neural Network
8 pages
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Radio-Wave-Propagation
No ratings yet
Radio-Wave-Propagation
47 pages
Phase II Qb Solution Programs
No ratings yet
Phase II Qb Solution Programs
33 pages
t2 a Qp Python-i Sem-III 2022
No ratings yet
t2 a Qp Python-i Sem-III 2022
6 pages
Ch8, Central Processing Unit
No ratings yet
Ch8, Central Processing Unit
61 pages
50-60 GHZ Waveguide To Microstrip Transition On L TCC For Enabling Integrated Mmic Packaging
No ratings yet
50-60 GHZ Waveguide To Microstrip Transition On L TCC For Enabling Integrated Mmic Packaging
4 pages
A V-Band Waveguide To Microstrip Probe Transition
No ratings yet
A V-Band Waveguide To Microstrip Probe Transition
3 pages
Diploma in Anaesthesia (Da) : Page 1 of 5
No ratings yet
Diploma in Anaesthesia (Da) : Page 1 of 5
5 pages
Tourniquet Use at The Boston Marathon Bombing: Lost in Translation
No ratings yet
Tourniquet Use at The Boston Marathon Bombing: Lost in Translation
6 pages
Biology Quiz
No ratings yet
Biology Quiz
3 pages
Arakaki 2017
No ratings yet
Arakaki 2017
8 pages
Taking Thevital Signs
No ratings yet
Taking Thevital Signs
1 page
Matcha Benefits - Google Search
No ratings yet
Matcha Benefits - Google Search
1 page
[FREE PDF sample] First Aid for the Pediatrics Clerkship, 4E [TRUE PDF] 4th Edition Latha Ganti ebooks
100% (1)
[FREE PDF sample] First Aid for the Pediatrics Clerkship, 4E [TRUE PDF] 4th Edition Latha Ganti ebooks
65 pages
Subarachnoid Hemorrhage.5
No ratings yet
Subarachnoid Hemorrhage.5
45 pages
7. Urban administration
No ratings yet
7. Urban administration
5 pages
Hypertension Drugs Cheat Sheet: by Via
No ratings yet
Hypertension Drugs Cheat Sheet: by Via
3 pages
Poultry Coccidiosis Ebook Complete
No ratings yet
Poultry Coccidiosis Ebook Complete
6 pages
MCQ Base Clinical Pharmacology PDF
No ratings yet
MCQ Base Clinical Pharmacology PDF
30 pages
LiDCOunity Brochure 4634
No ratings yet
LiDCOunity Brochure 4634
7 pages
Lymphadenopathy: Clinical Approach
No ratings yet
Lymphadenopathy: Clinical Approach
31 pages
ADEC EHSMS Handbook V.01-2011-F-E PDF
No ratings yet
ADEC EHSMS Handbook V.01-2011-F-E PDF
32 pages
Soal Uts Akper Semester 3
No ratings yet
Soal Uts Akper Semester 3
17 pages
Course Title: Entrepreneurship Fall 2019 Semester 7: COMSATS University Islamabad, Lahore Campus
No ratings yet
Course Title: Entrepreneurship Fall 2019 Semester 7: COMSATS University Islamabad, Lahore Campus
4 pages
Epidemiological Theory Ppt-1
No ratings yet
Epidemiological Theory Ppt-1
64 pages
Current Therapy in Neurologic Disease 7th Edition Textbook Richard T. Johnson Md - Download the ebook and start exploring right away
100% (1)
Current Therapy in Neurologic Disease 7th Edition Textbook Richard T. Johnson Md - Download the ebook and start exploring right away
79 pages
Clarithromycin (Biaxin)
100% (1)
Clarithromycin (Biaxin)
1 page
Food Safety and Informal Markets
No ratings yet
Food Safety and Informal Markets
284 pages
The Principles of Cavity Preparation (Lecture by DR - Wedad Etman @AmCoFam)
92% (24)
The Principles of Cavity Preparation (Lecture by DR - Wedad Etman @AmCoFam)
38 pages
Renal Tubular Acidosis in Children Ppt 3
No ratings yet
Renal Tubular Acidosis in Children Ppt 3
1 page
M (ASCPi) Reading List
No ratings yet
M (ASCPi) Reading List
2 pages
Hematology: Basic Principles and Practice. 7th Edition. ISBN 0323357628, 978-0323357623
100% (22)
Hematology: Basic Principles and Practice. 7th Edition. ISBN 0323357628, 978-0323357623
23 pages
Somali Community Health Strategy111214
No ratings yet
Somali Community Health Strategy111214
40 pages
General Hospital Standard Checklist - Docx 2
No ratings yet
General Hospital Standard Checklist - Docx 2
109 pages
MSP 2022 For Production of Bovine Frozen Semen
No ratings yet
MSP 2022 For Production of Bovine Frozen Semen
56 pages
Shiva Abhishekam
No ratings yet
Shiva Abhishekam
3 pages