0% found this document useful (0 votes)
14 views

Project - Predicting Heart Disease

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views

Project - Predicting Heart Disease

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Project: Predicting Heart Disease

The World Health Organization (WHO) estimates that 17.9 million people die from

cardiovascular diseases (CVDs) every year.

There are multiple risk factors that could contribute to CVD in an individual, such as

unhealthy diet, lack of physical activity, or mental illness. Identifying these risk factors

early on could help prevent many premature deaths.

We are working for an R&D company that focuses on providing healthcare solutions.

The company has collected anonymized data from multiple hospitals on several

patients. The dataset includes relevant information for each patient, such as their

personal information and some medical data, including whether or not they have had

heart disease before.

We are tasked with using the dataset to accurately predict the likelihood of a new

patient having heart disease in the future. The dataset has the following features:

1. Age: age of the patient [years]


2. Sex: sex of the patient [M: Male, F: Female]
3. ChestPainType: chest pain type [TA: Typical Angina, ATA: Atypical Angina, NAP:
Non-Anginal Pain, ASY: Asymptomatic]
4. RestingBP: resting blood pressure [mm Hg]
5. Cholesterol: serum cholesterol [mm/dl]
6. FastingBS: fasting blood sugar [1: if FastingBS > 120 mg/dl, 0: otherwise]
7. RestingECG: resting electrocardiogram results [Normal: Normal, ST: having ST-T
wave abnormality (T wave inversions and/or ST elevation or depression of > 0.05
mV), LVH: showing probable or definite left ventricular hypertrophy by Estes'
criteria]
8. MaxHR: maximum heart rate achieved [Numeric value between 60 and 202]
9. ExerciseAngina: exercise-induced angina [Y: Yes, N: No]
10. Oldpeak: oldpeak = ST [Numeric value measured in depression]
11. ST_Slope: the slope of the peak exercise ST segment [Up: upsloping, Flat: flat,
Down: downsloping]
12. HeartDisease: output class [1: heart disease, 0: Normal]

Before we jump into building a model, let's conduct exploratory data analysis on the

dataset.

Please note: For each step of the project, you are encouraged to add appropriate

headers or subheaders and any discussions or explanations you see fit. It's always

beneficial to have an appropriate structure for any project. It will help others clearly

understand your work and will enhance your portfolio. It doesn't have to be perfect in

any sense of the word. The goal should be to clarify your thought process for every

step.

Instructions

In the Jupyter Notebook:

1. Add an appropriate title for the project in a markdown cell.


2. Add a brief description to the same cell that explains what the project will focus
on.
3. Import the libraries you think will be relevant to this project.
4. Read heart_disease_prediction.csv into a pandas dataframe.
5. Display the first five rows of the dataframe.
6. Print out the number of features and observations in the dataset.

You might also like