Problem Statement
Heart disease has become a major health problem in recent years. It is no longer only older
people who are prone to the disease. Your objective is to predict the presence of heart disease.
Data Description:
This database contains 76 attributes, but all published experiments refer to using a subset of 14 of
them. The ‘num’ feature indicates the presence of heart disease.
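As a rough illustration of how the ‘num’ attribute can serve as a classification target, the sketch below derives a binary label from it. It assumes that ‘num’ is coded as 0 for absence and a positive integer for presence of disease, which should be checked against the attribute information supplied with the data. The function name `add_binary_target`, the column name `target` and the toy values are hypothetical.

```python
import pandas as pd


def add_binary_target(df, source_col="num", target_col="target"):
    """Return a copy of df with a 0/1 column that is 1 where source_col > 0.

    Assumption: source_col uses 0 for "no disease" and positive integers
    for "disease present". Verify this with df[source_col].value_counts().
    """
    out = df.copy()
    out[target_col] = (out[source_col] > 0).astype(int)
    return out


# Made-up values, used only to show the behaviour of the helper.
toy = pd.DataFrame({"num": [0, 2, 0, 1, 4]})
labelled = add_binary_target(toy)
print(labelled)
```

If ‘num’ is already binary in the data you receive, this step leaves the labels unchanged.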
Attribute Information:
Instructions:
1. Perform the required data pre-processing to treat missing values and outliers
2. Perform exploratory data analysis to visualise the spread of each of the X variables and the
relationship between the various X variables and the Y variable
3. Divide the given data into train and test sets
4. Predict the presence of heart disease by building classification models
5. Interpret how each of the X variables influences the likelihood of heart disease
6. Evaluate the model performance measures and choose the best-performing model
7. List your key findings based on the best-performing model and its feature
importances
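
The steps above can be sketched end to end as follows. This is one possible outline under stated assumptions, not a prescribed solution. It runs on synthetic stand-in data with made-up predictors `x1` to `x3`, because the real attribute list is not reproduced here. The choices of median imputation, 1.5*IQR clipping, logistic regression, random forest, ROC AUC and permutation importance are illustrative, and plots for the exploratory step are omitted in favour of summary tables.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.inspection import permutation_importance
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (NOT the real dataset): three made-up predictors
# and a binary target y that depends mainly on x1.
rng = np.random.default_rng(0)
n = 400
X = pd.DataFrame(rng.normal(size=(n, 3)), columns=["x1", "x2", "x3"])
y = (2.0 * X["x1"] - 0.5 * X["x2"] + rng.normal(scale=0.5, size=n) > 0).astype(int)
X.iloc[::25, 1] = np.nan  # inject some missing values into x2
X.iloc[5, 2] = 40.0       # inject one extreme outlier into x3

# Step 3: split first, so that imputation values and clipping limits
# are learned from the training set only.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Step 1 (outliers): clip each column to the training-set 1.5*IQR fences.
q1, q3 = X_train.quantile(0.25), X_train.quantile(0.75)
lower, upper = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
X_train = X_train.clip(lower=lower, upper=upper, axis=1)
X_test = X_test.clip(lower=lower, upper=upper, axis=1)

# Step 2 (EDA, tables only): spread of each X and its mean per class of y.
print(X_train.describe())
print(X_train.assign(y=y_train).groupby("y").mean())

# Steps 1 (missing values) and 4: median imputation inside each pipeline.
models = {
    "logistic_regression": make_pipeline(
        SimpleImputer(strategy="median"), StandardScaler(), LogisticRegression()
    ),
    "random_forest": make_pipeline(
        SimpleImputer(strategy="median"),
        RandomForestClassifier(n_estimators=200, random_state=0),
    ),
}

# Step 6: compare models on held-out ROC AUC and keep the best one.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = roc_auc_score(y_test, model.predict_proba(X_test)[:, 1])
best_name = max(scores, key=scores.get)
best_model = models[best_name]

# Steps 5 and 7: permutation importance of each X for the chosen model.
result = permutation_importance(
    best_model, X_test, y_test, scoring="roc_auc", n_repeats=10, random_state=0
)
importance = pd.Series(result.importances_mean, index=X.columns)
importance = importance.sort_values(ascending=False)
print(scores)
print(best_name)
print(importance)
```

Splitting before pre-processing keeps information from the test set out of the imputation and clipping steps. On the real data, the same outline applies once `X` and `y` are replaced by the actual predictors and the target derived from ‘num’.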