0% found this document useful (0 votes)

18 views8 pages

Untitled2.Ipynb - Colab

The document outlines a data analysis and machine learning workflow using a heart disease dataset in Python with libraries such as pandas, seaborn, and scikit-learn. It includes data preprocessing, visualization of distributions and correlations, and the implementation of various classification models including Logistic Regression, Decision Tree, Random Forest, and SVM, along with their evaluation metrics. The analysis reveals insights into the dataset and the performance of different models in predicting heart disease.

Uploaded by

Omar Abdullah Prince

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views8 pages

Untitled2.Ipynb - Colab

Uploaded by

Omar Abdullah Prince

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

5/11/25, 10:35 PM Untitled2.

ipynb - Colab

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report,confusion_matrix,accuracy_score

from google.colab import files

uploaded = files.upload()

import io
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# Automatically get filename

filename = list(uploaded.keys())[0]

# Read the file

df = pd.read_csv(io.BytesIO(uploaded[filename]))

# Show DataFrame details

print(df.head())
print(df.info())
print(df.isnull().sum())

# Plot the target column

sns.countplot(x='target', data=df)
plt.title("Heart Disease Count (0=No, 1=Yes)")
plt.show()

https://colab.research.google.com/drive/1dOSJItEe0P2tYlbFeuBxId4-CO9tj7_x#scrollTo=6oxSkoppFBZY&printMode=true 1/8
5/11/25, 10:35 PM Untitled2.ipynb - Colab

Choose Files heart.csv

heart.csv(text/csv) - 39689 bytes, last modified: 5/11/2025 - 100% done
Saving heart.csv to heart.csv
age sex chest pain type resting bp s cholesterol fasting blood sugar \
0 40 1 2 140 289 0
1 49 0 3 160 180 0
2 37 1 2 130 283 0
3 48 0 4 138 214 0
4 54 1 3 150 195 0

resting ecg max heart rate exercise angina oldpeak ST slope target
0 0 172 0 0.0 1 0
1 0 156 0 1.0 2 1
2 1 98 0 0.0 1 0
3 0 108 1 1.5 2 1
4 0 122 0 0.0 1 0
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1190 entries, 0 to 1189
Data columns (total 12 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 age 1190 non-null int64
1 sex 1190 non-null int64
2 chest pain type 1190 non-null int64
3 resting bp s 1190 non-null int64
4 cholesterol 1190 non-null int64
5 fasting blood sugar 1190 non-null int64
6 resting ecg 1190 non-null int64
7 max heart rate 1190 non-null int64
8 exercise angina 1190 non-null int64
9 oldpeak 1190 non-null float64
10 ST slope 1190 non-null int64
11 target 1190 non-null int64
dtypes: float64(1), int64(11)
memory usage: 111.7 KB
None
age 0
sex 0
chest pain type 0
resting bp s 0
cholesterol 0
fasting blood sugar 0
resting ecg 0
max heart rate 0
exercise angina 0
oldpeak 0
ST slope 0
target 0
dtype: int64

import seaborn as sns

import matplotlib.pyplot as plt
import pandas as pd
sns.set(style="white")

fig, axes = plt.subplots(1, 3, figsize=(24, 8)) # Increase width and height for more space

# 1️⃣ Missing Values Heatmap

sns.heatmap(df.isnull(), cbar=False, cmap='viridis', ax=axes[0])
axes[0].set_title("Missing Values Heatmap")

# 2️⃣ Correlation Heatmap with increased spreading

https://colab.research.google.com/drive/1dOSJItEe0P2tYlbFeuBxId4-CO9tj7_x#scrollTo=6oxSkoppFBZY&printMode=true 2/8
5/11/25, 10:35 PM Untitled2.ipynb - Colab
2️⃣ p p g
corr = df.corr()
sns.heatmap(corr, annot=True, cmap='coolwarm', ax=axes[1], fmt='.2f', annot_kws={"size": 10})
axes[1].set_title("Correlation Between Features")

# 3️⃣ Categorical Pivot Heatmap: Sex vs Target Count

pivot_table = df.pivot_table(index='sex', columns='target', aggfunc='size', fill_value=0)
sns.heatmap(pivot_table, annot=True, fmt='d', cmap="YlGnBu", ax=axes[2])
axes[2].set_title("Target vs Sex Heatmap (Count)")

# Adjust layout to avoid overlap

plt.subplots_adjust(wspace=0.3) # Increase space between subplots
plt.tight_layout()
plt.show()

import seaborn as sns

import matplotlib.pyplot as plt

# Subplots: 3 row, 2 column = 6 graphs

fig, axes = plt.subplots(3, 2, figsize=(15, 14))

# Cholesterol
sns.histplot(df['cholesterol'], kde=True, color='blue', ax=axes[0, 0])
axes[0, 0].set_title("Cholesterol Distribution")
axes[0, 0].set_xlabel("Cholesterol Level")
axes[0, 0].set_ylabel("Frequency")

# Age
sns.histplot(df['age'], kde=True, color='purple', ax=axes[0, 1])
axes[0, 1].set_title("Age Distribution")
axes[0, 1].set_xlabel("Age")
axes[0, 1].set_ylabel("Frequency")

# Max Heart Rate

sns.histplot(df['max heart rate'], kde=True, color='red', ax=axes[1, 0])
axes[1, 0].set_title("Max Heart Rate Distribution")
axes[1, 0].set_xlabel("Max Heart Rate")
axes[1, 0].set_ylabel("Frequency")

# Oldpeak
sns.histplot(df['oldpeak'], kde=True, color='green', ax=axes[1, 1])
axes[1, 1].set_title("Oldpeak (ST Depression) Distribution")
axes[1, 1].set_xlabel("Oldpeak")

https://colab.research.google.com/drive/1dOSJItEe0P2tYlbFeuBxId4-CO9tj7_x#scrollTo=6oxSkoppFBZY&printMode=true 3/8
5/11/25, 10:35 PM Untitled2.ipynb - Colab
axes[1, 1].set_ylabel("Frequency")

# Resting Blood Pressure

sns.histplot(df['resting bp s'], kde=True, color='orange', ax=axes[2, 0])
axes[2, 0].set_title("Resting Blood Pressure Distribution")
axes[2, 0].set_xlabel("Resting BP")
axes[2, 0].set_ylabel("Frequency")

# Fasting Blood Sugar

sns.histplot(df['fasting blood sugar'], kde=False, color='teal', ax=axes[2, 1])
axes[2, 1].set_title("Fasting Blood Sugar Distribution")
axes[2, 1].set_xlabel("Fasting Blood Sugar")
axes[2, 1].set_ylabel("Count")

# Adjust layout
plt.tight_layout()
plt.show()

https://colab.research.google.com/drive/1dOSJItEe0P2tYlbFeuBxId4-CO9tj7_x#scrollTo=6oxSkoppFBZY&printMode=true 4/8
5/11/25, 10:35 PM Untitled2.ipynb - Colab

from sklearn.preprocessing import MinMaxScaler

scaler = MinMaxScaler()

numerical_columns = ['age', 'sex', 'chest pain type', 'resting bp s', 'cholesterol',

'fasting blood sugar', 'resting ecg', 'max heart rate', 'exercise angina',
'oldpeak', 'ST slope']

https://colab.research.google.com/drive/1dOSJItEe0P2tYlbFeuBxId4-CO9tj7_x#scrollTo=6oxSkoppFBZY&printMode=true 5/8
5/11/25, 10:35 PM Untitled2.ipynb - Colab

df[numerical_columns] = scaler.fit_transform(df[numerical_columns])

print(df.head())

age sex chest pain type resting bp s cholesterol \

0 0.244898 1.0 0.333333 0.70 0.479270
1 0.428571 0.0 0.666667 0.80 0.298507
2 0.183673 1.0 0.333333 0.65 0.469320
3 0.408163 0.0 1.000000 0.69 0.354892
4 0.530612 1.0 0.666667 0.75 0.323383

fasting blood sugar resting ecg max heart rate exercise angina \
0 0.0 0.0 0.788732 0.0
1 0.0 0.0 0.676056 0.0
2 0.0 0.5 0.267606 0.0
3 0.0 0.0 0.338028 1.0
4 0.0 0.0 0.436620 0.0

oldpeak ST slope target

0 0.295455 0.333333 0
1 0.409091 0.666667 1
2 0.295455 0.333333 0
3 0.465909 0.666667 1
4 0.295455 0.333333 0

from sklearn.model_selection import train_test_split

X = df.drop(columns=['target'])
y = df['target']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

print("Training data size:", X_train.shape)

print("Test data size:", X_test.shape)

Training data size: (952, 11)

Test data size: (238, 11)

# Import necessary libraries

from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report

# Assume that df is already loaded and preprocessed

# Splitting the data into features (X) and target (y)

X = df.drop('target', axis=1) # Dropping target column for features
y = df['target'] # Target column

# Splitting data into training and testing sets (80% training, 20% testing)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Model 1: Logistic Regression

logreg_model = LogisticRegression(max_iter=1000)
logreg_model.fit(X_train, y_train)
logreg_pred = logreg_model.predict(X_test)

# Model 2: Decision Tree

dt_model = DecisionTreeClassifier(random_state=42)
dt_model.fit(X_train, y_train)
dt_pred = dt_model.predict(X_test)

# Model 3: Random Forest

rf_model = RandomForestClassifier(random_state=42)
rf_model.fit(X_train, y_train)
rf_pred = rf_model.predict(X_test)

# Model 4: Support Vector Machine (SVM)

svm_model = SVC(kernel='linear')
svm_model.fit(X_train, y_train)
svm_pred = svm_model.predict(X_test)

# Evaluate models

# Logistic Regression
print("Logistic Regression Accuracy:", accuracy_score(y_test, logreg_pred))
print("Logistic Regression Confusion Matrix:\n", confusion_matrix(y_test, logreg_pred))

https://colab.research.google.com/drive/1dOSJItEe0P2tYlbFeuBxId4-CO9tj7_x#scrollTo=6oxSkoppFBZY&printMode=true 6/8
5/11/25, 10:35 PM Untitled2.ipynb - Colab
print("Logistic Regression Classification Report:\n", classification_report(y_test, logreg_pred))

# Decision Tree
print("Decision Tree Accuracy:", accuracy_score(y_test, dt_pred))
print("Decision Tree Confusion Matrix:\n", confusion_matrix(y_test, dt_pred))
print("Decision Tree Classification Report:\n", classification_report(y_test, dt_pred))

# Random Forest
print("Random Forest Accuracy:", accuracy_score(y_test, rf_pred))
print("Random Forest Confusion Matrix:\n", confusion_matrix(y_test, rf_pred))
print("Random Forest Classification Report:\n", classification_report(y_test, rf_pred))

# SVM
print("SVM Accuracy:", accuracy_score(y_test, svm_pred))
print("SVM Confusion Matrix:\n", confusion_matrix(y_test, svm_pred))
print("SVM Classification Report:\n", classification_report(y_test, svm_pred))

Logistic Regression Accuracy: 0.8529411764705882

Logistic Regression Confusion Matrix:
[[ 90 17]
[ 18 113]]
Logistic Regression Classification Report:
precision recall f1-score support

0 0.83 0.84 0.84 107

1 0.87 0.86 0.87 131

accuracy 0.85 238

macro avg 0.85 0.85 0.85 238
weighted avg 0.85 0.85 0.85 238

Decision Tree Accuracy: 0.8991596638655462

Decision Tree Confusion Matrix:
[[ 99 8]
[ 16 115]]
Decision Tree Classification Report:
precision recall f1-score support

0 0.86 0.93 0.89 107

1 0.93 0.88 0.91 131

accuracy 0.90 238

macro avg 0.90 0.90 0.90 238
weighted avg 0.90 0.90 0.90 238

Random Forest Accuracy: 0.9453781512605042

Random Forest Confusion Matrix:
[[ 98 9]
[ 4 127]]
Random Forest Classification Report:
precision recall f1-score support

0 0.96 0.92 0.94 107

1 0.93 0.97 0.95 131

accuracy 0.95 238

macro avg 0.95 0.94 0.94 238
weighted avg 0.95 0.95 0.95 238

SVM Accuracy: 0.8571428571428571

SVM Confusion Matrix:
[[ 90 17]
[ 17 114]]
SVM Classification Report:
precision recall f1-score support

0 0.84 0.84 0.84 107

1 0.87 0.87 0.87 131

accuracy 0.86 238

macro avg 0.86 0.86 0.86 238
weighted avg 0.86 0.86 0.86 238

from sklearn.svm import SVC

from sklearn.metrics import accuracy_score, confusion_matrix, classification_report, roc_curve, auc
import matplotlib.pyplot as plt

# Model 4: Support Vector Machine (SVM)

svm_model = SVC(kernel='linear', probability=True, random_state=42)
svm_model.fit(X_train, y_train)
svm_pred = svm_model.predict(X_test)

# Logistic Regression Evaluation

print("Logistic Regression Evaluation:")
logreg_accuracy = accuracy_score(y_test, logreg_pred)
https://colab.research.google.com/drive/1dOSJItEe0P2tYlbFeuBxId4-CO9tj7_x#scrollTo=6oxSkoppFBZY&printMode=true 7/8
5/11/25, 10:35 PM Untitled2.ipynb - Colab
print(f"Accuracy: {logreg_accuracy * 100:.2f}%")
print("Confusion Matrix:")
print(confusion_matrix(y_test, logreg_pred))
print("Classification Report:")
print(classification_report(y_test, logreg_pred))

# ROC Curve: Logistic Regression

fpr, tpr, thresholds = roc_curve(y_test, logreg_model.predict_proba(X_test)[:, 1])
roc_auc = auc(fpr, tpr)
print("\n🔎 Logistic Regression Threshold Values (First 10):")
for i in range(min(10, len(thresholds))):
print(f"Threshold: {thresholds[i]:.4f}, TPR: {tpr[i]:.4f}, FPR: {fpr[i]:.4f}")

plt.figure(figsize=(10, 6))
plt.plot(fpr, tpr, color='blue', label=f'Logistic Regression (AUC = {roc_auc:.2f})')
plt.plot([0, 1], [0, 1], color='gray', linestyle='--')
for i in range(0, len(thresholds), max(1, len(thresholds)//10)):
plt.annotate(f'{thresholds[i]:.2f}', (fpr[i], tpr[i]), textcoords="offset points", xytext=(5, -10), ha='left', fontsize=8)
plt.title('ROC Curve with Thresholds (Logistic Regression)')
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.legend(loc='lower right')
plt.grid(True)
plt.show()

# Decision Tree Evaluation

print("\nDecision Tree Evaluation:")
dt_accuracy = accuracy_score(y_test, dt_pred)
print(f"Accuracy: {dt_accuracy * 100:.2f}%")
print("Confusion Matrix:")
print(confusion_matrix(y_test, dt_pred))
print("Classification Report:")
print(classification_report(y_test, dt_pred))

# ROC Curve: Decision Tree

fpr, tpr, thresholds = roc_curve(y_test, dt_model.predict_proba(X_test)[:, 1])
roc_auc = auc(fpr, tpr)
print("\n🔎 Decision Tree Threshold Values (First 10):")
for i in range(min(10, len(thresholds))):
print(f"Threshold: {thresholds[i]:.4f}, TPR: {tpr[i]:.4f}, FPR: {fpr[i]:.4f}")

plt.figure(figsize=(10, 6))
plt.plot(fpr, tpr, color='red', label=f'Decision Tree (AUC = {roc_auc:.2f})')
plt.plot([0, 1], [0, 1], color='gray', linestyle='--')
for i in range(0, len(thresholds), max(1, len(thresholds)//10)):
plt.annotate(f'{thresholds[i]:.2f}', (fpr[i], tpr[i]), textcoords="offset points", xytext=(5, -10), ha='left', fontsize=8)
plt.title('ROC Curve with Thresholds (Decision Tree)')
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.legend(loc='lower right')
plt.grid(True)
plt.show()

# Random Forest Evaluation

print("\nRandom Forest Evaluation:")
rf_accuracy = accuracy_score(y_test, rf_pred)
print(f"Accuracy: {rf_accuracy * 100:.2f}%")
print("Confusion Matrix:")
print(confusion_matrix(y_test, rf_pred))
print("Classification Report:")
print(classification_report(y_test, rf_pred))

# ROC Curve: Random Forest

fpr, tpr, thresholds = roc_curve(y_test, rf_model.predict_proba(X_test)[:, 1])
roc_auc = auc(fpr, tpr)
print("\n🔎 Random Forest Threshold Values (First 10):")
for i in range(min(10, len(thresholds))):
print(f"Threshold: {thresholds[i]:.4f}, TPR: {tpr[i]:.4f}, FPR: {fpr[i]:.4f}")

plt.figure(figsize=(10, 6))
plt.plot(fpr, tpr, color='green', label=f'Random Forest (AUC = {roc_auc:.2f})')
plt.plot([0, 1], [0, 1], color='gray', linestyle='--')
for i in range(0, len(thresholds), max(1, len(thresholds)//10)):
plt.annotate(f'{thresholds[i]:.2f}', (fpr[i], tpr[i]), textcoords="offset points", xytext=(5, -10), ha='left', fontsize=8)
plt.title('ROC Curve with Thresholds (Random Forest)')
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.legend(loc='lower right')
plt.grid(True)
plt.show()

# SVM Evaluation
print("\nSVM Evaluation:")
https://colab.research.google.com/drive/1dOSJItEe0P2tYlbFeuBxId4-CO9tj7_x#scrollTo=6oxSkoppFBZY&printMode=true 8/8

Your Electronic Ticket Receipt
No ratings yet
Your Electronic Ticket Receipt
2 pages
Manual Equus 810 070
100% (1)
Manual Equus 810 070
10 pages
Vertopal.com Heart Failure Prediction With Detailed Headings
No ratings yet
Vertopal.com Heart Failure Prediction With Detailed Headings
12 pages
Major project - Colab
No ratings yet
Major project - Colab
15 pages
Ide To 6 Classification Algorithms
No ratings yet
Ide To 6 Classification Algorithms
34 pages
heart_cleveland.ipynb - Colab
No ratings yet
heart_cleveland.ipynb - Colab
5 pages
Hare Krishna
No ratings yet
Hare Krishna
1 page
Dovdush_KN-305_lab3
No ratings yet
Dovdush_KN-305_lab3
2 pages
ALY6015 Final Project Report
No ratings yet
ALY6015 Final Project Report
19 pages
Ai in HC - 2
No ratings yet
Ai in HC - 2
9 pages
eda-ml-decision-tree.ipynb - Colab
No ratings yet
eda-ml-decision-tree.ipynb - Colab
20 pages
Dovdush_KN-305_lab2
No ratings yet
Dovdush_KN-305_lab2
2 pages
Heart Failure Prediction
100% (1)
Heart Failure Prediction
41 pages
LP Practical ! Jupyter Notebook
No ratings yet
LP Practical ! Jupyter Notebook
6 pages
Import Numpy as Np
No ratings yet
Import Numpy as Np
3 pages
Project_Report
No ratings yet
Project_Report
18 pages
Bio-Signal Analysis For Smoking
No ratings yet
Bio-Signal Analysis For Smoking
1 page
AI Mini Project
No ratings yet
AI Mini Project
6 pages
Cluster Result
No ratings yet
Cluster Result
3 pages
Cluster Result
No ratings yet
Cluster Result
3 pages
Heart Disease Indicator Prediction Model
No ratings yet
Heart Disease Indicator Prediction Model
17 pages
# Load Packages: Pandas Pandas PD PD Numpy Numpy NP NP
No ratings yet
# Load Packages: Pandas Pandas PD PD Numpy Numpy NP NP
17 pages
C ML1
No ratings yet
C ML1
10 pages
DocScanner Oct 22, 2024 17-38
No ratings yet
DocScanner Oct 22, 2024 17-38
2 pages
Stroke Prediction Dataset
No ratings yet
Stroke Prediction Dataset
48 pages
AIML Practical 05 22105A2021
No ratings yet
AIML Practical 05 22105A2021
9 pages
Heart_Disease_1.Ipynb - Colaboratory (1)[1]
No ratings yet
Heart_Disease_1.Ipynb - Colaboratory (1)[1]
9 pages
Week - 6 - SWI - MLP - LogisticRegression - Ipynb - Colaboratory
No ratings yet
Week - 6 - SWI - MLP - LogisticRegression - Ipynb - Colaboratory
15 pages
Diabetis Project
No ratings yet
Diabetis Project
7 pages
Heart Disease Report
No ratings yet
Heart Disease Report
8 pages
Web Application
No ratings yet
Web Application
13 pages
Stroke Prediction
No ratings yet
Stroke Prediction
10 pages
Assignment 1
No ratings yet
Assignment 1
10 pages
Logistic Regression
No ratings yet
Logistic Regression
12 pages
vertopal.com_Project_16_Calories_Burnt_Prediction
No ratings yet
vertopal.com_Project_16_Calories_Burnt_Prediction
10 pages
Practical 1
No ratings yet
Practical 1
7 pages
C2 W4 Lab 02 Tree Ensemble
No ratings yet
C2 W4 Lab 02 Tree Ensemble
16 pages
Heart Disease Classification ML Assignment - Jupyter Notebook
No ratings yet
Heart Disease Classification ML Assignment - Jupyter Notebook
7 pages
Heart Disease
No ratings yet
Heart Disease
37 pages
Python for Machine Learning Visualization 1735231185
No ratings yet
Python for Machine Learning Visualization 1735231185
69 pages
Cardiovascular_Disease_Prediction
No ratings yet
Cardiovascular_Disease_Prediction
2 pages
Ml practicals
No ratings yet
Ml practicals
21 pages
Python Datavisualization
No ratings yet
Python Datavisualization
69 pages
baseline.ipynb - Colab
No ratings yet
baseline.ipynb - Colab
5 pages
HEART DISEASE CLASSIFICATION USING ANN HANDS-ON
No ratings yet
HEART DISEASE CLASSIFICATION USING ANN HANDS-ON
7 pages
KNN - Jupyter Notebook (1)
No ratings yet
KNN - Jupyter Notebook (1)
7 pages
project code health sleep lifestyle
No ratings yet
project code health sleep lifestyle
4 pages
Dataset Documentation
No ratings yet
Dataset Documentation
3 pages
Logistic Regression 205
No ratings yet
Logistic Regression 205
8 pages
5
No ratings yet
5
5 pages
COMP5318
No ratings yet
COMP5318
42 pages
Exp 5
No ratings yet
Exp 5
7 pages
QUIZ Week 2 CART Practice PDF
No ratings yet
QUIZ Week 2 CART Practice PDF
10 pages
LAB8_LogisticReg_HeartDisease[1]
No ratings yet
LAB8_LogisticReg_HeartDisease[1]
31 pages
Lecture-4 (Day 3) - Pandas
No ratings yet
Lecture-4 (Day 3) - Pandas
4 pages
Model2.ipynb - Colab
No ratings yet
Model2.ipynb - Colab
11 pages
Problem Statement
No ratings yet
Problem Statement
2 pages
Adaboost 2
No ratings yet
Adaboost 2
9 pages
DWDM Lab 3
No ratings yet
DWDM Lab 3
10 pages
Amazing Java: Learn Java Quickly
From Everand
Amazing Java: Learn Java Quickly
Andrei Besedin
No ratings yet
Develop Snake & Ladder Game in an Hour (Complete Guide with Code & Design)
From Everand
Develop Snake & Ladder Game in an Hour (Complete Guide with Code & Design)
Anurag Pandey
No ratings yet
Advanced Cue Ball Control Self-Testing Program
From Everand
Advanced Cue Ball Control Self-Testing Program
Allan P. Sand
No ratings yet
s12144-025-07373-2
No ratings yet
s12144-025-07373-2
13 pages
Simulation_model_of_PID_for_DC_DC_conver
No ratings yet
Simulation_model_of_PID_for_DC_DC_conver
7 pages
dsp_exp_1_report_1
No ratings yet
dsp_exp_1_report_1
4 pages
dsp_exp_2 (1)
No ratings yet
dsp_exp_2 (1)
5 pages
F-58 Check Printout-SBOI
No ratings yet
F-58 Check Printout-SBOI
8 pages
Markov Vs Arima
No ratings yet
Markov Vs Arima
93 pages
Download ebooks file The Evolution of Consciousness Implications for Mental Health and Quality of Life 1st Edition Bjørn Grinde (Auth.) all chapters
100% (3)
Download ebooks file The Evolution of Consciousness Implications for Mental Health and Quality of Life 1st Edition Bjørn Grinde (Auth.) all chapters
55 pages
Diagnosis_of_Coronary_Heart_Disease_Through_Deep_Learning-Based_Segmentation_and_Localization_in_Computed_Tomography_Angiography
No ratings yet
Diagnosis_of_Coronary_Heart_Disease_Through_Deep_Learning-Based_Segmentation_and_Localization_in_Computed_Tomography_Angiography
17 pages
A Project Report On: Comparative Analysis of Sales Promotional Activities of Reliance Communication With Others
No ratings yet
A Project Report On: Comparative Analysis of Sales Promotional Activities of Reliance Communication With Others
66 pages
CRM Concepts
88% (8)
CRM Concepts
36 pages
The Automotive Standard ISO 26262 The Innovative D
No ratings yet
The Automotive Standard ISO 26262 The Innovative D
9 pages
HP Envy 14 Inventec Romeo DIS EV145I 6050A2316601-MB-A03 MV 0420
No ratings yet
HP Envy 14 Inventec Romeo DIS EV145I 6050A2316601-MB-A03 MV 0420
66 pages
NDT014L N-Channel Logic Level Enhancement Mode Field Effect Transistor
No ratings yet
NDT014L N-Channel Logic Level Enhancement Mode Field Effect Transistor
10 pages
Z180ZDS0100ZCC: User Manual UM004300-COR0200
No ratings yet
Z180ZDS0100ZCC: User Manual UM004300-COR0200
164 pages
A Survey of Graph Neural Networks For Recommender Systems - Challenges, Methods, and Directions
No ratings yet
A Survey of Graph Neural Networks For Recommender Systems - Challenges, Methods, and Directions
51 pages
9454Lab Manual Expt No. 6 AOA - All Pair Shortest Path
No ratings yet
9454Lab Manual Expt No. 6 AOA - All Pair Shortest Path
8 pages
Digital Opportunities For Civic Education
No ratings yet
Digital Opportunities For Civic Education
26 pages
logcat_1731540495781
No ratings yet
logcat_1731540495781
16 pages
TASK ONE.5.TABLE - Survey That Was Conducted Among 6800 Scottish People Who Were 16 Years Old or Over.c
No ratings yet
TASK ONE.5.TABLE - Survey That Was Conducted Among 6800 Scottish People Who Were 16 Years Old or Over.c
3 pages
Rainbow English Activity Book 1
No ratings yet
Rainbow English Activity Book 1
27 pages
Table Tag
No ratings yet
Table Tag
5 pages
8df79 en DiSEqC For Technicians
100% (1)
8df79 en DiSEqC For Technicians
12 pages
Questioned Document Examination Reviewer: Criminology
No ratings yet
Questioned Document Examination Reviewer: Criminology
18 pages
EE216 Electircal Engineering
100% (1)
EE216 Electircal Engineering
2 pages
65 Profile Creation Sites SEO Rocket PDF
No ratings yet
65 Profile Creation Sites SEO Rocket PDF
4 pages
MS 201T01A ENU Student Lab Manual
No ratings yet
MS 201T01A ENU Student Lab Manual
29 pages
IP Set of 30 Questions With Answers
No ratings yet
IP Set of 30 Questions With Answers
5 pages
Ey Cyber Risk Management
100% (1)
Ey Cyber Risk Management
12 pages
2013-03-15 "Joule Thief" Powered by .040 V Thermocouple - RustyBolt - Info - Wordpress
No ratings yet
2013-03-15 "Joule Thief" Powered by .040 V Thermocouple - RustyBolt - Info - Wordpress
1 page
DataStorage Lab2
No ratings yet
DataStorage Lab2
2 pages
Engineering Cost and Cost Estimating 3 VIP NASA
No ratings yet
Engineering Cost and Cost Estimating 3 VIP NASA
51 pages
Remote Alarm Notification: Moeller Intelligent Relays
No ratings yet
Remote Alarm Notification: Moeller Intelligent Relays
12 pages

Untitled2.Ipynb - Colab

Uploaded by

Untitled2.Ipynb - Colab

Uploaded by

5/11/25, 10:35 PM Untitled2.

from google.colab import files

# Automatically get filename

# Read the file

# Show DataFrame details

# Plot the target column

Choose Files heart.csv

import seaborn as sns

# 1️⃣ Missing Values Heatmap

# 2️⃣ Correlation Heatmap with increased spreading

# 3️⃣ Categorical Pivot Heatmap: Sex vs Target Count

# Adjust layout to avoid overlap

import seaborn as sns

# Subplots: 3 row, 2 column = 6 graphs

# Max Heart Rate

# Resting Blood Pressure

# Fasting Blood Sugar

from sklearn.preprocessing import MinMaxScaler

numerical_columns = ['age', 'sex', 'chest pain type', 'resting bp s', 'cholesterol',

age sex chest pain type resting bp s cholesterol \

oldpeak ST slope target

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

print("Training data size:", X_train.shape)

Training data size: (952, 11)

# Import necessary libraries

# Assume that df is already loaded and preprocessed

# Splitting the data into features (X) and target (y)

# Model 1: Logistic Regression

# Model 2: Decision Tree

# Model 3: Random Forest

# Model 4: Support Vector Machine (SVM)

Logistic Regression Accuracy: 0.8529411764705882

0 0.83 0.84 0.84 107

accuracy 0.85 238

Decision Tree Accuracy: 0.8991596638655462

0 0.86 0.93 0.89 107

accuracy 0.90 238

Random Forest Accuracy: 0.9453781512605042

0 0.96 0.92 0.94 107

accuracy 0.95 238

SVM Accuracy: 0.8571428571428571

0 0.84 0.84 0.84 107

accuracy 0.86 238

from sklearn.svm import SVC

# Model 4: Support Vector Machine (SVM)

# Logistic Regression Evaluation

# ROC Curve: Logistic Regression

# Decision Tree Evaluation

# ROC Curve: Decision Tree

# Random Forest Evaluation

# ROC Curve: Random Forest

You might also like