AAM 6th Prac

The document outlines the implementation of the Random Forest Algorithm using Python and scikit-learn, detailing steps from data preprocessing to model evaluation. It includes data loading, feature encoding, model fitting, prediction, and performance metrics such as accuracy and classification report. Additionally, it highlights the importance of using RandomForestClassifier for real-world applications and discusses optional feature scaling and visualization techniques.

6. Implement the Random Forest Algorithm using the following steps:

a. Data Preprocessing Step

b. Fitting the Random Forest Algorithm to the Training Set

c. Predicting the Test Set Result

d. Creating the Confusion Matrix

e. Visualizing the Training Set Result

f. Visualizing the Test Set Result

import pandas as pd

from sklearn.model_selection import train_test_split

from sklearn.ensemble import RandomForestClassifier

from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

import warnings

warnings.filterwarnings('ignore')

# URL for the dataset

url = "https://raw.githubusercontent.com/datasciencedojo/datasets/master/titanic.csv"

titanic_data = pd.read_csv(url)

# Drop rows with missing 'Survived' values

titanic_data = titanic_data.dropna(subset=['Survived'])

# Features and target variable (copy to avoid SettingWithCopyWarning on later assignments)

X = titanic_data[['Pclass', 'Sex', 'Age', 'SibSp', 'Parch', 'Fare']].copy()

y = titanic_data['Survived']

# Encode 'Sex' column

X['Sex'] = X['Sex'].map({'female': 0, 'male': 1})

# Fill missing 'Age' values with the median (assign back rather than chained inplace fillna)

X['Age'] = X['Age'].fillna(X['Age'].median())


# Split data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Initialize RandomForestClassifier

rf_classifier = RandomForestClassifier(n_estimators=100, random_state=42)

# Fit the classifier to the training data

rf_classifier.fit(X_train, y_train)

# Make predictions

y_pred = rf_classifier.predict(X_test)

# Calculate accuracy, confusion matrix, and classification report

accuracy = accuracy_score(y_test, y_pred)

conf_matrix = confusion_matrix(y_test, y_pred)

classification_rep = classification_report(y_test, y_pred)

# Print the results

print(f"Accuracy: {accuracy:.2f}")

print("\nConfusion Matrix:\n", conf_matrix)

print("\nClassification Report:\n", classification_rep)

# Sample prediction

sample = X_test.iloc[0:1] # Keep as DataFrame to match model input format

prediction = rf_classifier.predict(sample)

# Retrieve and display the sample

sample_dict = sample.iloc[0].to_dict()

print(f"\nSample Passenger: {sample_dict}")

print(f"Predicted Survival: {'Survived' if prediction[0] == 1 else 'Did Not Survive'}")

Notes:
- Scikit-learn's RandomForestClassifier: the code uses the highly optimized and well-tested RandomForestClassifier from scikit-learn, which is strongly recommended over a manual implementation for any real-world use.
- Clearer data handling: the code explicitly separates the features (X) from the target variable (y).
- Feature scaling (optional): Random Forest often does not require feature scaling, because tree-based splits are insensitive to the scale of individual features. For some datasets it may still improve performance, so if you are unsure, experiment both ways (e.g., with StandardScaler).
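If you do want to try scaling, a minimal sketch could look like the following. The toy feature matrix here is an illustrative stand-in for the Titanic columns, not data from the script above:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Toy feature matrix standing in for two numeric columns (e.g. Age and Fare)
X_demo = np.array([[22.0, 7.25],
                   [38.0, 71.28],
                   [26.0, 7.92],
                   [35.0, 53.10]])

scaler = StandardScaler()
X_scaled = scaler.fit_transform(X_demo)

# After scaling, each column has mean ~0 and unit variance
print(X_scaled.mean(axis=0))  # close to [0. 0.]
print(X_scaled.std(axis=0))   # close to [1. 1.]
```

In a real pipeline the scaler would be fit on X_train only and then applied to X_test, to avoid leaking test-set statistics into training.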
- Adjustable parameters: RandomForestClassifier has parameters you can tune, such as n_estimators, criterion, and max_depth. n_estimators is the number of trees in the forest; criterion is the function used to measure the quality of a split ("gini" or "entropy"); random_state ensures you get the same results each time you run the code.
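One common way to tune these parameters is a small grid search with cross-validation. The sketch below uses a synthetic dataset and an illustrative parameter grid (both are assumptions, not values from the script above):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in dataset with six features, like the Titanic example
X_demo, y_demo = make_classification(n_samples=200, n_features=6, random_state=42)

# Small illustrative grid over the parameters mentioned above
param_grid = {
    "n_estimators": [50, 100],
    "criterion": ["gini", "entropy"],
    "max_depth": [3, None],
}

search = GridSearchCV(RandomForestClassifier(random_state=42), param_grid, cv=3)
search.fit(X_demo, y_demo)

# Best combination found by 3-fold cross-validation
print(search.best_params_)
```

search.best_estimator_ can then be used directly for prediction, already refit on the full training data.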
- Confusion matrix: the code calculates and prints the confusion matrix, which is essential for evaluating classification model performance.
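To make the layout concrete, here is a tiny sketch with hand-picked labels (the toy arrays are illustrative, not output from the script above):

```python
from sklearn.metrics import confusion_matrix

# Toy ground truth and predictions (0 = did not survive, 1 = survived)
y_true = [0, 1, 1, 0, 1, 0]
y_pred = [0, 1, 0, 0, 1, 1]

cm = confusion_matrix(y_true, y_pred)
# Rows are true classes, columns are predicted classes:
# cm[0][0] = true negatives,  cm[0][1] = false positives
# cm[1][0] = false negatives, cm[1][1] = true positives
print(cm)
# → [[2 1]
#    [1 2]]
```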
- Accuracy: the code calculates and prints the accuracy score.
- Visualization: a decision-boundary plot built with meshgrid and contourf can be visually appealing and informative, but it is only practical when the model uses two features. For higher-dimensional data, such as the six features used here, visualization becomes much more complex.
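A minimal sketch of such a plot, assuming a two-feature synthetic dataset in place of the Titanic data (the dataset, grid step, and output filename are all illustrative assumptions):

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen, no display needed
import matplotlib.pyplot as plt
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Two synthetic features so the decision boundary is plottable
X2, y2 = make_classification(n_samples=200, n_features=2, n_informative=2,
                             n_redundant=0, random_state=42)
clf = RandomForestClassifier(n_estimators=100, random_state=42).fit(X2, y2)

# Evaluate the model over a dense grid covering the feature space
x_min, x_max = X2[:, 0].min() - 1, X2[:, 0].max() + 1
y_min, y_max = X2[:, 1].min() - 1, X2[:, 1].max() + 1
xx, yy = np.meshgrid(np.arange(x_min, x_max, 0.05),
                     np.arange(y_min, y_max, 0.05))
Z = clf.predict(np.c_[xx.ravel(), yy.ravel()]).reshape(xx.shape)

# Shade regions by predicted class, then overlay the training points
plt.contourf(xx, yy, Z, alpha=0.3, cmap="coolwarm")
plt.scatter(X2[:, 0], X2[:, 1], c=y2, cmap="coolwarm", edgecolor="k", s=20)
plt.xlabel("Feature 1")
plt.ylabel("Feature 2")
plt.title("Random Forest decision boundary (2 features)")
plt.savefig("rf_decision_boundary.png")
```

The same pattern works for the training and test sets separately (steps e and f) by scattering X_train or X_test over the fitted boundary.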
- Comments and explanations: the code has detailed comments explaining each step.
