0% found this document useful (0 votes)

23 views7 pages

Hyperparameter Tuning

Python tunning

Uploaded by

dharam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views7 pages

Hyperparameter Tuning

Python tunning

Uploaded by

dharam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

HYPERPARAMETER TUNING

The process of finding the best set of hyperparameters for

a machine learning model
TYPES: Random Search, Grid Search, Genetic Algorithms,
Bayesian Optimization, etc. But we are going to consider
the manual search and the GridSearchCV techniques.

Hyperparameter Tuning for one model

# Import necessary libraries
import pandas as pd
from sklearn import datasets
from sklearn.model_selection import train_test_split, GridSearchCV,
RandomizedSearchCV
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score
import warnings
# Ignore all warnings
warnings.simplefilter("ignore")

# Load the Iris dataset

iris = datasets.load_iris()

# Create a DataFrame using pandas

iris_df = pd.DataFrame(data=iris.data, columns=iris.feature_names)

# Add the target column to the DataFrame

iris_df['target'] = iris.target

# Display the first few rows of the dataset

print("First few rows of the Iris dataset:")
print(iris_df.head())

First few rows of the Iris dataset:

sepal length (cm) sepal width (cm) petal length (cm) petal width
(cm) \
0 5.1 3.5 1.4
0.2
1 4.9 3.0 1.4
0.2
2 4.7 3.2 1.3
0.2
3 4.6 3.1 1.5
0.2
4 5.0 3.6 1.4
0.2

target
0 0
1 0
2 0
3 0
4 0

# specify the features and the target

X = iris.data
y = iris.target

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y,
test_size=0.6, random_state=42)

Manual Search
# Choose the model (SVM in this case) with specific hyperparameters
model = SVC(C=100, kernel='rbf', gamma=10)

# Fit your model

model.fit(X_train, y_train)

SVC(C=100, gamma=10)

y_predict = model.predict(X_test)
y_predict

array([1, 2, 2, 1, 1, 0, 1, 2, 1, 1, 2, 0, 0, 0, 0, 1, 2, 1, 1, 2, 0,
2,
0, 2, 2, 2, 2, 2, 0, 0, 2, 2, 1, 0, 0, 2, 1, 0, 0, 2, 2, 1, 1,
0,
0, 1, 1, 2, 1, 2, 1, 2, 1, 0, 2, 1, 0, 0, 2, 1, 2, 2, 0, 0, 1,
0,
1, 2, 0, 1, 2, 2, 2, 2, 1, 1, 2, 1, 0, 1, 2, 0, 0, 1, 1, 0, 2,
0,
0, 2])

accuracy = accuracy_score(y_test,y_predict)
accuracy
0.9

GridSearchCV
# Define the hyperparameter grid for GridSearchCV
param_grid_gridsearch = {
'C': [0.1, 1, 10, 100],
'kernel': ['linear', 'rbf', 'poly'],
'gamma': [0.01, 0.1, 1, 'auto']
}

# Create a new model for GridSearchCV

model_gridsearch = SVC()

# Perform GridSearchCV
grid_search = GridSearchCV(model_gridsearch,
param_grid=param_grid_gridsearch, scoring='accuracy', cv=5)
grid_search.fit(X_train, y_train)

GridSearchCV(cv=5, estimator=SVC(),
param_grid={'C': [0.1, 1, 10, 100],
'gamma': [0.01, 0.1, 1, 'auto'],
'kernel': ['linear', 'rbf', 'poly']},
scoring='accuracy')

# Get the best hyperparameters from GridSearchCV

best_params_grid = grid_search.best_params_

# Print the optimal hyperparameters

print("Optimal Hyperparameters from GridSearchCV:")
print(best_params_grid)

Optimal Hyperparameters from GridSearchCV:

{'C': 10, 'gamma': 0.01, 'kernel': 'linear'}

# Train models with the best hyperparameters

best_model_grid = grid_search.best_estimator_

# Evaluate models on the test set

y_pred_grid = best_model_grid.predict(X_test)

# Check the accuracy

accuracy_grid = accuracy_score(y_test, y_pred_grid)
accuracy_grid

0.9777777777777777
Hyperparameter Tuning for Multiple Models

Manual Search
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression

# Define the SVM model

model1 = SVC(C=0.1, kernel='linear', gamma=0.01)

# Fit the model

model1.fit(X_train, y_train)

SVC(C=0.1, gamma=0.01, kernel='linear')

# Predict the test set

y1_predict = model1.predict(X_test)
y1_predict

array([1, 0, 2, 1, 1, 0, 1, 2, 1, 1, 2, 0, 0, 0, 0, 1, 2, 1, 1, 2, 0,
2,
0, 2, 2, 2, 2, 2, 0, 0, 0, 0, 1, 0, 0, 2, 1, 0, 0, 0, 2, 1, 1,
0,
0, 1, 2, 2, 1, 2, 1, 2, 1, 0, 2, 1, 0, 0, 0, 1, 2, 0, 0, 0, 1,
0,
1, 2, 0, 1, 2, 0, 1, 2, 1, 1, 2, 1, 0, 1, 2, 0, 0, 1, 2, 0, 2,
0,
0, 1])

# Check for the accuracy

accuracy1 = accuracy_score(y_test, y1_predict)
accuracy1

0.9777777777777777

# Define the RF model

model2 = RandomForestClassifier(n_estimators=50, max_depth=10,
min_samples_split=2)

# Fit the model

model2.fit(X_train, y_train)

RandomForestClassifier(max_depth=10, n_estimators=50)
# Predict the test set
y2_predict = model2.predict(X_test)
y2_predict

array([1, 0, 2, 1, 1, 0, 1, 2, 1, 1, 2, 0, 0, 0, 0, 1, 2, 1, 1, 2, 0,
2,
0, 2, 2, 2, 2, 2, 0, 0, 0, 0, 1, 0, 0, 2, 1, 0, 0, 0, 2, 1, 1,
0,
0, 1, 1, 2, 1, 2, 1, 2, 1, 0, 2, 1, 0, 0, 0, 1, 2, 0, 0, 0, 1,
0,
1, 2, 0, 1, 2, 0, 2, 2, 1, 1, 2, 1, 0, 1, 2, 0, 0, 1, 2, 0, 2,
0,
0, 2])

# Check for the accuracy

accuracy2 = accuracy_score(y_test, y2_predict)
accuracy2

0.9666666666666667

# Define the LR model

model3 = LogisticRegression(C=0.1, penalty='l1', solver='liblinear')

# Fit the model

model3.fit(X_train, y_train)

LogisticRegression(C=0.1, penalty='l1', solver='liblinear')

# Predict the testset

y3_predict = model3.predict(X_test)
y3_predict

array([2, 0, 2, 2, 2, 0, 2, 2, 2, 2, 2, 0, 0, 0, 0, 2, 2, 2, 2, 2, 0,
2,
0, 2, 2, 2, 2, 2, 0, 0, 0, 0, 2, 0, 0, 2, 2, 0, 0, 0, 2, 2, 2,
0,
0, 2, 2, 2, 2, 2, 2, 2, 2, 0, 2, 2, 0, 0, 0, 2, 2, 0, 0, 0, 2,
0,
2, 2, 0, 2, 2, 0, 2, 2, 2, 2, 2, 2, 0, 2, 2, 0, 0, 2, 2, 0, 2,
0,
0, 2])

# Check for the accuracy

accuracy3 = accuracy_score(y_test, y3_predict)
accuracy3

0.6777777777777778
Using GridSearchCV
# Define models
models = {
'SVM': SVC(),
'Random Forest': RandomForestClassifier(),
'Logistic Regression': LogisticRegression()
}

# Define hyperparameter grids for each model

param_grid = {
'SVM': {'C': [0.1, 1, 10, 100], 'kernel': ['linear', 'rbf',
'poly'], 'gamma': [0.01, 0.1, 1, 'auto']},
'Random Forest': {'n_estimators': [10, 50, 100, 200], 'max_depth':
[None, 10, 20, 30], 'min_samples_split': [2, 5, 10]},
'Logistic Regression': {'C': [0.1, 1, 10, 100], 'penalty': ['l1',
'l2'], 'solver': ['liblinear']}
}

import warnings
# Ignore all warnings
warnings.simplefilter("ignore")

# Perform GridSearchCV for each model

best_models = {}

for name, model in models.items():

grid_search = GridSearchCV(model, param_grid=param_grid[name],
scoring='accuracy', cv=5)
grid_search.fit(X_train, y_train)
best_models[name] = grid_search.best_estimator_

# Print optimal hyperparameters for each model

print(f"{name} - Optimal Hyperparameters:
{grid_search.best_params_}")

SVM - Optimal Hyperparameters: {'C': 10, 'gamma': 0.01, 'kernel':

'linear'}
Random Forest - Optimal Hyperparameters: {'max_depth': None,
'min_samples_split': 10, 'n_estimators': 50}
Logistic Regression - Optimal Hyperparameters: {'C': 10, 'penalty':
'l2', 'solver': 'liblinear'}

# Evaluate best models on the test set

for name, model in best_models.items():
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
print(f"{name} - Test Accuracy: {accuracy}")
SVM - Test Accuracy: 0.9777777777777777
Random Forest - Test Accuracy: 0.9666666666666667
Logistic Regression - Test Accuracy: 0.9555555555555556

Thank You

Name: Clement Asare

Email: [email protected]

ORCID: 0009-0000-2684-7611

YouTube: https://bit.ly/GmathStats

Diabetes Case Study - Jupyter Notebook
100% (1)
Diabetes Case Study - Jupyter Notebook
10 pages
Data Mining Practicals
No ratings yet
Data Mining Practicals
22 pages
AML_code_for_m2
No ratings yet
AML_code_for_m2
7 pages
Hyperparameter Tuning
No ratings yet
Hyperparameter Tuning
9 pages
HyperParameterTuning
No ratings yet
HyperParameterTuning
4 pages
Supple Maximizing Performance in Cs CuBiCl
No ratings yet
Supple Maximizing Performance in Cs CuBiCl
5 pages
Lec-04-05
No ratings yet
Lec-04-05
37 pages
20. Hyperparameter_Tuning
No ratings yet
20. Hyperparameter_Tuning
3 pages
grid search
No ratings yet
grid search
48 pages
ml lab programs 2
No ratings yet
ml lab programs 2
16 pages
Classification Review
No ratings yet
Classification Review
8 pages
ML5&6&7&8&9&10
No ratings yet
ML5&6&7&8&9&10
35 pages
amll
No ratings yet
amll
1 page
Machine Learning
No ratings yet
Machine Learning
3 pages
Linearregression SVM
No ratings yet
Linearregression SVM
3 pages
8 To 12 Jaimeen
No ratings yet
8 To 12 Jaimeen
34 pages
Scikit Learn What Were Covering
No ratings yet
Scikit Learn What Were Covering
15 pages
ML Algorithms
100% (1)
ML Algorithms
1 page
Hyperparameter Tuning For Machine Learning Models
No ratings yet
Hyperparameter Tuning For Machine Learning Models
5 pages
Unit2 ML Programs
No ratings yet
Unit2 ML Programs
7 pages
Implementing Custom Randomsearchcv: 'Red' 'Blue'
No ratings yet
Implementing Custom Randomsearchcv: 'Red' 'Blue'
1 page
Grid Search Steps and Example
No ratings yet
Grid Search Steps and Example
1 page
3.2 Grid Search
No ratings yet
3.2 Grid Search
28 pages
DA_012307
No ratings yet
DA_012307
8 pages
ANN_EXPERIENTIAL_LEARNING
No ratings yet
ANN_EXPERIENTIAL_LEARNING
43 pages
Heart Disease 50% Code
No ratings yet
Heart Disease 50% Code
3 pages
AI Note
No ratings yet
AI Note
5 pages
ML Assignment 4
No ratings yet
ML Assignment 4
7 pages
Reference guide- Validation & cross-validation
No ratings yet
Reference guide- Validation & cross-validation
7 pages
BTVN5_Code
No ratings yet
BTVN5_Code
2 pages
Updated Lecture 12 Zainab
No ratings yet
Updated Lecture 12 Zainab
17 pages
ML Codes
No ratings yet
ML Codes
9 pages
1
No ratings yet
1
13 pages
Prathamesh KRAI
No ratings yet
Prathamesh KRAI
38 pages
ml using python programs
No ratings yet
ml using python programs
12 pages
ML INTERNAL ANSWERS
No ratings yet
ML INTERNAL ANSWERS
9 pages
frmCourseSyllabusIPDownload (2)
No ratings yet
frmCourseSyllabusIPDownload (2)
3 pages
To Improve The Performance of Models Predicting Ba
No ratings yet
To Improve The Performance of Models Predicting Ba
6 pages
AI ML - Cycle 2 Programs (1)
No ratings yet
AI ML - Cycle 2 Programs (1)
15 pages
QB 1
No ratings yet
QB 1
11 pages
MLfull
No ratings yet
MLfull
29 pages
decision tree
No ratings yet
decision tree
6 pages
vertopal.com_Untitled57
No ratings yet
vertopal.com_Untitled57
4 pages
ML_4,5 (1)
No ratings yet
ML_4,5 (1)
5 pages
ML EXTERNAL XEROX
No ratings yet
ML EXTERNAL XEROX
1 page
Predictive Modeling Machine Learning
No ratings yet
Predictive Modeling Machine Learning
16 pages
ML chap 5
No ratings yet
ML chap 5
14 pages
Data Science
No ratings yet
Data Science
8 pages
MlLabManualdocx 2024 09 04 22 02 58
No ratings yet
MlLabManualdocx 2024 09 04 22 02 58
19 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
ML Shristi File
No ratings yet
ML Shristi File
49 pages
Aiml Ex 4-7
No ratings yet
Aiml Ex 4-7
8 pages
Cheat Sheet Building Supervised Learning Models
No ratings yet
Cheat Sheet Building Supervised Learning Models
3 pages
Tuning A CART's Hyperparameters: Elie Kawerk
No ratings yet
Tuning A CART's Hyperparameters: Elie Kawerk
26 pages
SVM K NN MLP With Sklearn Jupyter NoteBo
No ratings yet
SVM K NN MLP With Sklearn Jupyter NoteBo
22 pages
ML NEW Final Format
No ratings yet
ML NEW Final Format
37 pages
Assignment 1
No ratings yet
Assignment 1
17 pages
AAM CODES
No ratings yet
AAM CODES
8 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
309 Assignment2 Fall23
No ratings yet
309 Assignment2 Fall23
2 pages
Super VIP Cheatsheet - Deep Learning
No ratings yet
Super VIP Cheatsheet - Deep Learning
47 pages
Queuing Formulas PDF
No ratings yet
Queuing Formulas PDF
6 pages
07 Neural Networks1
No ratings yet
07 Neural Networks1
73 pages
SISO-MIMO Design Examples
No ratings yet
SISO-MIMO Design Examples
14 pages
TEAM MEMBERS Noopur Sharma Vartika Singh Vivashwat Thakur
No ratings yet
TEAM MEMBERS Noopur Sharma Vartika Singh Vivashwat Thakur
13 pages
Random Variable
No ratings yet
Random Variable
10 pages
Long Short Term Memory (LSTM)
No ratings yet
Long Short Term Memory (LSTM)
33 pages
Introduction To Recurrent Neural Network
No ratings yet
Introduction To Recurrent Neural Network
9 pages
Feedforward Neural Networks - Part 1 - Parveen Khurana - Medium
No ratings yet
Feedforward Neural Networks - Part 1 - Parveen Khurana - Medium
53 pages
AI in Marketing Industry Course Curriculum
No ratings yet
AI in Marketing Industry Course Curriculum
17 pages
Bee4333 Intelligent Control: Artificial Neural Network (ANN)
No ratings yet
Bee4333 Intelligent Control: Artificial Neural Network (ANN)
76 pages
Unit1_TDL_compressed (1)
No ratings yet
Unit1_TDL_compressed (1)
402 pages
Inheritance Sample Programs
No ratings yet
Inheritance Sample Programs
7 pages
Sta416 Topic 5 3
No ratings yet
Sta416 Topic 5 3
22 pages
Sta2604 2018 TL 103 2 B
No ratings yet
Sta2604 2018 TL 103 2 B
10 pages
Object-Oriented Systems Analysis and Design Using UML
No ratings yet
Object-Oriented Systems Analysis and Design Using UML
81 pages
NOTES SQQS1043 CHAPTER 5 - Student
No ratings yet
NOTES SQQS1043 CHAPTER 5 - Student
74 pages
Recurrent Neural Network: SUBMITTED BY: Harmanjeet Singh ROLL NO - 1803448 B.Tech, Cse (7) Ctiemt, Shahpur (Jalandhar)
No ratings yet
Recurrent Neural Network: SUBMITTED BY: Harmanjeet Singh ROLL NO - 1803448 B.Tech, Cse (7) Ctiemt, Shahpur (Jalandhar)
11 pages
Unit 2: Role of Lexical Analyzer
No ratings yet
Unit 2: Role of Lexical Analyzer
11 pages
Main Linear Models of Time Series
No ratings yet
Main Linear Models of Time Series
30 pages
Time Series QBank
No ratings yet
Time Series QBank
6 pages
8 CNN Example
No ratings yet
8 CNN Example
33 pages
State Diagram
No ratings yet
State Diagram
8 pages
Neural - N - Problems - MLP
No ratings yet
Neural - N - Problems - MLP
15 pages
Construction of DFA
No ratings yet
Construction of DFA
70 pages
On Families of Generalized Pareto Distributions: Properties and Applications
No ratings yet
On Families of Generalized Pareto Distributions: Properties and Applications
20 pages
Box-Jenkins Methodology Forecasting Basics
No ratings yet
Box-Jenkins Methodology Forecasting Basics
11 pages
Neural Networks - Basics Matlab PDF
No ratings yet
Neural Networks - Basics Matlab PDF
59 pages
ML model set 1
No ratings yet
ML model set 1
2 pages

Hyperparameter Tuning

Uploaded by

Hyperparameter Tuning

Uploaded by

HYPERPARAMETER TUNING

The process of finding the best set of hyperparameters for

Hyperparameter Tuning for one model

# Load the Iris dataset

# Create a DataFrame using pandas

# Add the target column to the DataFrame

# Display the first few rows of the dataset

First few rows of the Iris dataset:

# specify the features and the target

# Split the data into training and testing sets

# Fit your model

# Create a new model for GridSearchCV

# Get the best hyperparameters from GridSearchCV

# Print the optimal hyperparameters

Optimal Hyperparameters from GridSearchCV:

# Train models with the best hyperparameters

# Evaluate models on the test set

# Check the accuracy

# Define the SVM model

# Fit the model

SVC(C=0.1, gamma=0.01, kernel='linear')

# Predict the test set

# Check for the accuracy

# Define the RF model

# Fit the model

# Check for the accuracy

# Define the LR model

# Fit the model

LogisticRegression(C=0.1, penalty='l1', solver='liblinear')

# Predict the testset

# Check for the accuracy

# Define hyperparameter grids for each model

# Perform GridSearchCV for each model

for name, model in models.items():

# Print optimal hyperparameters for each model

SVM - Optimal Hyperparameters: {'C': 10, 'gamma': 0.01, 'kernel':

# Evaluate best models on the test set

Name: Clement Asare

You might also like