Fine-Tuning Hyperparameters
• The settings that you adjust across successive training runs (as opposed to the weights the model learns) are called hyperparameters.
• In recent times, there have been several research advancements in both deep
learning and neural networks
• These deep neural networks help developers to achieve more sustainable and
high-quality results.
• An artificial neural network (ANN), or a simple traditional neural network,
aims to solve trivial tasks with straightforward data.
• These networks usually consist of an input layer, one to two hidden layers,
and an output layer.
• For more complex problems, we use deep neural networks, which often have a
complex hidden-layer structure with a wide variety of layer types.
• These additional layers help the model understand the problem better and
produce better solutions for complex tasks.
• A deep neural network has more layers (more depth) than a traditional ANN, and each
layer adds complexity to the model.
# Importing the necessary functionality
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Input, Dense, Conv2D
from tensorflow.keras.layers import Flatten, MaxPooling2D
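As a minimal sketch of how these imports fit together (the input shape, layer sizes, and number of output classes are illustrative assumptions, not values given above), a small convolutional model could look like:

# A minimal, illustrative model using the imports above
# (assumes 28x28 grayscale inputs and 10 output classes).
model = Sequential([
    Input(shape=(28, 28, 1)),
    Conv2D(32, kernel_size=(3, 3), activation='relu'),
    MaxPooling2D(pool_size=(2, 2)),
    Flatten(),
    Dense(64, activation='relu'),
    Dense(10, activation='softmax'),
])

model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
model.summary()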
Training DNNs:
• There are certain practices in Deep Learning that are highly recommended
in order to train Deep Neural Networks efficiently.
• Training data:
• A lot of ML practitioners are in the habit of throwing raw training data at any Deep Neural
Network (DNN), assuming that any DNN would (presumably) still give good results.
• In practice, given the right kind of data, a fairly simple model will often provide better and faster
results than a complex DNN.
• So, whether you are working with Computer Vision, Natural Language
Processing, Statistical Modelling, etc., try to preprocess your raw data.
A few measures one can take to get better training data:
• Get your hands on as large a dataset as possible (DNNs are quite data-hungry: more is
better).
• Remove any training sample with corrupted data (short texts, highly distorted images,
spurious output labels, features with lots of null values, etc.); a small cleanup sketch follows this list.
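A minimal sketch of this kind of cleanup (the file name, column names, and thresholds are assumptions for illustration, not part of the text):

import pandas as pd

# Load a hypothetical raw dataset (file and column names are illustrative).
df = pd.read_csv("raw_training_data.csv")

# Drop rows whose features are mostly null (keep rows with >= 80% non-null values).
df = df.dropna(thresh=int(0.8 * df.shape[1]))

# Remove very short texts (assumes a 'text' column).
df = df[df["text"].str.len() >= 20]

# Keep only rows whose label belongs to the known label set (assumes a 'label' column).
valid_labels = {"positive", "negative", "neutral"}
df = df[df["label"].isin(valid_labels)]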
• For years, sigmoid activation functions were the preferred choice. But a sigmoid
function is inherently limited by two drawbacks: 1. saturation of the sigmoid at its
tails (which in turn causes the vanishing gradient problem), and 2. outputs that are not
zero-centered, which slows down gradient-based learning.
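This is one reason ReLU-style activations are now the common default for hidden layers, with sigmoid typically reserved for a binary output layer. A minimal Keras sketch (the layer sizes and input dimension are assumptions):

import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Input, Dense

# ReLU in the hidden layers avoids the saturation that sigmoid suffers at its tails;
# sigmoid is kept only for the final layer of a binary classifier.
model = Sequential([
    Input(shape=(100,)),             # 100 input features (assumed)
    Dense(64, activation='relu'),
    Dense(64, activation='relu'),
    Dense(1, activation='sigmoid'),  # binary output
])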
• Number of Hidden Units and Layers:
• Keeping a somewhat larger number of hidden units than the optimal number is generally a
safe choice.
• On the other hand, with a smaller number of hidden units (than the optimal
number), there is a higher chance of underfitting the model.
• By increasing the number of hidden units, the model will have the required flexibility to filter
out the most appropriate information from these pre-trained representations (a small sketch of varying width and depth follows).
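One convenient way to experiment with width and depth is to make them parameters of a small builder function. This is a sketch; the sizes shown are assumptions, not recommendations:

import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Input, Dense

def build_mlp(num_hidden_layers=2, hidden_units=128, input_dim=100, num_classes=10):
    """Build a simple MLP whose depth and width are easy to vary."""
    model = Sequential([Input(shape=(input_dim,))])
    for _ in range(num_hidden_layers):
        model.add(Dense(hidden_units, activation='relu'))
    model.add(Dense(num_classes, activation='softmax'))
    model.compile(optimizer='adam',
                  loss='sparse_categorical_crossentropy',
                  metrics=['accuracy'])
    return model

# A wider model than you think you need is usually a safer starting point
# than one that is too narrow and underfits.
wide_model = build_mlp(hidden_units=256)
narrow_model = build_mlp(hidden_units=16)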
• Hyperparameter Tuning: Grid Search & Random Search:
1. Based on your prior experience, you can manually tune some common
hyperparameters like learning rate, number of layers, etc.
• “Training a Deep Learning model for multiple epochs will result in a better model” - we
have heard this a few times, but how do we quantify how many epochs are enough? One practical
answer is to stop training once validation performance stops improving, as in the sketch below.
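A minimal sketch of random search over two hyperparameters (learning rate and number of layers), with early stopping deciding the number of epochs automatically. The search ranges, toy data, and trial budget are assumptions for illustration:

import numpy as np
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Input, Dense

# Toy data standing in for a real training set (1000 samples, 20 features).
x_train = np.random.rand(1000, 20)
y_train = np.random.randint(0, 2, size=1000)

best_val, best_config = 0.0, None
for trial in range(10):                          # 10 random trials (assumed budget)
    lr = 10 ** np.random.uniform(-4, -2)         # learning rate sampled log-uniformly
    num_layers = np.random.randint(1, 4)         # 1 to 3 hidden layers

    model = Sequential([Input(shape=(20,))])
    for _ in range(num_layers):
        model.add(Dense(64, activation='relu'))
    model.add(Dense(1, activation='sigmoid'))
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=lr),
                  loss='binary_crossentropy', metrics=['accuracy'])

    # Early stopping answers "how many epochs?": stop when validation loss
    # stops improving instead of fixing the epoch count in advance.
    stopper = tf.keras.callbacks.EarlyStopping(monitor='val_loss', patience=3,
                                               restore_best_weights=True)
    history = model.fit(x_train, y_train, validation_split=0.2, epochs=50,
                        callbacks=[stopper], verbose=0)

    val_acc = max(history.history['val_accuracy'])
    if val_acc > best_val:
        best_val = val_acc
        best_config = {'learning_rate': lr, 'num_layers': num_layers}

print("Best configuration found:", best_config, "val_accuracy:", best_val)

Grid search differs only in that the candidate values come from a fixed grid instead of being sampled at random; random search tends to cover wide ranges of continuous hyperparameters (like the learning rate) more efficiently.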