
FUNDAMENTALS OF DEEP LEARNING

Regularization for Neural Networks

Did I practice the problems, or did I just memorize the solved examples?

Take a deep breath, you’re overfitting.
Shivang Kainthola
Regularization in Neural Networks

⟶ When a neural network or machine learning model performs too well on the training data but fails to generalize to the testing data, the problem is called overfitting.

⟶ Overfitting is a common issue when training deep neural networks, or indeed any machine learning model.

Overfitting in a neural network often appears alongside related problems such as:
1) Internal Covariate Shift

2) Co-Adaptation

3) Large Weights

⟶ To counter overfitting in neural networks, we use several regularization techniques:
1) Batch Normalization

2) Dropout

3) L2 Regularization

4) L1 Regularization
1) Batch Normalization

Problem : Internal Covariate Shift

During the training of a neural network by backpropagation, the parameters (weights and biases) are updated based on the error calculated by the loss function.
⟶ The activations of each layer depend on the inputs and parameters, both of which are changing during training.
⟶ Since the parameters as well as the inputs to each layer are constantly being updated, the distribution of inputs to each layer keeps shifting; this shift is called internal covariate shift.

Internal covariate shift can slow down training and lead to instability.

https://kwokanthony.medium.com/batch-normalization-in-neural-network-simply-explained-115fe281f4cd
Solution : Batch Normalization

⟶ A batch or mini-batch is a collection of samples that is passed through the network together for a single weight update.

⟶ Batch Normalization is a regularization technique where we normalize the inputs to a layer for every mini-batch.

⟶ Normalizing the data involves transforming it to have mean = 0 and standard deviation = 1.
⟶ Besides tackling internal covariate shift, it also stabilizes gradient descent and often allows higher learning rates.

https://medium.com/@abheerchrome/batch-normalization-explained-1e78f7eb1e8a

⟶ In a convolutional neural network, Batch Normalization is carried out with a normalization layer placed after the convolution layer.
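
A minimal sketch of this placement, assuming the Keras API (the slides do not name a specific framework); the layer sizes and input shape are illustrative only:

from tensorflow.keras import layers, models

# Convolution layer followed by a BatchNormalization layer, which normalizes
# each mini-batch of activations to mean 0 and standard deviation 1 (and then
# applies a learned scale and shift).
model = models.Sequential([
    layers.Conv2D(32, kernel_size=3, activation="relu", input_shape=(28, 28, 1)),
    layers.BatchNormalization(),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")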
2) Dropout

Problem : Co-adaptation relationships

⟶ Co-adaptation is a situation where neurons become excessively reliant on one another.
⟶ To work properly, the neurons rely on the input of other co-adapted neurons.
⟶ Because of this co-adaptation, the network tends to fit the training data too closely and may underperform on the testing data.

The co-adapted neurons X and Y can become dependent on each other.


Solution : Dropout

⟶ The ‘dropout’ method works by randomly setting a fraction of the input units (neurons) in a layer to zero, i.e. dropping them out, during each training iteration.

⟶ This dropped-out fraction of neurons does not take part in the forward pass (activation computation) or in backpropagation (gradient updates) for that iteration.

⟶ With dropout, the neurons are denied the convenience of making co-adaptations and relying on other neurons, since they must learn more robust features on their own.

⟶ The dropout method is applied to a neural network as a Dropout( ) layer, which takes the fraction of neurons to be dropped out as its argument.
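
A minimal sketch, again assuming the Keras API; the drop rates 0.5 and 0.3 are illustrative values, not ones given in the slides:

from tensorflow.keras import layers, models

# Dropout(0.5) zeroes a random 50% of the previous layer's outputs on each
# training step; at inference time the layer passes activations through
# unchanged (outputs are rescaled during training to compensate).
model = models.Sequential([
    layers.Dense(256, activation="relu", input_shape=(784,)),
    layers.Dropout(0.5),
    layers.Dense(128, activation="relu"),
    layers.Dropout(0.3),
    layers.Dense(10, activation="softmax"),
])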
3) L2 Regularization

Problem : Large weights

⟶ L2 Regularization is a popular regularization technique which penalizes models with large parameter (weight) values.

⟶ It works by adding an L2 regularization term to the loss function, so that when the loss is minimized by gradient descent, the network is steered away from having large weights.

⟶ The loss function L of a neural network with the L2 regularization term will be:

L = L_0 + λ * ||w||^2

where L_0 is the original (unregularized) loss and ||w||^2 represents the squared L2 norm of the weights (the sum of squares of all elements in the weight vector).

⟶ The strength of the penalty is controlled by the parameter lambda (λ), and regularization penalties are applied on a per-layer basis.
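
A minimal sketch of a per-layer L2 penalty, assuming the Keras API; λ = 0.01 is an illustrative value:

from tensorflow.keras import layers, models, regularizers

# kernel_regularizer=regularizers.l2(0.01) adds 0.01 * sum(w^2) over this
# layer's weights to the training loss, penalizing large weights.
model = models.Sequential([
    layers.Dense(128, activation="relu", input_shape=(784,),
                 kernel_regularizer=regularizers.l2(0.01)),
    layers.Dense(10, activation="softmax"),
])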
4) L1 Regularization

⟶ L1 regularization penalizes large weights by adding an L1 regularization term to the loss function, similar in working to L2 regularization.

⟶ The loss function L of a neural network with the L1 regularization term will be:

L = L_0 + λ * ||w||_1

where L_0 is the original (unregularized) loss and ||w||_1 represents the L1 norm of the weights, i.e. the sum of the absolute values of all elements in the weight vector (w).

⟶ It can be combined with L2 regularization, a combination often known as Elastic Net regularization.
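
A minimal sketch, assuming the Keras API; the penalty strengths are illustrative:

from tensorflow.keras import layers, models, regularizers

# regularizers.l1 applies the L1 penalty (sum of absolute weight values);
# regularizers.l1_l2 combines L1 and L2 penalties, Elastic Net style.
model = models.Sequential([
    layers.Dense(128, activation="relu", input_shape=(784,),
                 kernel_regularizer=regularizers.l1(0.01)),
    layers.Dense(64, activation="relu",
                 kernel_regularizer=regularizers.l1_l2(l1=0.01, l2=0.01)),
    layers.Dense(10, activation="softmax"),
])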
