0% found this document useful (0 votes)

7 views

chapter4 (1)

The document provides an overview of loading and processing data for deep learning using PyTorch, focusing on an animals dataset. It covers defining input features, target values, creating a TensorDataset, and using DataLoader for batching. Additionally, it discusses model evaluation metrics, overfitting, and strategies to improve model performance, including regularization techniques and hyperparameter tuning.

Uploaded by

hunglaikcad1247

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views

chapter4 (1)

Uploaded by

hunglaikcad1247

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 34

A deeper dive into

loading data
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Scientist
Back to our animals dataset
import pandas as pd
pd.read_csv('animals.csv')

animal_name hair feathers eggs milk predator fins legs tail type
skimmer 0 1 1 0 1 0 2 1 2
gull 0 1 1 0 1 0 2 1 2
seahorse 0 0 1 0 0 1 0 1 4
tuatara 0 0 1 0 1 0 4 1 3
squirrel 1 0 0 1 0 0 2 1 1

Type key: mammal (1), bird (2), reptile (3), fish (4), amphibian (5), bug (6), invertebrate (7).

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Back to our animals dataset: defining features
import numpy as np
# Define input features
features = animals.iloc[:, 1:-1]
X = features.to_numpy()
print(X)

array([[0, 1, 1, 0, 1, 0, 2, 1],
[0, 1, 1, 0, 1, 0, 2, 1],
[0, 0, 1, 0, 0, 1, 0, 1],
[0, 0, 1, 0, 1, 0, 4, 1],
[1, 0, 0, 1, 0, 0, 2, 1]])

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Back to our animals dataset: defining target values
# Define target features (ground truth)
target = animals.iloc[:, -1]
y = target.to_numpy()

array([2, 2, 4, 3, 1])

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Recalling TensorDataset
import torch
from torch.utils.data import TensorDataset

# Instantiate dataset class

dataset = TensorDataset(torch.tensor(X).float(), torch.tensor(y).float())

# Access an individual sample

sample = dataset[0]
input_sample, label_sample = sample
print('input sample:', input_sample)
print('label_sample:', label_sample)

input sample: tensor([0., 1., 1., 0., 1., 0., 2., 1.])
label_sample: tensor(2.)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Recalling DataLoader
from torch.utils.data import DataLoader

batch_size = 2
shuffle = True

# Create a DataLoader
dataloader = DataLoader(dataset, batch_size=batch_size, shuffle=shuffle)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Recalling DataLoader
# Iterate over the dataloader
for batch_inputs, batch_labels in dataloader:
print('batch inputs', batch_inputs)
print('batch labels', batch_labels)

batch inputs: tensor([[0., 0., 1., 0., 0., 1., 0., 1.],
[0., 1., 1., 0., 1., 0., 2., 1.]])
batch labels: tensor([4., 2.])
batch inputs: tensor([[0., 1., 1., 0., 1., 0., 2., 1.],
[1., 0., 0., 1., 0., 0., 2., 1.]])
batch labels: tensor([2., 1.])
batch inputs: tensor([[0., 0., 1., 0., 1., 0., 4., 1.]])
batch labels: tensor([3.])

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH
Evaluating model
performance
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Scientist
Training, validation and testing
Raw dataset is usually split in three subsets:

Percent of data Role

Training 80-90% Used to adjust the model's parameters
Validation 10-20% Used for hyperparameter tuning
Testing 5-10% Only used once to calculate final metrics

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Model evaluation metrics
In this video, we'll focus on evaluating: In classification, accuracy measures how
Loss well model correctly predicts ground truth
Training labels

Validation

Accuracy
Training

Validation

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Calculating training loss
For each epoch: training_loss = 0.0
we sum up the loss for each iteration of for i, data in enumerate(trainloader, 0):
the training set dataloader # Run the forward pass
...
at the end of the epoch, we calculate the
mean training loss # Calculate the loss
loss = criterion(outputs, labels)
# Calculate the gradients
...
# Calculate and sum the loss
training_loss += loss.item()
epoch_loss = training_loss / len(trainloader)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Calculating validation loss
After the training epoch, we iterate over the validation set and calculate the average
validation loss

validation_loss = 0.0
model.eval() # Put model in evaluation mode
with torch.no_grad(): # Speed up the forward pass
for i, data in enumerate(validationloader, 0):
# Run the forward pass
...
# Calculate the loss
loss = criterion(outputs, labels)
validation_loss += loss.item()
epoch_loss = validation_loss / len(validationloader)
model.train()

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Overfitting

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Calculating accuracy with torchmetrics
import torchmetrics

# Create accuracy metric using torch metrics

metric = torchmetrics.Accuracy(task="multiclass", num_classes=3)
for i, data in enumerate(dataloader, 0):
features, labels = data
outputs = model(features)
# Calculate accuracy over the batch
acc = metric(outputs, labels.argmax(dim=-1))
# Calculate accuracy over the whole epoch
acc = metric.compute()
print(f"Accuracy on all data: {acc}")
# Reset the metric for the next epoch (training or validation)
metric.reset()

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH
Fighting overfitting
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Scientist
Reasons for overfitting
Overfitting: the model does not generalize to unseen data.
model memorizes training data

good performances on the training set / poor performances on the validation set

Possible causes:

Problem Solutions
Dataset is not large enough Get more data / use data augmentation
Model has too much capacity Reduce model size / add dropout
Weights are too large Weight decay

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Fighting overfitting
Strategies:

Reducing model size or adding dropout layer

Using weight decay to force parameters to remain small

Obtaining new data or augmenting data

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

"Regularization" using a dropout layer
Randomly zeroes out elements of the input tensor during training

model = nn.Sequential(nn.Linear(8, 4),

nn.ReLU(),
nn.Dropout(p=0.5))
features = torch.randn((1, 8))
model(i)

tensor([[1.4655, 0.0000, 0.0000, 0.8456]], grad_fn=<MulBackward0>)

Dropout is added after the activation function

Behaves differently during training and evaluation; we must remember to switch modes
using model.train() and model.eval()

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Regularization with weight decay
optimizer = optim.SGD(model.parameters(), lr=1e-3, weight_decay=1e-4)

Optimizer's weight_decay parameter takes values between zero and one

Typically small value, e.g. 1e-3

Weight decay adds penalty to loss function to discourage large weights and biases

The higher the parameter, the less likely the model is to overfit

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Data augmentation

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH
Improving model
performance
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Scientist
Steps to maximize performance
Overfit the training set
can we solve the problem?

sets a performance baseline

Reduce overfitting
improve performances on the validation
set

Fine-tune hyperparameters

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Step 1: overfit the training set
Modify the training loop to overfit a single data point (batch size of 1)

features, labels = next(iter(trainloader))

for i in range(1e3):
outputs = model(features)
loss = criterion(outputs, labels)
loss.backward()
optimizer.step()

should reach 1.0 accuracy and 0 loss

helps findings bugs in the code

Goal: minimize the training loss

create large enough model

use a default learning rate

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Step 2: reduce overfitting
Goal: maximize the validation accuracy
Experiment with:
Dropout

Data augmentation

Weight decay

Reducing model capacity

Keep track of each hyperparameter and

report maximum validation accuracy

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Step 2: reduce overfitting
Original model overfitting the Model with too much regularization
training data

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Step 3: fine-tune hyperparameters
Grid search Random search

for factor in range(2, 6): factor = np.random.uniform(2, 6)

lr = 10 ** -factor lr = 10 ** -factor

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH
Wrap-up video
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Senior Data Scientist
Summary
Chapter 1 Chapter 3
Discovered deep learning Manipulated the architecture of a neural
network
Created small neural networks
Played with learning rate and momentum
Discovered linear layers and activation
functions Learned about transfer learning

Chapter 2 Chapter 4
Created and used loss functions Learned about dataloaders

Calculated derivatives and use Reduced overfitting

backpropagation Evaluated model performance
Trained a neural network

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Next steps
Course
Intermediate Deep Learning with
PyTorch

Learn
Probability and statistics

Linear algebra

Calculus

Practice
Pick a dataset on Kaggle
Check out DataCamp workspace
Train a neural network

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Let's practice!
INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Pytorch Cheatsheet EN
No ratings yet
Pytorch Cheatsheet EN
1 page
Chapter 4
No ratings yet
Chapter 4
34 pages
Activation Functions: Ismail Elezi
No ratings yet
Activation Functions: Ismail Elezi
30 pages
chapter2 (1)
No ratings yet
chapter2 (1)
35 pages
Chapter 1
No ratings yet
Chapter 1
37 pages
Chapter 2
No ratings yet
Chapter 2
35 pages
Chapter1 Intro
No ratings yet
Chapter1 Intro
35 pages
chapter3 (1)
No ratings yet
chapter3 (1)
26 pages
chapter1 (1)
No ratings yet
chapter1 (1)
50 pages
Chapter 3
No ratings yet
Chapter 3
26 pages
Module02 PyTorch
No ratings yet
Module02 PyTorch
36 pages
PyTorch Workflow Fundamentals - Zero To Mastery Learn PyTorch For Deep Learning
No ratings yet
PyTorch Workflow Fundamentals - Zero To Mastery Learn PyTorch For Deep Learning
43 pages
Pytorch Tutorial: Narges Honarvar Nazari January 30
No ratings yet
Pytorch Tutorial: Narges Honarvar Nazari January 30
29 pages
PyTorch Crash Course 1713016363
No ratings yet
PyTorch Crash Course 1713016363
15 pages
Py Torch
No ratings yet
Py Torch
786 pages
vertopal.com_PyTorch_CrashCourse
No ratings yet
vertopal.com_PyTorch_CrashCourse
16 pages
Lab 2-Image-Classification-Using-NNs
No ratings yet
Lab 2-Image-Classification-Using-NNs
6 pages
Deep Learning Lab: How To Train Your First Neural Network
No ratings yet
Deep Learning Lab: How To Train Your First Neural Network
68 pages
Intro To PyTorch and Neural Networks - Intro To PyTorch and Neural Networks Cheatsheet - Codecademy
No ratings yet
Intro To PyTorch and Neural Networks - Intro To PyTorch and Neural Networks Cheatsheet - Codecademy
8 pages
CIFAR_10_ Dataset_Using_CNN_Aniiiii_HTML
No ratings yet
CIFAR_10_ Dataset_Using_CNN_Aniiiii_HTML
8 pages
Lab 9
No ratings yet
Lab 9
29 pages
PyTorch_Guide_With_Code
No ratings yet
PyTorch_Guide_With_Code
4 pages
PyTorch - A Comprehensive Overview
No ratings yet
PyTorch - A Comprehensive Overview
7 pages
Deep Learning With PyTorch 1
No ratings yet
Deep Learning With PyTorch 1
1 page
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
10 pages
Pytorch Tutorial 1
No ratings yet
Pytorch Tutorial 1
48 pages
Pytorch Neural Networks Guide 1717173717
No ratings yet
Pytorch Neural Networks Guide 1717173717
17 pages
Pytorch 101: Deep Learning PHD Course 2017/2018
No ratings yet
Pytorch 101: Deep Learning PHD Course 2017/2018
19 pages
Stars 4 0 0 0 + Forks 7 0 0 + License MIT
No ratings yet
Stars 4 0 0 0 + Forks 7 0 0 + License MIT
19 pages
Building Deep Learning Models Using the PyTorch Library
No ratings yet
Building Deep Learning Models Using the PyTorch Library
4 pages
Fibercablelength Understanding
No ratings yet
Fibercablelength Understanding
5 pages
PyTorch_CrashCourse
No ratings yet
PyTorch_CrashCourse
17 pages
Steps To Use Pytorch
No ratings yet
Steps To Use Pytorch
2 pages
PyTorch Workflow Fundamentals
No ratings yet
PyTorch Workflow Fundamentals
1 page
Pytorch Tutorial 1 Rev 1
No ratings yet
Pytorch Tutorial 1 Rev 1
48 pages
(Deep Learning Using PyTorch) (Cheatsheet)
No ratings yet
(Deep Learning Using PyTorch) (Cheatsheet)
7 pages
2c PyTorch4
No ratings yet
2c PyTorch4
4 pages
Deep Learning With PyTorch Guide For Beginners and Intermediate
100% (7)
Deep Learning With PyTorch Guide For Beginners and Intermediate
120 pages
Notebook - Agave Plant Maturation Model Inference and Testing
No ratings yet
Notebook - Agave Plant Maturation Model Inference and Testing
7 pages
Pytorch A Detailed Overview Agladze Mikhail instant download
No ratings yet
Pytorch A Detailed Overview Agladze Mikhail instant download
82 pages
Day 45 PyTorch Presentation
No ratings yet
Day 45 PyTorch Presentation
67 pages
DL 1 - ComputerVision With PyTorch Notes
No ratings yet
DL 1 - ComputerVision With PyTorch Notes
304 pages
Week_7_-mnist-mlp
No ratings yet
Week_7_-mnist-mlp
7 pages
DIP Lab 10
No ratings yet
DIP Lab 10
11 pages
PyTorch Made Easy A Quick Overview
No ratings yet
PyTorch Made Easy A Quick Overview
55 pages
Deep Learning With PyTorch
No ratings yet
Deep Learning With PyTorch
19 pages
PyTorch 1 - 0 - Bringing Research and Production Together Presentation
No ratings yet
PyTorch 1 - 0 - Bringing Research and Production Together Presentation
108 pages
Py Torch
No ratings yet
Py Torch
19 pages
Harvard CS197 Lecture 6 & 7 Notes
No ratings yet
Harvard CS197 Lecture 6 & 7 Notes
18 pages
یادگیری پایتورچ
No ratings yet
یادگیری پایتورچ
30 pages
Pytorch
No ratings yet
Pytorch
38 pages
EE769 Assignment 3
No ratings yet
EE769 Assignment 3
1 page
NN From Scratch
No ratings yet
NN From Scratch
5 pages
bldd_VIT_ResNet50v2_CustomCNN
No ratings yet
bldd_VIT_ResNet50v2_CustomCNN
38 pages
About PyTorch
No ratings yet
About PyTorch
2 pages
Assignment 3 DS5620
No ratings yet
Assignment 3 DS5620
11 pages
Intro To Deep Learning With TensorFlow - Introduction To TensorFlow Cheatsheet - Codecademy
No ratings yet
Intro To Deep Learning With TensorFlow - Introduction To TensorFlow Cheatsheet - Codecademy
8 pages
PyTorch PDF
No ratings yet
PyTorch PDF
72 pages
The Supervised Learning Workshop - Second Edition: A New, Interactive Approach to Understanding Supervised Learning Algorithms, 2nd Edition
From Everand
The Supervised Learning Workshop - Second Edition: A New, Interactive Approach to Understanding Supervised Learning Algorithms, 2nd Edition
Blaine Bateman
No ratings yet
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Adele - Hello Worksheet
100% (1)
Adele - Hello Worksheet
3 pages
In The Matter of The Adoption of Stephanie Nathy Astorga Garcia
No ratings yet
In The Matter of The Adoption of Stephanie Nathy Astorga Garcia
1 page
Final Test Discuss
No ratings yet
Final Test Discuss
10 pages
Ict LP
No ratings yet
Ict LP
2 pages
Civics and Community Engagement Lecture 1
No ratings yet
Civics and Community Engagement Lecture 1
11 pages
Writing PhD-DBA-DBL Proposal Guidelines 22.10.2009
No ratings yet
Writing PhD-DBA-DBL Proposal Guidelines 22.10.2009
1 page
What Is Music Therapy?: Where Do Music Therapists Work?
No ratings yet
What Is Music Therapy?: Where Do Music Therapists Work?
15 pages
Organizational Culture
No ratings yet
Organizational Culture
5 pages
ILS Assessment
No ratings yet
ILS Assessment
12 pages
TWI Training Exam Results
100% (1)
TWI Training Exam Results
1 page
DrArti Tyagi CV_241116_094835
No ratings yet
DrArti Tyagi CV_241116_094835
6 pages
Understanding The Four Main Communication Styles in The Workplace
No ratings yet
Understanding The Four Main Communication Styles in The Workplace
5 pages
Final Report PTA
No ratings yet
Final Report PTA
14 pages
Qualitative Research Methods: Olasile Babatunde Adedoyin
No ratings yet
Qualitative Research Methods: Olasile Babatunde Adedoyin
8 pages
Physical Properties Of Liquid Crystals Nematics Dunmur David Fukuda pdf download
100% (2)
Physical Properties Of Liquid Crystals Nematics Dunmur David Fukuda pdf download
79 pages
Willing To Be Disturbed
No ratings yet
Willing To Be Disturbed
4 pages
DLL MATATAG_MATHEMATICS 1_Q2_W6
No ratings yet
DLL MATATAG_MATHEMATICS 1_Q2_W6
11 pages
Chapter 2
No ratings yet
Chapter 2
14 pages
Manthan Volume 3
No ratings yet
Manthan Volume 3
49 pages
CLIL Unit 1 Technology PDF
No ratings yet
CLIL Unit 1 Technology PDF
2 pages
Constructivist Lesson Plan
No ratings yet
Constructivist Lesson Plan
6 pages
The Rise and Fall - of The Digital Natives
100% (1)
The Rise and Fall - of The Digital Natives
21 pages
Nptel: Reading Poetry - Web Course
No ratings yet
Nptel: Reading Poetry - Web Course
4 pages
WORK_IMMERSION_PORTFOLIO-dicnhs-tvl mica
No ratings yet
WORK_IMMERSION_PORTFOLIO-dicnhs-tvl mica
55 pages
Lab Report Bazil - TONG LIP ZHONG
No ratings yet
Lab Report Bazil - TONG LIP ZHONG
8 pages
2021 Wts 12 Physcs t1 Durban Camp-1
No ratings yet
2021 Wts 12 Physcs t1 Durban Camp-1
74 pages
Matatag Demo LP - TLE7
No ratings yet
Matatag Demo LP - TLE7
6 pages
Instant Ebooks Textbook Researching Audio Description: New Approaches 1st Edition Anna Matamala Download All Chapters
100% (4)
Instant Ebooks Textbook Researching Audio Description: New Approaches 1st Edition Anna Matamala Download All Chapters
62 pages
Bibliographic Record
No ratings yet
Bibliographic Record
3 pages
Problem Set 2 2022
No ratings yet
Problem Set 2 2022
4 pages

chapter4 (1)

Uploaded by

chapter4 (1)

Uploaded by

A deeper dive into

Maham Faisal Khan

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

# Instantiate dataset class

# Access an individual sample

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Percent of data Role

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

# Create accuracy metric using torch metrics

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Reducing model size or adding dropout layer

Using weight decay to force parameters to remain small

Obtaining new data or augmenting data

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

model = nn.Sequential(nn.Linear(8, 4),

tensor([[1.4655, 0.0000, 0.0000, 0.8456]], grad_fn=<MulBackward0>)

Dropout is added after the activation function

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Optimizer's weight_decay parameter takes values between zero and one

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

sets a performance baseline

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

features, labels = next(iter(trainloader))

should reach 1.0 accuracy and 0 loss

helps findings bugs in the code

Goal: minimize the training loss

use a default learning rate

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Reducing model capacity

Keep track of each hyperparameter and

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

for factor in range(2, 6): factor = np.random.uniform(2, 6)

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

Maham Faisal Khan

Calculated derivatives and use Reduced overfitting

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

INTRODUCTION TO DEEP LEARNING WITH PYTORCH

You might also like