Deep Learning
I Introduction
Computer Vision: The goal is to make machines learn to see in a way similar to how humans do, which is why Machine Learning methods are needed to get there. CV is central to robotics: the robot has to understand its environment and what is happening around it. A lot of image and video processing is also done with CV.
Early researchers tried to construct a significant part of a visual system, and this is when the term pattern recognition was coined.
CV is also a core element of other areas such as robotics, NLP, optics and image processing, algorithm optimization, neuroscience, AI, and ML.
Previously, DL was not used for image classification; it only became popular later. Earlier pipelines did preprocessing (e.g. normalizing the colors of images), followed by a feature descriptor, which functioned somewhat like the Hubel-Wiesel experiment in the sense that certain properties, such as the exact position of edges, were not important.
Common feature descriptors are HAAR, HOG, SIFT, and SURF. These descriptors had to be hand-engineered, since most are gradient based. On top of them sit classifiers such as SVMs, random forests, or ANNs, which aggregate the features and output the label.
Instead of feature extraction plus aggregation, we have a magic box that does both for us: deep learning. We no longer have to hand-engineer the feature descriptors; instead, we let the dataset decide what the best possible descriptor is, i.e. the one that gives the best results.
Image Classification Issues:
• Occlusion
• Background clutter: background and foreground (object) have similar colors
• Representation: e.g. a cat drawing vs. a cat photo
The history started around 1940 with the electronic brain. Each cell responds to a certain pattern; the cells accumulate weighted impulses and eventually make a decision.
Around 1960 came the perceptron. Instead of fixed weights, the weights could now be learned: we show the system a couple of examples and hope to learn the parameters of these perceptrons, i.e. the feature-extraction weights and the decision threshold. This was all hardwired.
Then came Adaline (the golden age of deep learning). There was a lot of hype and progress being made. Then, in 1969, people realized the problems with perceptrons, specifically the XOR problem: a linear model (a single perceptron) cannot separate the two classes. The era that followed was called the AI winter.
In 1986, the multi-layer perceptron came to light. Several layers can be trained, i.e. the weights of the multi-layer perceptron are optimized with backpropagation, a gradient-based method.
In 1995 came the SVM. Since it was so successful, it put a halt to deep learning research.
In 2006, Hinton and Ruslan Salakhutdinov developed Deep Belief Networks, and the idea of pretraining came around: you train a neural network and then train it again for a specific task. Pretraining is still one of the most relevant ideas today (for example, transfer learning with ImageNet weights). Despite this, neural networks were still not a mainstream method.
In 2012, the AlexNet architecture (see Section X.2) was the first neural-network-based architecture to win the ImageNet competition, based on the lowest top-5 error.
Definition of top-5 error: give the method an image, ask it which class the image belongs to, and check whether the top five predictions include the correct class.
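As a minimal sketch (assuming NumPy arrays; the function name top5_error and its arguments are illustrative), the metric could be computed like this:

import numpy as np

def top5_error(scores, labels):
    # scores: (N, C) class scores per image; labels: (N,) true class indices
    top5 = np.argsort(scores, axis=1)[:, -5:]          # five highest-scoring classes per image
    correct = np.any(top5 == labels[:, None], axis=1)  # true class among the top five?
    return 1.0 - correct.mean()                        # fraction of images missed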
Several factors explain why this worked in 2012:
• Big Data: models have much more data to learn from today than back then, and the datasets are also available online.
• Better Hardware: not only has the data changed, the hardware has changed as well (e.g. GPUs). Hardware originally developed for rendering images in games is now also used for deep learning, to train models faster.
• Models have become more complex.
Typical applications of deep learning include:
• Object Detection
• Self-Driving Cars
• Gaming (e.g. AlphaGo, AlphaStar)
• Machine Translation
• Automated Text Generation (ChatBots)
• Healthcare (e.g. cancer detection)
Unsupervised Learning: learning from unlabeled data, i.e. without target labels.
Supervised Learning: learning from labeled examples. An underlying assumption is that train and test data come from the same distribution.
Nearest Neighbor Model: a supervised learning method that labels a sample with the majority label of its k nearest neighboring samples. The hyper-parameters to tune in KNN are k and the distance metric (L1 or L2).
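A minimal sketch of this prediction rule, assuming NumPy arrays of training samples and labels (all names are illustrative):

import numpy as np

def knn_predict(X_train, y_train, x_query, k=3, metric="l2"):
    # Label x_query with the majority label of its k nearest training samples.
    diff = X_train - x_query
    if metric == "l1":
        dists = np.abs(diff).sum(axis=1)              # L1 (Manhattan) distance
    else:
        dists = np.sqrt((diff ** 2).sum(axis=1))      # L2 (Euclidean) distance
    nearest = np.argsort(dists)[:k]                   # indices of the k closest samples
    labels, counts = np.unique(y_train[nearest], return_counts=True)
    return labels[np.argmax(counts)]                  # majority vote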
Cross Validation: split the data into K folds, train on K−1 folds and validate on the held-out fold, rotating through all folds; this is used to choose hyper-parameters such as k.
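A small sketch of how such a K-fold split could be generated (NumPy; the names and the shuffling step are illustrative choices):

import numpy as np

def kfold_indices(n_samples, k=5, seed=0):
    # Yield (train_idx, val_idx) pairs for K-fold cross validation.
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n_samples)                  # shuffle before splitting
    folds = np.array_split(idx, k)
    for i in range(k):
        val_idx = folds[i]                                     # held-out fold
        train_idx = np.concatenate(folds[:i] + folds[i + 1:])  # remaining K-1 folds
        yield train_idx, val_idx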
Decision boundaries are the boundaries that separate the data into different classes.
Pros and cons of linear decision boundaries: they are simple and cheap to learn, but (as with the XOR problem above) they cannot separate classes that are not linearly separable.
Linear Regression is a supervised learning method that finds a linear model that explains a target $y$ given inputs $x$ with weights $\theta$:
$$\hat{y}_i = \sum_{j=1}^{d} x_{ij}\,\theta_j$$
With an explicit bias term $\theta_0$:
$$\hat{y}_i = \theta_0 + \sum_{j=1}^{d} x_{ij}\,\theta_j \;\Longrightarrow\; \hat{y} = X\theta$$
$x_{ij}$ are the features; $\theta$ are the weights (model parameters); $\theta_0$ is the bias.
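A small sketch of this model in matrix form (NumPy; names are illustrative), where a column of ones is prepended so that the bias $\theta_0$ becomes part of $\theta$:

import numpy as np

def predict(X, theta):
    # Linear model y_hat = X_b @ theta, with a prepended column of ones
    # so that theta[0] plays the role of the bias theta_0.
    X_b = np.hstack([np.ones((X.shape[0], 1)), X])
    return X_b @ theta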
Mean squared error:
$$J(\theta) = \frac{1}{n}\sum_{i=1}^{n} (\hat{y}_i - y_i)^2 = \frac{1}{n}\sum_{i=1}^{n} (x_i\theta - y_i)^2$$
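Continuing the sketch above, the mean squared error and a least-squares fit could look as follows; solving the normal equations here is an assumption about how the "least squares estimate" referred to below was obtained:

import numpy as np

def mse(theta, X_b, y):
    # J(theta) = (1/n) * sum_i (x_i theta - y_i)^2, with X_b already containing a bias column
    residuals = X_b @ theta - y
    return np.mean(residuals ** 2)

def fit_least_squares(X_b, y):
    # Least-squares estimate via the normal equations (assumes X_b has full column rank)
    return np.linalg.solve(X_b.T @ X_b, X_b.T @ y)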
Maximum likelihood estimation (MLE): find the parameter values that maximize the likelihood of the observations given the parameters.
MLE assumes that the training samples are independent and generated by the same distribu-
tion.
What shape does our probability distribution have? Assuming a Gaussian distribution:
$$y_i = \mathcal{N}(x_i\theta, \sigma^2) = x_i\theta + \mathcal{N}(0, \sigma^2)$$
$$p(y_i \mid x_i, \theta) = \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{1}{2\sigma^2}(y_i - x_i\theta)^2}$$
So the MLE is the same as the least squares estimate we found previously.
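As a short sketch of why this holds: the negative log-likelihood of the Gaussian model above is
$$-\sum_{i=1}^{n} \log p(y_i \mid x_i, \theta) = \frac{n}{2}\log(2\pi\sigma^2) + \frac{1}{2\sigma^2}\sum_{i=1}^{n} (y_i - x_i\theta)^2,$$
and since the first term and the factor $\frac{1}{2\sigma^2}$ do not depend on $\theta$, maximizing the likelihood is the same as minimizing $\sum_{i=1}^{n}(y_i - x_i\theta)^2$, i.e. the least-squares objective $J(\theta)$.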
Sigmoid function:
$$\sigma(x) = \frac{1}{1 + e^{-x}}$$
The model output is the sigmoid of the linear score:
$$\hat{y}_i = \sigma(x_i\theta)$$
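A minimal sketch of the sigmoid and the resulting prediction (NumPy; names are illustrative):

import numpy as np

def sigmoid(z):
    # sigma(z) = 1 / (1 + exp(-z))
    return 1.0 / (1.0 + np.exp(-z))

def predict_proba(X, theta):
    # Model output y_hat_i = sigma(x_i theta), i.e. a probability in (0, 1)
    return sigmoid(X @ theta)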
Cross-entropy loss for a single sample with $C$ classes (one-hot targets $y_{i,j}$):
$$L(\hat{y}_i, y_i) = \sum_{j=1}^{C} y_{i,j} \log \hat{y}_{i,j}$$
Cost function (mean of losses for all $n$ samples):
$$C(\theta) = -\frac{1}{n}\sum_{i=1}^{n} L(\hat{y}_i, y_i)$$
This is optimized via gradient descent (there is no closed-form solution).
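A minimal sketch of this optimization for the binary (two-class) special case, using the sigmoid from above; the gradient formula $X^\top(\hat{y} - y)/n$ is the standard one for this cost, and all names are illustrative:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cross_entropy_cost(theta, X, y):
    # Mean cross-entropy for binary labels y in {0, 1}
    y_hat = sigmoid(X @ theta)
    return -np.mean(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

def gradient_descent(X, y, lr=0.1, steps=1000):
    # Plain gradient descent; for this cost the gradient is X^T (y_hat - y) / n
    theta = np.zeros(X.shape[1])
    for _ in range(steps):
        y_hat = sigmoid(X @ theta)
        theta -= lr * (X.T @ (y_hat - y)) / len(y)
    return theta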