Deep Learning Basics Lecture 1 Feedforward
Lecture 1: Feedforward
Princeton University COS 495
Instructor: Yingyu Liang
Motivation I: representation learning
Machine learning 1-2-3
Pipeline: input 𝑥 → extract features 𝜙(𝑥) (e.g., color histogram) → build hypothesis 𝑦 = 𝑤ᵀ𝜙(𝑥) (linear model)
Example: Polynomial kernel SVM
Inputs 𝑥1, 𝑥2 → 𝑦 = sign(𝑤ᵀ𝜙(𝑥) + 𝑏), with a fixed feature map 𝜙(𝑥)
Motivation: representation learning
• Why don’t we also learn 𝜙(𝑥)?
Diagram: 𝑥 → learn 𝜙(𝑥) → learn 𝑤 → 𝑦 = 𝑤ᵀ𝜙(𝑥)
Feedforward networks
• View each dimension of 𝜙(𝑥) as something to be learned
Diagram: 𝑥 → 𝜙(𝑥) → 𝑦 = 𝑤ᵀ𝜙(𝑥)
Feedforward networks
• Linear functions 𝜙𝑖(𝑥) = 𝜃𝑖ᵀ𝑥 don’t work: need some nonlinearity (a linear 𝜙 followed by a linear output is still just a linear function of 𝑥, so nothing is gained)
Diagram: 𝑥 → 𝜙(𝑥) → 𝑦 = 𝑤ᵀ𝜙(𝑥)
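A minimal numerical check of this point (illustrative NumPy sketch; the matrices here are arbitrary, not from the lecture):

```python
import numpy as np

rng = np.random.default_rng(0)
Theta = rng.normal(size=(3, 5))   # "feature" weights: phi(x) = Theta @ x
w = rng.normal(size=3)            # output weights
x = rng.normal(size=5)

# Two stacked linear layers ...
y_two_layer = w @ (Theta @ x)

# ... collapse into one linear map with weights w_eff = Theta^T w
w_eff = Theta.T @ w
y_one_layer = w_eff @ x

print(np.isclose(y_two_layer, y_one_layer))  # True: no extra expressive power
```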
Feedforward networks
• Typically, set 𝜙𝑖(𝑥) = 𝑟(𝜃𝑖ᵀ𝑥) where 𝑟(⋅) is some nonlinear function
Diagram: 𝑥 → 𝜙(𝑥) → 𝑦 = 𝑤ᵀ𝜙(𝑥)
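A minimal sketch of this one-hidden-layer computation in NumPy, assuming 𝑟 is the ReLU used later in the lecture (dimensions and values are illustrative):

```python
import numpy as np

def relu(z):
    # r(z) = max(z, 0), applied elementwise
    return np.maximum(z, 0.0)

rng = np.random.default_rng(0)
d, k = 5, 4                      # input dim, number of hidden units
Theta = rng.normal(size=(k, d))  # row i holds theta_i
w = rng.normal(size=k)
x = rng.normal(size=d)

phi = relu(Theta @ x)            # phi_i(x) = r(theta_i^T x)
y = w @ phi                      # y = w^T phi(x)
print(y)
```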
Feedforward deep networks
• What if we go deeper?
Diagram: 𝑥 → ℎ1 → ℎ2 → ⋯ → ℎ𝐿 → 𝑦
(Figure from Deep Learning, by Goodfellow, Bengio, and Courville. Dark boxes are things to be learned.)
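A sketch of the deeper forward pass, again with ReLU hidden layers and a linear output (all sizes and weights here are illustrative, not the lecture's):

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

rng = np.random.default_rng(0)
sizes = [5, 8, 8, 8, 1]          # input dim, three hidden layers, scalar output
Ws = [rng.normal(size=(m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
bs = [rng.normal(size=m) for m in sizes[1:]]

h = rng.normal(size=sizes[0])    # h^0 = x
for W, b in zip(Ws[:-1], bs[:-1]):
    h = relu(W @ h + b)          # h^l = r(W h^{l-1} + b)
y = Ws[-1] @ h + bs[-1]          # linear output layer
print(y)
```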
Motivation II: neurons
Motivation: neurons
Figure from Wikipedia
Motivation: abstract neuron model
• A neuron is activated when the correlation between the input and a pattern 𝜃 exceeds some threshold 𝑏
• 𝑦 = threshold(𝜃ᵀ𝑥 − 𝑏), or 𝑦 = 𝑟(𝜃ᵀ𝑥 − 𝑏)
• 𝑟(⋅) is called the activation function
Diagram: inputs 𝑥1, 𝑥2, …, 𝑥𝑑 → output 𝑦
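A tiny sketch of this abstract neuron, using a hard threshold for 𝑟 (the numbers are made up for illustration):

```python
import numpy as np

def neuron(x, theta, b):
    # Fires (outputs 1) when the correlation theta^T x exceeds the threshold b
    return 1.0 if theta @ x - b > 0 else 0.0

theta = np.array([0.5, -0.2, 0.8])
x = np.array([1.0, 0.0, 1.0])
print(neuron(x, theta, b=1.0))   # 0.5 + 0.8 = 1.3 > 1.0, so the neuron fires: 1.0
```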
Motivation: artificial neural networks
• Put into layers: feedforward deep networks
Diagram: 𝑥 → ℎ1 → ℎ2 → ⋯ → ℎ𝐿 → 𝑦
Components in feedforward networks
Components
• Representations:
• Input
• Hidden variables
• Layers/weights:
• Hidden layers
• Output layer
Components
Diagram: 𝑥 → first layer → hidden layers ℎ → output layer → 𝑦
Output layers
Output layer
• Multi-dimensional regression: 𝑦 = 𝑊ᵀℎ + 𝑏
• Linear units: no nonlinearity
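A minimal sketch of such a linear output layer on top of a hidden representation ℎ (shapes are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
h = rng.normal(size=8)           # hidden representation from the previous layer
W = rng.normal(size=(8, 3))      # 3-dimensional regression target
b = rng.normal(size=3)

y = W.T @ h + b                  # y = W^T h + b, no nonlinearity
print(y.shape)                   # (3,)
```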
Output layers
Output layer
• Binary classification: 𝑦 = 𝜎(𝑤ᵀℎ + 𝑏)
• Corresponds to using logistic regression on ℎ
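A sketch of this sigmoid output unit, where 𝜎 is the logistic function (weights and ℎ are illustrative):

```python
import numpy as np

def sigmoid(z):
    # logistic function sigma(z) = 1 / (1 + exp(-z))
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
h = rng.normal(size=8)
w = rng.normal(size=8)
b = 0.1

y = sigmoid(w @ h + b)           # probability of the positive class
print(y)
```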
Output layers
Output layer
• Multi-class classification: 𝑦 = softmax(𝑧) where 𝑧 = 𝑊ᵀℎ + 𝑏
• Corresponds to using multi-class logistic regression on ℎ
Diagram: ℎ → 𝑧 → 𝑦
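A sketch of the softmax output layer (with the usual max-subtraction for numerical stability; shapes are illustrative):

```python
import numpy as np

def softmax(z):
    # Subtract the max for numerical stability; the result sums to 1
    e = np.exp(z - np.max(z))
    return e / e.sum()

rng = np.random.default_rng(0)
h = rng.normal(size=8)
W = rng.normal(size=(8, 4))      # 4 classes
b = rng.normal(size=4)

z = W.T @ h + b                  # class scores (logits)
y = softmax(z)                   # class probabilities
print(y, y.sum())                # probabilities sum to 1
```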
Hidden layers
• Each neuron takes a weighted linear combination of the previous layer
• So we can think of it as outputting one value for the next layer
Diagram: layer ℎ𝑖 → layer ℎ𝑖+1
Hidden layers
• 𝑦 = 𝑟(𝑤ᵀ𝑥 + 𝑏)
Plot of the activation 𝑟(⋅) mapping 𝑥 to 𝑦; regions where the gradient is 0 or too small are marked.
Hidden layers
• Generalizations of ReLU: gReLU(𝑧) = max{𝑧, 0} + 𝛼 min{𝑧, 0}
• Leaky-ReLU(𝑧) = max{𝑧, 0} + 0.01 min{𝑧, 0}
• Parametric-ReLU(𝑧): 𝛼 learnable
Plot of gReLU(𝑧)
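A sketch of these ReLU variants (the function names are illustrative; in the parametric version 𝛼 would be learned along with the other weights):

```python
import numpy as np

def g_relu(z, alpha):
    # gReLU(z) = max{z, 0} + alpha * min{z, 0}
    return np.maximum(z, 0.0) + alpha * np.minimum(z, 0.0)

def leaky_relu(z):
    # fixed small slope 0.01 on the negative side
    return g_relu(z, alpha=0.01)

# Parametric ReLU: same formula, but alpha is a learnable parameter
alpha = 0.25                      # would be updated by gradient descent during training
z = np.array([-2.0, -0.5, 0.0, 1.5])
print(g_relu(z, 0.0))             # plain ReLU
print(leaky_relu(z))              # Leaky-ReLU
print(g_relu(z, alpha))           # Parametric-ReLU with the current alpha
```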