0 Introduction
0 Introduction
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, M. Nau, S. Jaganathan, C. Liu, N. Maul, L. Folle,
K. Packhäuser, M. Zinnen
Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg
April 15, 2024
Who are we? - Lab Members
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 1
Who are we? - Student Members
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 3
Outline
Motivation
Perceptron
Organizational Matters
Motivation
NVIDIA Stock Market
Source: https://www.google.com/finance/quote/
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 4
The Big Bang of Deep Learning
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 5
ImageNet Large Scale Visual Recognition Challenge
16.4
11.7 Residual Network
6.7
5.1 3.6 3 2.4
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 6
ImageNet Large Scale Visual Recognition Challenge
16.4
11.7 Residual Network
6.7
5.1 3.6 3 2.4
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 6
ImageNet Large Scale Visual Recognition Challenge
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 7
Deep Learning Users
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 8
Playing Go
Source: https://commons.wikimedia.org/wiki/File:FloorGoban.jpg
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 9
Playing Go
Source: https://commons.wikimedia.org/wiki/File:FloorGoban.jpg
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 9
Playing Go
Source: https://commons.wikimedia.org/wiki/File:FloorGoban.jpg
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 9
Playing Go
Source: https://commons.wikimedia.org/wiki/File:FloorGoban.jpg
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 9
Google DeepDream
Attempt to understand the inner workings of the network: What it "dreams" about
when presented with images
Idea:
• Arbitrary image or noise as input
• Instead of adjusting network
parameters, tweak image towards
high activations
• Different layers enhance different
features (low or high level)
Source: https://research.googleblog.com
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 10
Google DeepDream
Source: https://research.googleblog.com
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 11
Google DeepDream
Source: https://research.googleblog.com
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 11
Google DeepDream
Source: https://research.googleblog.com
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 12
Real-Time Object Detection: YOLO, YOLO9000, YOLOv3 [11]–[13]
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 13
Every Day Use
Siri
Source: www.apple.com/ios/siri/
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 14
Google Echo & Amazon Alexa Voice Service
Source: www.amazon.com
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 15
Google Translate
Source: translate.google.de
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction April 15, 2024 16
Introduction - Part 2
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, M. Nau, S. Jaganathan, C. Liu, N. Maul, L. Folle,
K. Packhäuser, M. Zinnen
Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg
April 15, 2024
Research at the Pattern Recognition Lab
Assisted and Automated Driving
Goal
Find new ways to train and update deep learning mechanisms in environments with
high safety requirements
Source: Audi AG
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 18
Assisted and Automated Driving
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 19
Assisted and Automated Driving
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 19
Smart Devices
Problem statement
Renewable energy power 6= energy demand
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 20
Smart Devices
Goal
Establish energy equilibrium by predicting energy consumption
• Example: Interrupt fridge cooling cycle when price is high, start washing
machine when price is low
• Dependencies between tasks, user information and action necessary (e.g.,
washer/dryer)
• Task: Identify time-shiftable loads and assess appropriate time frame
• Approach: Train recurrent neural networks to identify usage patterns and
dependencies between devices
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 21
Cloud Detection for Power Forecast [4]
Goal
Power forecast for solar power plants with a high temporal and spatial resolution
Approach
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 22
Cloud Detection for Power Forecast [4]
...
...
...
b
g
r
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 23
Writer Recognition
Goal
Writer identification with limited training data (few pages per writer)
..
.
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 24
Writer Recognition using CNN Activation Features [6]
Activation features
ut r1
Inp ye r2
La
Classification layer
ye K
La ye
r
Hidden layer
La
...
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 25
Medical Applications
Cell Classification for Tumor Diagnostics [3]
Goal
Identify cells undergoing mitosis to asses tumor proliferation and aggressiveness in
histological images
Challenge
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 26
Cell Classification for Tumor Diagnostics [3]
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 27
Cell Classification for Tumor Diagnostics [3]
Approach
Use spatial transformer networks (STNs) to learn affine transformation and
classification
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 28
Defect Pixel Interpolation
Goal
• Reconstruction of coronaries based on truncated X-ray images
• Create “virtual” digital subtraction angiography
Approach
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 29
Defect Pixel Interpolation
Processing pipeline
Subtraction
Segmentation
algorithm
Masking
Inpainting algorithm
Binary mask Virtual mask image
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 30
Defect Pixel Interpolation
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 31
Organ Search [7]
Goal
Approach
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 32
Organ Search [7]
Brainstem
missing!
109.7%
3 Aortic Arch 100%
Bifurcations
Sternum Tip
Interpolate
Right Kidney Left Kidney
Right Hip-bone 0%
Left Hip-bone
Extrapolate
Right Knee - 31.4%
missing!
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 33
Organ Search [7]
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 34
X-ray-transform Invariant Anatomical Landmark Detection
Goal
• Detect landmarks in X-ray images
• Knowing correspondences enables symbolic reconstruction
: Classic computervision reconstruction
Challenge
• Transmission imaging
: Overlap/superposition of structures
: High variance due to projection
: Artifacts e.g. interventional devices
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 35
X-ray-transform Invariant Anatomical Landmark Detection
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 36
X-ray-transform Invariant Anatomical Landmark Detection
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 37
Organ Prediction
Goal
Estimation of body and organ shapes based on patient’s height and weight for X-ray
exposure estimation.
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 38
Organ Prediction
Goal
Estimation of body and organ shapes based on patient’s height and weight for X-ray
exposure estimation.
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 39
Action Learning for 3D Point Cloud Based Organ Segmentation
• Robustness w.r.t.
1. Individual anatomy
2. Scan protocols
• Time constraints
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 40
Action Learning for 3D Point Cloud Based Organ Segmentation
• Reinforcement learning
• Predict the transformation at given state
Figure: Action prediction pipeline for 3D point cloud based organ segmentation
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 41
Action Learning for 3D Point Cloud Based Organ Segmentation
• Runtime:
1. 0.3 - 2.6s per volume
2. 50 - 100 speedup from U-net [5]
• Very accurate
• Robust to:
1. scan protocol
2. contrast agent
3. organ initialization
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 2 April 15, 2024 42
Introduction - Part 3
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, M. Nau, S. Jaganathan, C. Liu, N. Maul, L. Folle,
K. Packhäuser, M. Zinnen
Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg
April 15, 2024
Limitations
Image Captioning
Figure: "baseball player is Figure: "girl in pink dress is Figure: "man in black shirt is
throwing ball in game." jumping in air." playing guitar."
Source: http://cs.stanford.edu/people/karpathy/deepimagesent
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 44
Image Captioning & Generating
Source: https://twitter.com/maier_ak/status/1713563211574751494
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 45
Image Captioning & Generating
Source: https://twitter.com/maier_ak/status/1713563211574751494
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 46
Image Captioning & Generating
Figure: "pixel version 1." Figure: "pixel version 2." Figure: "pixel version 3."
Source: https://twitter.com/maier_ak/status/1713563211574751494
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 47
Image Captioning & Generating
To DALL-E3, "Show the scene in the style of a massive online roleplaying game"
Figure: "warcraft version 1." Figure: "warcraft version 2." Figure: "warcraft version 3."
Source: https://twitter.com/maier_ak/status/1713563211574751494
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 48
Image Captioning & Generating
To DALL-E3, "Make a few versions that show the scene in Lego Style"
Figure: "lego version 1." Figure: "lego version 2." Figure: "lego version 3."
Source: https://twitter.com/maier_ak/status/1713563211574751494
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 49
Challenges with Training Data
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 50
Challenges with Training Data
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 50
Generating Synthetic Data
Figure: four chest X-ray images sampled from a trained latent diffusion model. Image generation was done in a
conditional way to produce images of specific abnormality classes. The induced abnormality patterns in the
synthetic images are highlighted with red arrows and circles.
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 51
Memorization problem for diffusion models
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 52
Challenges with Trust and Reliability
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 53
Challenges with Trust and Reliability
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 53
Large Language Models for MRI Scanners
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 54
Large Language Models for MRI Scanners
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 54
Future Directions
Learning of Algorithms
• Computed Tomography
• Efficient solution via filtered back-projection:
Z π
f (x , y ) = p(s, θ) ∗ h(s)|s=x cos θ+y sin θ d θ
0
• Three steps:
• Convolution along s
• Back-projection along θ
• Suppress negative values
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 55
Reconstruction Networks
projection
reconstruction
sinogram
loss
func-
tion
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 56
Reconstruction Networks
reconstruction
sinogram
loss
func-
tion
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 57
Application to Incomplete Scans [18]
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 58
Application to Incomplete Scans [18]
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 59
Application to Incomplete Scans [18]
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 60
Application to Incomplete Scans [18]
0 1.0
100 0.8
normalized intensity
200 0.6
height [px]
300 0.4
400
0.2
reference fr
limited angle fl
our model fm
500
0.0
0 100 200 300 400 500 0 100 200 300 400 500
width [px] position [px]
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 61
Parker Weights
2.0
1.5
weight
1.0
0.5
0.0
0
π
4
ga
ntr
π
2
y rot
3π 0
ati
4 100
300 200
on
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 62
Parker Weights
2.0
1.5
weight
1.0
0.5
0.0
0
π
4
ga
ntr
π
2
y rot
3π 0
ati
4 100
300 200
on
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 62
Further Extensions
yVt−1
N + yVt N
λt (yVt N − yN N )
- -Σ
ρ01 ,t
k1 ,t k̄1 ,t
+
...
...
...
Σ
ρ0N ,t +
kN ,t k̄N ,t
Wcos Wcomp C B Ψ(·)
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 63
Further Extensions
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 3 April 15, 2024 64
Introduction - Part 4
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, M. Nau, S. Jaganathan, C. Liu, N. Maul, L. Folle,
K. Packhäuser, M. Zinnen
Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg
April 15, 2024
Machine Learning and Pattern Recognition
Terminology and Notation
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 66
Terminology and Notation
The notation and the terminology will be further developed throughout the lecture.
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 66
“Classical” Image Processing Pipeline
Classification phase
f feature c Ωκ
recording preprocessing classification
extraction
ω
Learning phase training
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 67
“Classical” Image Processing Pipeline
Classification phase
f feature c Ωκ
recording preprocessing classification
extraction
ω
Learning phase training
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 67
“Classical” Image Processing Pipeline
Classification phase
f feature c Ωκ
recording preprocessing classification
extraction
ω
Learning phase training
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 67
“Classical” Image Processing Pipeline: Apple vs. Pears
Source: https://commons.wikimedia.org
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 68
“Classical” Image Processing Pipeline: Apple vs. Pears
Source: https://commons.wikimedia.org
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 68
“Classical” Image Processing Pipeline: Apple vs. Pears
Source: https://commons.wikimedia.org
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 68
“Classical” Image Processing Pipeline: Apple vs. Pears
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 69
“Classical” Image Processing Pipeline: Apple vs. Pears
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 69
Pipeline in Deep Learning
Source: https://xkcd.com/1838/
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 70
Pipeline in Deep Learning
Reminder
feature
measurement preprocessing classification
extraction
training
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 71
Pipeline in Deep Learning
Now
representation
measurement learning
engine
training
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 71
Postulates for Pattern Recognition
6 Postulates:
ω = {1 f(x), . . . , N f(x)} ⊆ Ω.
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 72
Postulates for Pattern Recognition
6 Postulates:
ω = {1 f(x), . . . , N f(x)} ⊆ Ω.
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 72
Postulates for Pattern Recognition (cont.)
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 73
Postulates for Pattern Recognition (cont.)
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 74
Postulates for Pattern Recognition (cont.)
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 74
Postulates for Pattern Recognition (cont.)
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 74
Perceptron
Perceptron Biology - Neural Excitation (simplified)
Source: https://commons.wikimedia.org
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 75
Rosenblatt’s Perceptron
ŷ = sign(w| x), x2 w2
P
where .. .. Activation
. . function
w = (w0 , . . . , wn ): set of weights
(w0 =bias) xn wn
x = (1, x1 , . . . , xn ): input feature
vector inputs weights
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 76
Perceptron Objective Function
Task: Find weights that minimize the distance of misclassified samples to the
decision boundary
Assumptions
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 77
Perceptron Objective Function – Observations
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 78
Perceptron Training
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 4 April 15, 2024 79
Introduction - Part 5
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, M. Nau, S. Jaganathan, C. Liu, N. Maul, L. Folle,
K. Packhäuser, M. Zinnen
Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg
April 15, 2024
Organizational Matters
Grading
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 5 April 15, 2024 81
Exercise Content
• Python introduction
• Developing a neural network framework from scratch
• Feed Forward Neural Networks
• Convolutional Neural Networks
• Regularization
• Recurrent Networks
• Using the PyTorch framework
• Large scale classification
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 5 April 15, 2024 82
Exercise Requirements
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 5 April 15, 2024 83
How it works
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 5 April 15, 2024 84
Summary
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 5 April 15, 2024 85
Next Lecture Block
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 5 April 15, 2024 87
Comprehensive Questions
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 5 April 15, 2024 88
Further Reading
A. Maier, V. Christlein, K. Breininger, Z. Yang, L. Rist, A. Barnhill | Introduction - Part 5 April 15, 2024 89
Questions?
References
References I
[1] David Silver, Julian Schrittwieser, Karen Simonyan, et al. “Mastering the
game of go without human knowledge”. In: Nature 550.7676 (2017), p. 354.
[2] David Silver, Thomas Hubert, Julian Schrittwieser, et al. “Mastering Chess
and Shogi by Self-Play with a General Reinforcement Learning Algorithm”. In:
arXiv preprint arXiv:1712.01815 (2017).
[3] M. Aubreville, M. Krappmann, C. Bertram, et al. “A Guided Spatial
Transformer Network for Histology Cell Differentiation”. In: ArXiv e-prints (July
2017). arXiv: 1707.08525 [cs.CV].
[4] David Bernecker, Christian Riess, Elli Angelopoulou, et al. “Continuous
short-term irradiance forecasts using sky images”. In: Solar Energy 110
(2014), pp. 303–315.
References II
[5] Patrick Ferdinand Christ, Mohamed Ezzeldin A Elshaer, Florian Ettlinger, et al.
“Automatic liver and lesion segmentation in CT using cas-
caded fully convolutional neural networks and 3D conditional random fields”. In:
International Conference on Medical Image Computing and Computer-Assisted
Springer. 2016, pp. 415–423.
[6] Vincent Christlein, David Bernecker, Florian Hönig, et al. “Writer Identification
Using GMM Supervectors and Exemplar-SVMs”. In: Pattern Recognition 63
(2017), pp. 258–267.
[7] Florin Cristian Ghesu, Bogdan Georgescu, Tommaso Mansi, et al. “An
Artificial Agent for Anatomical Landmark Detection in Medical Images”. In:
Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016
Athens, 2016, pp. 229–237.
References III
[8] Jia Deng, Wei Dong, Richard Socher, et al. “Imagenet: A large-scale
hierarchical image database”. In:
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference
IEEE. 2009, pp. 248–255.
[9] A. Karpathy and L. Fei-Fei. “Deep Visual-Semantic Alignments for Generating
Image Descriptions”. In: ArXiv e-prints (Dec. 2014). arXiv: 1412.2306
[cs.CV].
[10] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. “ImageNet
Classification with Deep Convolutional Neural Networks”. In:
Advances in Neural Information Processing Systems 25. Curran Associates,
Inc., 2012, pp. 1097–1105.
References IV
[11] Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, et al. “You Only
Look Once: Unified, Real-Time Object Detection”. In: CoRR abs/1506.02640
(2015).
[12] J. Redmon and A. Farhadi. “YOLO9000: Better, Faster, Stronger”. In:
ArXiv e-prints (Dec. 2016). arXiv: 1612.08242 [cs.CV].
[13] Joseph Redmon and Ali Farhadi. “YOLOv3: An Incremental Improvement”. In:
arXiv (2018).
[14] Frank Rosenblatt. The Perceptron–a perceiving and recognizing automaton.
85-460-1. Cornell Aeronautical Laboratory, 1957.
[15] Olga Russakovsky, Jia Deng, Hao Su, et al. “ImageNet Large Scale Visual
Recognition Challenge”. In: International Journal of Computer Vision 115.3
(2015), pp. 211–252.
References V
[16] David Silver, Aja Huang, Chris J. Maddison, et al. “Mastering the game of Go
with deep neural networks and tree search”. In: Nature 529.7587 (Jan. 2016),
pp. 484–489.
[17] S. E. Wei, V. Ramakrishna, T. Kanade, et al. “Convolutional Pose Machines”.
In: CVPR. 2016, pp. 4724–4732.
[18] Tobias Würfl, Florin C Ghesu, Vincent Christlein, et al. “Deep learning
computed tomography”. In:
International Conference on Medical Image Computing and Computer-Assisted
Springer International Publishing. 2016, pp. 432–440.