The document discusses machine learning classifiers and the need to make them robust against adversarial attacks from malicious inputs. It presents attack methods that can fool classifiers, such as the Fast Gradient Sign Method, and distinguishes between white box attacks, which use a model's parameters, and black box attacks, which do not require access to the model. The goal is to generate adversarial examples that are misclassified while remaining close to the original inputs.


Attack and Defense

Hung-yi Lee

Source of image: http://www.fafa01.com/post865806


Motivation
• We seek to deploy machine learning classifiers not only in the lab, but also in the real world.
• Classifiers that are robust to noise and work "most of the time" are not sufficient. (Being strong is not enough.)
• We want classifiers that are robust to inputs deliberately built to fool them. (They must withstand malice from humans.)
• This is especially important for spam classification, malware detection, network intrusion detection, etc.
Attack

https://www.darksword-armory.com/wp-content/uploads/2014/09/two-handed-danish-sword-medieval-weapon-1352-3.jpg
What do we want to do?

Start from an original image $x^0$, which the network classifies correctly (e.g. "Tiger Cat" with confidence 0.64). Add a small perturbation $\Delta x$ to obtain the attacked image

$x' = x^0 + \Delta x, \qquad \Delta x = (\Delta x_1, \Delta x_2, \Delta x_3, \ldots),$

so that the network classifies $x'$ as something else.
Loss Function for Attack

Let $y^0 = f_\theta(x^0)$ be the network output on the original image; it is close to the true label $y^{true}$ (e.g. "cat"). Let $y' = f_\theta(x')$ be the output on the attacked image; we want it far from $y^{true}$ and, for a targeted attack, close to a chosen false label $y^{false}$ (e.g. "fish").

• Training: $L_{train}(\theta) = C(y^0, y^{true})$, with $x$ fixed
• Non-targeted Attack: $L(x') = -C(y', y^{true})$, with $\theta$ fixed
• Targeted Attack: $L(x') = -C(y', y^{true}) + C(y', y^{false})$
• Constraint: $d(x^0, x') \le \varepsilon$ (so that the change is not noticed)

(The two attack losses are sketched in code below.)
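As a concrete illustration, here is a minimal PyTorch-style sketch of these losses. It is a sketch under assumptions, not part of the original slides: `f_theta` is assumed to be any classifier that maps an image batch to logits, and the cross-entropy $C$ is taken over class indices; all names are illustrative.

```python
# Minimal sketch of the attack losses above (assumes a PyTorch classifier
# `f_theta` that maps an image batch to logits; names are illustrative).
import torch
import torch.nn.functional as F

def attack_loss(f_theta, x_adv, true_label, target_label=None):
    """Non-targeted: L(x') = -C(y', y_true).
    Targeted:       L(x') = -C(y', y_true) + C(y', y_false)."""
    logits = f_theta(x_adv)                           # y' = f_theta(x')
    loss = -F.cross_entropy(logits, true_label)       # push y' away from y_true
    if target_label is not None:
        loss = loss + F.cross_entropy(logits, target_label)  # pull y' toward y_false
    return loss
```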
Constraint

The constraint $d(x^0, x') \le \varepsilon$ is expressed through the perturbation

$x' - x^0 = \Delta x = (\Delta x_1, \Delta x_2, \Delta x_3, \ldots).$

• L2-norm: $d(x^0, x') = \|x^0 - x'\|_2 = \sqrt{(\Delta x_1)^2 + (\Delta x_2)^2 + (\Delta x_3)^2 + \cdots}$
• L-infinity: $d(x^0, x') = \|x^0 - x'\|_\infty = \max\{|\Delta x_1|, |\Delta x_2|, |\Delta x_3|, \cdots\}$

Changing every pixel a little bit and changing one pixel a lot can give the same L2 distance, but the former has a small L-infinity distance while the latter has a large one. (Both distances are sketched in code below.)
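A small sketch of the two distance measures, assuming `x0` and `x_adv` are image tensors of the same shape; the function names are illustrative.

```python
# Sketch of the two constraints on the perturbation Δx = x' - x0.
import torch

def l2_distance(x0, x_adv):
    # sqrt((Δx1)^2 + (Δx2)^2 + ...): many small pixel changes add up
    return torch.norm((x_adv - x0).flatten(), p=2)

def linf_distance(x0, x_adv):
    # max(|Δx1|, |Δx2|, ...): only the largest single-pixel change matters
    return (x_adv - x0).abs().max()
```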
How to Attack

$x^* = \arg\min_{d(x^0, x') \le \varepsilon} L(x')$

This is just like training a neural network, except that the network parameters $\theta$ are fixed and the variable being optimized is the input $x'$.

• Gradient Descent (modified version):
  • Start from the original image $x^0$
  • For $t = 1$ to $T$:
    • $x^t \leftarrow x^{t-1} - \eta \nabla L(x^{t-1})$
    • If $d(x^0, x^t) > \varepsilon$, then $x^t \leftarrow fix(x^t)$

where $\nabla L(x) = \left[\partial L(x)/\partial x_1,\; \partial L(x)/\partial x_2,\; \partial L(x)/\partial x_3,\; \ldots\right]^T$.
The projection step:

• def $fix(x^t)$: among all $x$ satisfying $d(x^0, x) \le \varepsilon$, return the one closest to $x^t$.
• L2-norm: the feasible set is an $\varepsilon$-ball around $x^0$; an update that leaves the ball is pulled back to the nearest point on its surface.
• L-infinity: the feasible set is a box of half-width $\varepsilon$ around $x^0$; an update that leaves the box is clipped back into it, coordinate by coordinate.

(A code sketch of the full procedure follows.)
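A minimal sketch of this procedure under the L-infinity constraint, reusing the `attack_loss` sketch from earlier; `f_theta`, the step size, the budget $\varepsilon$, and the pixel range [0, 1] are all assumptions rather than values from the original slides.

```python
# Sketch: gradient descent on the input x' with theta fixed, followed by
# fix(x^t), which here is the L-infinity projection back into the eps-box.
import torch

def iterative_attack(f_theta, x0, true_label, target_label=None,
                     eps=8/255, step_size=1/255, steps=40):
    x_adv = x0.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = attack_loss(f_theta, x_adv, true_label, target_label)
        grad, = torch.autograd.grad(loss, x_adv)
        x_adv = x_adv.detach() - step_size * grad        # x^t <- x^{t-1} - eta * grad L
        # fix(x^t): clip each pixel so that |x^t - x0| <= eps ...
        x_adv = torch.max(torch.min(x_adv, x0 + eps), x0 - eps)
        x_adv = x_adv.clamp(0.0, 1.0)                    # ... and keep a valid image
    return x_adv.detach()
```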
Example

$L(x') = -C(y', y^{true}) + C(y', y^{false})$, with $f$ = ResNet-50, true class = tiger cat, false (target) class = star fish.

• Original image: classified as tiger cat with confidence 0.64.
• Attacked image: classified as star fish with confidence 1.00.
• The attacked image looks identical to the original; only after magnifying their difference by 50× does the perturbation become visible.
Example

The same attack with false class = keyboard: the original image (tiger cat, 0.64) becomes keyboard with confidence 0.98.
What happened?

• Adding random noise to $x^0$ hardly changes the result: moderate noise keeps the prediction among similar classes (tiger cat, tabby cat, Persian cat), and only very large noise pushes it to something unrelated (e.g. fire screen). Along a random direction from $x^0$, the region where $y_{tiger\,cat}$ is high is wide, and its neighbours are other cat classes (e.g. Egyptian cat, Persian cat).
• Along certain specific directions, however, the high-confidence region of the true class is very narrow, and a small step takes $x^0$ into a region where an unrelated class (e.g. keyboard) scores high. The attack finds exactly such a direction.
Attack Approaches
• FGSM (https://arxiv.org/abs/1412.6572)
• Basic iterative method (https://arxiv.org/abs/1607.02533)
• L-BFGS (https://arxiv.org/abs/1312.6199)
• Deepfool (https://arxiv.org/abs/1511.04599)
• JSMA (https://arxiv.org/abs/1511.07528)
• C&W (https://arxiv.org/abs/1608.04644)
• Elastic net attack (https://arxiv.org/abs/1709.04114)
• Spatially Transformed (https://arxiv.org/abs/1801.02612)
• One Pixel Attack (https://arxiv.org/abs/1710.08864)
• …… (only a few are listed here)
https://sites.google.com/site/pkms20152a17/_/rsrc/1448428701742/home/125986076.jpg

Attack Approaches

The many attack methods differ mainly in the optimization method and in the constraint used for

$x^* = \arg\min_{d(x^0, x') \le \varepsilon} L(x')$

• Fast Gradient Sign Method (FGSM):

$x^* \leftarrow x^0 - \varepsilon \Delta x, \qquad \Delta x = \left[\mathrm{sign}(\partial L/\partial x_1),\; \mathrm{sign}(\partial L/\partial x_2),\; \mathrm{sign}(\partial L/\partial x_3),\; \ldots\right]^T$

Each component of $\Delta x$ is only +1 or −1. FGSM therefore acts like a single gradient step with a very large learning rate under the L-infinity constraint: however small or large the gradient is, the step would leave the $\varepsilon$-box around $x^0$, and clipping it back lands $x^*$ on a corner of the box. (FGSM is sketched in code below.)
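A minimal FGSM sketch under the same assumptions as before (PyTorch classifier returning logits, `attack_loss` from the earlier sketch); the value of $\varepsilon$ and the [0, 1] pixel range are illustrative.

```python
# Sketch of FGSM: a single signed-gradient step of size eps on the input.
import torch

def fgsm(f_theta, x0, true_label, target_label=None, eps=8/255):
    x0 = x0.clone().detach().requires_grad_(True)
    loss = attack_loss(f_theta, x0, true_label, target_label)
    grad, = torch.autograd.grad(loss, x0)
    x_adv = x0.detach() - eps * grad.sign()   # Δx has only +1 / -1 entries
    return x_adv.clamp(0.0, 1.0)
```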
White Box vs. Black Box
• In the previous attack, we fix the network parameters $\theta$ to find the optimal $x'$.
• To attack, we therefore need to know the network parameters $\theta$; this is called a White Box Attack.
• Are we safe if we do not release the model? ☺
  • You cannot obtain the model parameters of most online APIs.
• No, because a Black Box Attack is still possible. 
Black Box Attack
• If you have the training data of the target network, train a proxy network yourself on that data.
• Otherwise, obtain input-output pairs by querying the target network and train the proxy network on those pairs.
• Use the proxy network (which you can attack white-box) to generate attacked objects; these often fool the black-box target as well. (See the sketch below.)

https://arxiv.org/pdf/1611.02770.pdf
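A sketch of the proxy idea; `query_black_box` (the target's prediction API), `ProxyNet`, and the data loader are hypothetical placeholders, not names from the slides or the cited paper.

```python
# Sketch: fit a proxy to the black-box target from input-output pairs,
# then run a white-box attack (e.g. the fgsm sketch above) on the proxy.
import torch
import torch.nn.functional as F

def train_proxy(proxy, query_black_box, data_loader, epochs=5, lr=1e-3):
    opt = torch.optim.Adam(proxy.parameters(), lr=lr)
    for _ in range(epochs):
        for x in data_loader:
            with torch.no_grad():
                y = query_black_box(x).argmax(dim=1)   # labels from the target network
            opt.zero_grad()
            loss = F.cross_entropy(proxy(x), y)        # make the proxy imitate the target
            loss.backward()
            opt.step()
    return proxy

# Usage (hypothetical): attack the trained proxy white-box, then send the
# resulting x_adv to the black-box target and hope it transfers.
# proxy = train_proxy(ProxyNet(), query_black_box, data_loader)
# x_adv = fgsm(proxy, x0, true_label)
```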
Universal Adversarial Attack
• https://arxiv.org/abs/1610.08401
• A single, image-independent perturbation can fool the network on most inputs; a black box attack is also possible!
Adversarial Reprogramming
• https://arxiv.org/abs/1806.11146
• Gamaleldin F. Elsayed, Ian Goodfellow, Jascha Sohl-Dickstein, "Adversarial Reprogramming of Neural Networks", ICLR, 2019
Attack in the Real World
• Black Box Attack
• https://www.youtube.com/watch?v=zQ_uMenoBCk&feature=youtu.be
• https://www.cs.cmu.edu/~sbhagava/papers/face-rec-ccs16.pdf

Attack in the Real World

1. An attacker would need to find perturbations that generalize beyond a single image.
2. Extreme differences between adjacent pixels in the perturbation are unlikely to be accurately captured by cameras.
3. It is desirable to craft perturbations that are comprised mostly of colors reproducible by the printer.

https://arxiv.org/abs/1707.08945
Beyond Images
• You can attack audio
• https://nicholas.carlini.com/code/audio_adversarial_examples/
• https://adversarial-attacks.net

• You can attack text

https://arxiv.org/pdf/1707.07328.pdf
Defense

http://3png.com/a-27051273.html
Defense
• Adversarial attacks cannot be defended against simply by weight regularization, dropout, or model ensembles.
• Two types of defense:
  • Passive defense: find the attacked image without modifying the model (a special case of anomaly detection).
  • Proactive defense: train a model that is robust to adversarial attack.
Passive Defense

Put a filter (e.g. smoothing) in front of the network without modifying the model. The filter should barely influence the classification of the original image, while making the attack signal less harmful. (A sketch follows below.)

Example (smoothing):
• Original image: tiger cat 0.64 → after smoothing, still tiger cat (0.45).
• Attacked image: keyboard 0.98 → after smoothing, back to tiger cat (0.37).
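A sketch of the filtering step; Gaussian blur via torchvision is just one possible smoothing filter, and the kernel size is an illustrative choice, not one taken from the slides.

```python
# Sketch of passive defense: smooth the input before the unmodified classifier.
import torchvision.transforms.functional as TF

def smoothed_predict(f_theta, x):
    x_smoothed = TF.gaussian_blur(x, kernel_size=3)  # blunts the high-frequency attack signal
    return f_theta(x_smoothed)                       # the model itself is unchanged
```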
Passive Defense
• Feature Squeezing
  https://arxiv.org/abs/1704.01155
• Randomization at the Inference Phase
  https://arxiv.org/abs/1711.01991
Proactive Defense

The idea: find the vulnerabilities and patch them.

Given training data $X = \{(x^1, \hat{y}^1), (x^2, \hat{y}^2), \ldots, (x^N, \hat{y}^N)\}$:

• Use $X$ to train your model.
• For $t = 1$ to $T$:
  • For $n = 1$ to $N$: find an adversarial input $\tilde{x}^n$ for $x^n$ using an attack algorithm (find the vulnerability).
  • This gives new training data $X' = \{(\tilde{x}^1, \hat{y}^1), (\tilde{x}^2, \hat{y}^2), \ldots, (\tilde{x}^N, \hat{y}^N)\}$ (data augmentation).
  • Use $X'$ to update your model (patch the hole).

The adversarial examples are different in each iteration, which is why the loop is repeated $T$ times. If the adversarial inputs are found with attack algorithm A, this method stops algorithm A but may still be vulnerable to algorithm B. (A sketch of the loop follows.)
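A sketch of a common per-batch variant of this loop, reusing the `iterative_attack` sketch as the attack algorithm; the model, optimizer, loader, and hyperparameters are all illustrative assumptions.

```python
# Sketch of proactive defense: repeatedly find adversarial examples for the
# current model (the "vulnerability") and train on them (the "patch").
import torch.nn.functional as F

def adversarial_training(model, train_loader, optimizer, T=10, eps=8/255):
    for _ in range(T):                                       # outer loop: t = 1 .. T
        for x, y in train_loader:
            x_adv = iterative_attack(model, x, y, eps=eps)   # x~ found by the attack algorithm
            optimizer.zero_grad()
            loss = F.cross_entropy(model(x_adv), y)          # update on the (x~, y) pairs
            loss.backward()
            optimizer.step()
    return model
```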


Concluding Remarks
• Attack: given the network parameters, attacking is very easy.
• Even black box attacks are possible.
• Defense: Passive & Proactive
• Future: Adaptive Attack / Defense

https://www.gotrip.hk/179304/weekend_lifestyle/pokemon-go_%E7%B2%BE%E9%9D%88%E9%80%B2%E5%8C%96/
To learn more …
• Reference
• https://adversarial-ml-tutorial.org/ (Zico Kolter and Aleksander Madry)
• Adversarial Attack Toolboxes:
• https://github.com/bethgelab/foolbox
• https://github.com/IBM/adversarial-robustness-toolbox
• https://github.com/tensorflow/cleverhans
