Security and Privacy in Machine Learning
Nicolas Papernot
Pennsylvania State University & Google Brain
Lecture for Prof. Trent Jaeger’s CSE 543 Computer Security Class
Patrick McDaniel (Penn State)
Martín Abadi (Google Brain), Pieter Abbeel (Berkeley), Michael Backes (CISPA), Dan Boneh (Stanford), Z. Berkay Celik (Penn State), Yan Duan (OpenAI), Úlfar Erlingsson (Google Brain), Matt Fredrikson (CMU), Ian Goodfellow (Google Brain), Kathrin Grosse (CISPA), Sandy Huang (Berkeley), Somesh Jha (U of Wisconsin), Alexey Kurakin (Google Brain), Praveen Manoharan (CISPA), Ilya Mironov (Google Brain), Ananth Raghunathan (Google Brain), Arunesh Sinha (U of Michigan), Shuang Song (UCSD), Ananthram Swami (US ARL), Kunal Talwar (Google Brain), Florian Tramèr (Stanford), Michael Wellman (U of Michigan), Xi Wu (Google)
Machine Learning Classifier
[Figure: at inference, the classifier maps an input to a vector of class probabilities, e.g. [0.01, 0.84, 0.02, 0.01, 0.01, 0.01, 0.05, 0.01, 0.03, 0.01].]

Machine Learning Classifier
[Figure: at training, each input is paired with a one-hot label vector, e.g. [0 1 0 0 0 0 0 0 0 0].]
Outline of this lecture
1 Security in ML
2 Privacy in ML
Part I
Attack Models
Attacker may see the model: it is bad even if the attacker needs to know the details of the machine learning model to mount an attack --- a white-box attacker.
Attacker may not need the model: it is worse if an attacker who knows very little (e.g., one who only gets to ask a few questions) can mount an attack --- a black-box attacker.
Papernot et al. Towards the Science of Security and Privacy in Machine Learning
Adversarial examples (white-box attacks)
Jacobian-based Saliency Map Approach (JSMA)
Papernot et al. The Limitations of Deep Learning in Adversarial Settings
Jacobian-Based Iterative Approach: source-target misclassification
Papernot et al. The Limitations of Deep Learning in Adversarial Settings
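A minimal NumPy sketch of the saliency-map idea behind JSMA for source-target misclassification: perturb first the features that raise the target class score while lowering all others. The `predict` and `jacobian` callables are assumed model helpers, and the paper defines its saliency map over pairs of features; this single-feature variant only illustrates the approach.

```python
import numpy as np

def saliency_map(jac, target):
    """Saliency of each input feature for pushing the model toward `target`.

    jac: (num_classes, num_features) Jacobian of the class scores w.r.t. the input.
    """
    alpha = jac[target]                # effect of each feature on the target class
    beta = jac.sum(axis=0) - alpha     # combined effect on all other classes
    # A feature is salient if increasing it raises the target score (alpha > 0)
    # while lowering the other scores (beta < 0).
    mask = (alpha > 0) & (beta < 0)
    return np.where(mask, alpha * np.abs(beta), 0.0)

def jsma(x, target, predict, jacobian, theta=1.0, max_features=20):
    """Greedily perturb the most salient features until `target` is predicted.

    predict(x) -> predicted class; jacobian(x) -> (num_classes, num_features) array.
    Both are assumed helpers exposing the model under attack.
    """
    x_adv = x.copy()
    for _ in range(max_features):
        if predict(x_adv) == target:
            break
        scores = saliency_map(jacobian(x_adv), target)
        i = int(np.argmax(scores))
        if scores[i] == 0:             # no single feature helps any more
            break
        x_adv[i] = np.clip(x_adv[i] + theta, 0.0, 1.0)
    return x_adv
```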
Evading a Neural Network Malware Classifier
Add constraints to the JSMA approach:
- only add features: keep malware behavior
- only features from the manifest: easy to modify
Before perturbation: P[X=Malware] = 0.90, P[X=Benign] = 0.10
After perturbation: P[X*=Malware] = 0.10, P[X*=Benign] = 0.90
Grosse et al. Adversarial Perturbations Against Deep Neural Networks for Malware Classification
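In the malware setting only the admissible perturbations change: features may only be added (never removed, so the malware behavior is preserved) and only manifest features are considered. A hedged sketch of that filter on top of the saliency scores above; `manifest_mask` is an assumed boolean array over the (binary) feature vector.

```python
import numpy as np

def admissible_feature(scores, x, manifest_mask):
    """Restrict the JSMA choice to *adding* manifest features.

    scores: saliency scores from saliency_map() above;
    x: current binary feature vector of the application;
    manifest_mask: assumed boolean array marking manifest-derived features.
    """
    allowed = manifest_mask & (x == 0)          # only add features, never remove any
    constrained = np.where(allowed, scores, 0.0)
    i = int(np.argmax(constrained))
    return i if constrained[i] > 0 else None    # None: no admissible feature left
```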
Supervised vs. reinforcement learning
Model inputs:
- Supervised learning: an observation (e.g., traffic sign, music, email)
- Reinforcement learning: an environment and a reward function
Model outputs:
- Supervised learning: a class (e.g., stop/yield, jazz/classical, spam/legitimate)
- Reinforcement learning: an action
Training "goal":
- Supervised learning: minimize class prediction error (i.e., cost/loss) over pairs of (inputs, outputs)
- Reinforcement learning: maximize reward by exploring the environment and taking actions
Adversarial attacks on neural network policies
Huang et al. Adversarial Attacks on Neural Network Policies
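Huang et al. perturb the observations fed to a trained policy with the fast gradient sign method (FGSM). A minimal NumPy sketch, assuming the attacker can compute the gradient of a suitable loss with respect to the observation; `grad_loss_wrt_obs` and `policy.act` in the usage comment are hypothetical helpers, not part of any particular library.

```python
import numpy as np

def fgsm_on_observation(obs, obs_grad, epsilon=0.005):
    """Fast gradient sign method on a policy's input observation.

    obs_grad: gradient of a suitable loss (e.g., the negative log-probability of
    the policy's currently preferred action) with respect to `obs`. Under an
    L-infinity budget of `epsilon`, the worst-case first-order step is its sign.
    """
    return np.clip(obs + epsilon * np.sign(obs_grad), 0.0, 1.0)

# Illustrative use at every timestep (hypothetical helpers):
#   adv_obs = fgsm_on_observation(obs, grad_loss_wrt_obs(policy, obs))
#   action = policy.act(adv_obs)   # the agent now acts on the perturbed frame
```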
Adversarial examples (black-box attacks)
Threat model of a black-box attack
Adversarial capabilities: the adversary has no access to the training data, the model architecture, the model parameters, or the model scores --- only (limited) oracle access to labels for inputs of its choice.
Our approach to black-box attacks
Adversarial example transferability
Adversarial examples have a transferability property: an example crafted to be misclassified by one model is often also misclassified by a different model trained for the same task.
Szegedy et al. Intriguing properties of neural networks
Cross-technique transferability
Transferability also holds across ML techniques: examples crafted against a deep neural network often transfer to models such as logistic regression, support vector machines, decision trees, and nearest neighbors (and vice versa).
Papernot et al. Transferability in Machine Learning: from Phenomena to Black-Box Attacks using Adversarial Samples
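A sketch of how cross-technique transferability can be measured empirically: craft adversarial examples against one model family only, then count how often they also fool another. The scikit-learn models and the crafting step producing `x_adv` are illustrative assumptions, not the exact setup of the cited paper.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.neural_network import MLPClassifier

def transfer_rate(x_adv, y_true, source_model, target_model):
    """Fraction of adversarial examples crafted on `source_model` that also
    fool `target_model`, measured over examples that fooled the source."""
    fools_source = source_model.predict(x_adv) != y_true
    fools_target = target_model.predict(x_adv) != y_true
    return fools_target[fools_source].mean() if fools_source.any() else 0.0

# Illustrative usage: train two different model families on the same data,
# craft x_adv against the neural network only, then measure the transfer:
#   mlp = MLPClassifier(hidden_layer_sizes=(64,)).fit(x_train, y_train)
#   logreg = LogisticRegression(max_iter=1000).fit(x_train, y_train)
#   print(transfer_rate(x_adv, y_test, source_model=mlp, target_model=logreg))
```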
Our approach to black-box attacks
Adversarial example transferability from a substitute model to the target model
Attacking remotely hosted black-box models
(1) The adversary queries the remote ML system for labels on inputs of its choice.
(2) The adversary uses this labeled data to train a local substitute for the remote system.
(3) The adversary selects new synthetic inputs for queries to the remote ML system, based on the local substitute's output surface sensitivity to input variations.
(4) The adversary then uses the local substitute to craft adversarial examples, which are misclassified by the remote ML system because of transferability.
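A condensed sketch of steps (1)-(4) above, under assumed interfaces: `remote_label(x)` stands for a query to the remote API, and `substitute` is any local model exposing `fit` and `jacobian`; the augmentation follows the Jacobian-based heuristic described in step (3).

```python
import numpy as np

def train_substitute(seed_inputs, remote_label, substitute, rounds=5, lam=0.1):
    """Substitute training with Jacobian-based dataset augmentation.

    remote_label(x) -> label returned by the remote black-box system (step 1);
    substitute: local model exposing fit(X, y) and jacobian(x, label).
    """
    X = np.array(seed_inputs, dtype=float)
    for _ in range(rounds):
        y = np.array([remote_label(x) for x in X])        # (1) query the oracle
        substitute.fit(X, y)                              # (2) imitate it locally
        # (3) add synthetic points in the directions the substitute's output is
        #     most sensitive to, to better map the remote decision boundary
        X_new = np.array([
            np.clip(x + lam * np.sign(substitute.jacobian(x, label)), 0.0, 1.0)
            for x, label in zip(X, y)
        ])
        X = np.concatenate([X, X_new])
    return substitute

# (4) Adversarial examples crafted on the substitute (e.g., with FGSM or JSMA)
# are then sent to the remote system, which misclassifies them by transferability.
```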
Our approach to black-box attacks
Adversarial example transferability from a substitute model to the target model + synthetic data generation
Results on real-world remote systems
[Table: for each remote platform, the ML technique it uses, the number of queries made, and the rate of adversarial examples misclassified (after querying).]
All remote classifiers are trained on the MNIST dataset (10 classes, 60,000 training samples).
[PMG16a] Papernot et al. Practical Black-Box Attacks against Deep Learning Systems using Adversarial Examples
Benchmarking progress in the adversarial ML community
Growing community
1.3K+ stars
340+ forks
40+ contributors
Adversarial examples represent worst-case distribution drifts
[DDS04] Dalvi et al. Adversarial Classification (KDD)
Adversarial examples are a tangible instance of hypothetical AI safety problems
Image source: http://www.nerdist.com/wp-content/uploads/2013/07/Space-Odyssey-4.jpg
Part II
Types of adversaries and our threat model
Black-box: the adversary can only query the model (model querying, black-box adversary).
Shokri et al. (2016) Membership Inference Attacks against ML Models
Fredrikson et al. (2015) Model Inversion Attacks
[Figure: a randomized algorithm run on two neighbouring datasets produces answers (Answer 1, Answer 2, ..., Answer n) whose distributions the adversary cannot tell apart.]
Our design goals
Teacher ensemble
[Figure: the sensitive data is split into n disjoint partitions; Partition 1 trains Teacher 1, Partition 2 trains Teacher 2, ..., Partition n trains Teacher n.]
Intuitive privacy analysis
If most teachers agree on a label, that label does not depend on any single partition, and therefore reveals little about any single training example.
Noisy aggregation
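A minimal sketch of the noisy aggregation mechanism: count the teachers' votes, perturb each count with Laplace noise of scale 1/γ, and return the noisy argmax. Each teacher is assumed to expose a `predict(x)` method returning a class index.

```python
import numpy as np

def noisy_aggregate(teachers, x, num_classes, gamma=0.05, rng=None):
    """Noisy argmax over the teachers' votes.

    Laplace noise of scale 1/gamma is added to every vote count so that no
    single teacher (hence no single data partition) can change the returned
    label with high probability.
    """
    rng = np.random.default_rng() if rng is None else rng
    votes = np.zeros(num_classes)
    for teacher in teachers:
        votes[teacher.predict(x)] += 1          # each teacher votes for one class
    votes += rng.laplace(loc=0.0, scale=1.0 / gamma, size=num_classes)
    return int(np.argmax(votes))
```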
Teacher ensemble
[Figure: the teachers' predictions are combined by an Aggregated Teacher (noisy aggregation). A Student is then trained on Public Data labeled by sending Queries to the Aggregated Teacher.]
Student training
Not available to the adversary: the sensitive data, its partitions, the teachers, and the aggregated teacher.
Available to the adversary: the public data, the student, and the student's answers to inference queries.
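A sketch of the student step under the same assumed interfaces: the noisy aggregated teacher labels a limited number of public inputs, and only the student trained on those labels is ever exposed to inference queries. The PATE papers train the student semi-supervised; plain supervised fitting is used here to keep the sketch short.

```python
def train_student(student, public_data, teachers, num_classes, num_queries=1000):
    """Train the student on public inputs labeled by the noisy aggregate.

    Only a limited number of queries is made (each one spends privacy budget);
    the sensitive partitions and the teachers stay behind the privacy barrier,
    and the adversary only ever interacts with the trained student.
    """
    X = public_data[:num_queries]
    y = [noisy_aggregate(teachers, x, num_classes) for x in X]
    student.fit(X, y)                           # student: any model with fit/predict
    return student
```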
Differential privacy analysis
Differential privacy: a randomized algorithm M satisfies (ε, δ)-differential privacy if, for all pairs of neighbouring datasets (d, d′) and for all subsets S of outputs:
Pr[M(d) ∈ S] ≤ e^ε · Pr[M(d′) ∈ S] + δ
Experimental results
Experimental setup
[Table: for each dataset, the teacher model and the student model used.]
.../models/tree/master/differential_privacy/multiple_teachers
Aggregated teacher accuracy
Trade-off between student accuracy and privacy
UCI Diabetes: (ε, δ) = (1.44, 10^-5)
Non-private baseline: 93.81%
Student accuracy: 93.94%
Synergy between privacy and generalization
Some online resources:
www.papernot.fr
@NicolasPapernot
Gradient masking
Tramèr et al. Ensemble Adversarial Training: Attacks and Defenses