
Fairness in Machine Learning - Overview

Arun Rajkumar
Dept of DSAI, IITM

Disclaimer: Several images used in this presentation are sourced from Google Images. No copyright violation is intended.
Machine Learning Pipeline

Data → Algorithm → Model


Principal Component Analysis - Recap

[Figure: input dataset, principal components, reconstruction of a test image and another test image]
Logistic Regression - Recap

[Figure: linearly separable dataset, probabilistic model, logistic function]

P(y = 1 | x) = 1 / (1 + exp(-w·x))
Logistic Regression - Recap

Gradient of the logistic loss (used in the gradient-descent update for w)
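To make the recap concrete, here is a minimal sketch of logistic regression trained by gradient descent on a toy linearly separable dataset; the data, learning rate, and number of steps are illustrative choices, not from the slides.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_logistic_regression(X, y, lr=0.1, n_steps=1000):
    """Fit w by gradient descent on the average logistic (cross-entropy) loss."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_steps):
        p = sigmoid(X @ w)          # P(y=1 | x) for every point
        grad = X.T @ (p - y) / n    # gradient of the average logistic loss
        w -= lr * grad
    return w

# Toy linearly separable data: the label depends on the sign of the first feature.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
X = np.hstack([X, np.ones((200, 1))])   # add a bias column
y = (X[:, 0] > 0).astype(float)

w = train_logistic_regression(X, y)
print("training accuracy:", np.mean((sigmoid(X @ w) > 0.5) == y))
```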
Case study: ProPublica's "Machine Bias" analysis of the COMPAS recidivism risk score.
Source: https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2208240
Biases in Word Embeddings

https://blog.acolyer.org/2020/12/08/bias-in-word-embeddings/
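As an illustration of how such biases are commonly measured, here is a small sketch that projects words onto a "he − she" gender direction. The tiny hand-made vectors below are purely hypothetical stand-ins; in practice they would be loaded from pretrained embeddings such as word2vec or GloVe.

```python
import numpy as np

# Hypothetical 4-dimensional "embeddings" for illustration only.
vectors = {
    "he":       np.array([ 1.0, 0.2, 0.1, 0.0]),
    "she":      np.array([-1.0, 0.2, 0.1, 0.0]),
    "engineer": np.array([ 0.6, 0.9, 0.3, 0.1]),
    "nurse":    np.array([-0.7, 0.8, 0.2, 0.1]),
}

def bias_score(word):
    """Projection of a word onto the he-she direction (positive = 'male' side)."""
    direction = vectors["he"] - vectors["she"]
    direction /= np.linalg.norm(direction)
    v = vectors[word] / np.linalg.norm(vectors[word])
    return float(v @ direction)

for word in ["engineer", "nurse"]:
    print(word, round(bias_score(word), 3))
```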
How do we fix it?

But first, what exactly is fairness?


Fair unsupervised learning
UnFair PCA

Standard PCA minimizes the average reconstruction error, which can leave one demographic group with a much higher error than another; Fair PCA instead balances the reconstruction error across groups.

https://sites.google.com/site/ssamadi/home/fair-pca-homepage
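A minimal sketch of the phenomenon, using sklearn's PCA on synthetic data with a majority and a minority group (the data, group sizes, and variances are made up); the point is only that the average reconstruction error can hide a large gap between groups.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)

# Synthetic data: the majority group varies mostly along one direction,
# the minority group along a different one.
majority = rng.normal(size=(900, 2)) @ np.array([[3.0, 0.0], [0.0, 0.3]])
minority = rng.normal(size=(100, 2)) @ np.array([[0.3, 0.0], [0.0, 3.0]])
X = np.vstack([majority, minority])

# Standard (group-blind) PCA onto one component.
pca = PCA(n_components=1).fit(X)
X_hat = pca.inverse_transform(pca.transform(X))

def group_error(original, reconstructed):
    return float(np.mean(np.sum((original - reconstructed) ** 2, axis=1)))

print("majority reconstruction error:", group_error(majority, X_hat[:900]))
print("minority reconstruction error:", group_error(minority, X_hat[900:]))
# Fair PCA would instead pick a projection that balances these two errors.
```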
Fair Supervised learning
No single answer
Statistical Parity

X – set of all data points (people)


C – Set of all data points (people) belonging to a certain protected group
M: X -> {0,1} - a classifier, say logistic regression

Bias of classifier for the protected group:


Parity(M, C) = Pr(M(x) = 1 | x ∈ C) − Pr(M(x) = 1)

Ideal classifier => Parity = 0


In practice, minimize |Parity(M, C)|
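A minimal sketch of the parity computation from hard predictions and a protected-group indicator; the arrays below are toy, hypothetical values.

```python
import numpy as np

def statistical_parity(predictions, in_group):
    """Parity(M, C) = Pr(M(x)=1 | x in C) - Pr(M(x)=1)."""
    predictions = np.asarray(predictions)
    in_group = np.asarray(in_group, dtype=bool)
    return predictions[in_group].mean() - predictions.mean()

# Toy example: 8 people, 4 of whom belong to the protected group C.
m_of_x = np.array([1, 0, 1, 0, 1, 1, 1, 0])   # classifier outputs M(x)
c      = np.array([1, 1, 1, 1, 0, 0, 0, 0])   # protected-group indicator
print(statistical_parity(m_of_x, c))           # 0.5 - 0.625 = -0.125
```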
Equality of Opportunity

X – set of all data points (people)


C – Set of all data points (people) belonging to a certain protected group
M: X -> {0,1} - a classifier, say logistic regression.

opportunity_inequality(M, C) = Pr(M(x) = 1 | y = 1, x ∈ C) − Pr(M(x) = 1 | y = 1)

Ideal classifier: opportunity_inequality = 0


In practice: min |opp_ineq(M,C)|
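A minimal sketch of the same computation restricted to the truly positive points (y = 1); the predictions, labels, and group memberships below are toy, hypothetical values.

```python
import numpy as np

def opportunity_inequality(predictions, labels, in_group):
    """Pr(M(x)=1 | y=1, x in C) - Pr(M(x)=1 | y=1)."""
    predictions = np.asarray(predictions)
    labels = np.asarray(labels, dtype=bool)
    in_group = np.asarray(in_group, dtype=bool)
    return predictions[labels & in_group].mean() - predictions[labels].mean()

# Toy example with hypothetical predictions, true labels, and group membership.
m_of_x = np.array([1, 0, 1, 1, 1, 0, 1, 0])
y      = np.array([1, 1, 0, 1, 1, 1, 0, 0])
c      = np.array([1, 1, 1, 1, 0, 0, 0, 0])
print(opportunity_inequality(m_of_x, y, c))
```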
Individual Fairness

X – set of all data points (people)


M: X -> Δ (probability simplex) - a classifier that outputs a distribution over labels, say logistic regression.
d: X * X -> R - distance function
D: Δ * Δ -> R – distance between probability vectors

D(M(x), M(x′)) <= d(x, x′) for all x, x′ (similar individuals must receive similar output distributions)
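A minimal sketch of checking this Lipschitz-style condition for one pair of individuals; the choice of total variation distance for D, Euclidean distance for d, and the toy classifier M are all illustrative assumptions.

```python
import numpy as np

def tv_distance(p, q):
    """Total variation distance between two probability vectors (one choice for D)."""
    return 0.5 * np.abs(np.asarray(p) - np.asarray(q)).sum()

def violates_individual_fairness(M, x, x_prime,
                                 d=lambda a, b: np.linalg.norm(a - b)):
    """True if D(M(x), M(x')) > d(x, x') for this pair."""
    return tv_distance(M(x), M(x_prime)) > d(x, x_prime)

# Hypothetical classifier mapping a feature vector to [P(y=0), P(y=1)].
def M(x):
    p1 = 1.0 / (1.0 + np.exp(-x.sum()))
    return np.array([1.0 - p1, p1])

x, x_prime = np.array([0.1, 0.2]), np.array([0.15, 0.2])
print(violates_individual_fairness(M, x, x_prime))
```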


Outcome Test (Predictive Parity)

X – set of all data points (people)


C – Set of all data points (people) belonging to a certain protected group
M: X -> {0,1} - a classifier, say logistic regression.

Pred_parity(M, C) = Pr(y = 1 | M(x) = 1, x ∈ C) − Pr(y = 1 | M(x) = 1)

Ideal classifier: Pred_parity = 0


In practice: min |Pred_parity(M,C)|
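A minimal sketch of the outcome test: among people the classifier flags positive, compare the fraction that are truly positive inside the protected group with the overall fraction. The arrays are toy, hypothetical values.

```python
import numpy as np

def predictive_parity_gap(predictions, labels, in_group):
    """Pr(y=1 | M(x)=1, x in C) - Pr(y=1 | M(x)=1)."""
    predictions = np.asarray(predictions, dtype=bool)
    labels = np.asarray(labels)
    in_group = np.asarray(in_group, dtype=bool)
    return labels[predictions & in_group].mean() - labels[predictions].mean()

# Toy, hypothetical predictions, labels, and protected-group membership.
m_of_x = np.array([1, 1, 1, 0, 1, 1, 0, 0])
y      = np.array([1, 0, 1, 1, 1, 1, 0, 0])
c      = np.array([1, 1, 1, 1, 0, 0, 0, 0])
print(predictive_parity_gap(m_of_x, y, c))
```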
Impossibility Result

opportunity_inequality(M, C) = Pr(M(x) = 1 | y = 1, x ∈ C) − Pr(M(x) = 1 | y = 1)

Calibration within groups: Pr(M(x) = 1 | x ∈ C) ≈ Pr(y = 1 | x ∈ C)

In general, when the base rate Pr(y = 1 | x ∈ C) differs across groups, a single classifier cannot satisfy all of these fairness criteria (statistical parity, equality of opportunity, predictive parity, calibration within groups) at the same time.


Fair Logistic Regression
https://dl.acm.org/doi/fullHtml/10.1145/3308560.3317584

Parity = |Pr(M(x) = 1 | C = 1) − Pr(M(x) = 1 | C = 0)|

Objective: the regularized logistic loss on the dataset, plus a penalty term that measures how correlated the prediction probabilities are with the protected attribute of each data point i.
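A minimal sketch of this idea, assuming the penalty is the squared covariance between the predicted probabilities and the protected attribute, added to the regularized logistic loss and minimized by gradient descent. The data, λ, regularization strength, and learning rate are illustrative choices, not taken from the linked paper.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fair_logreg(X, y, c, lam=5.0, reg=0.01, lr=0.1, n_steps=2000):
    """Gradient descent on: logistic loss + reg*||w||^2 + lam*cov(p, c)^2,
    where p are predicted probabilities and c is the protected attribute."""
    n, d = X.shape
    w = np.zeros(d)
    c_centered = c - c.mean()
    for _ in range(n_steps):
        p = sigmoid(X @ w)
        cov = (c_centered * p).mean()                    # covariance between p and c
        grad_loss = X.T @ (p - y) / n                    # logistic-loss gradient
        grad_cov = X.T @ (c_centered * p * (1 - p)) / n  # gradient of cov(p, c)
        w -= lr * (grad_loss + 2 * reg * w + 2 * lam * cov * grad_cov)
    return w

# Toy data in which the label is correlated with the protected attribute.
rng = np.random.default_rng(0)
n = 500
c = rng.integers(0, 2, size=n)                   # protected attribute per data point
X = np.column_stack([rng.normal(size=n) + c, rng.normal(size=n), np.ones(n)])
y = (X[:, 0] + 0.3 * rng.normal(size=n) > 0.5).astype(float)

w = fair_logreg(X, y, c)
p = sigmoid(X @ w)
print("parity gap:", p[c == 1].mean() - p[c == 0].mean())
```

Raising λ trades prediction accuracy for a smaller parity gap; setting λ = 0 recovers ordinary regularized logistic regression.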
Fair Multi Armed Bandits

[Figure: four arms/partners with expected rewards 0.8, 0.4, 0.5, 0.9]

Case in point: Swiggy/Zomato wants to assign partners to orders.


Goal: Maximize expected reward over time.

A simple strategy (ε-greedy): pick the current best arm with probability 0.9 and an arm uniformly at random with probability 0.1.
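A minimal sketch of that strategy on four simulated Bernoulli arms, using the 0.8/0.4/0.5/0.9 means from the example above; the horizon and ε = 0.1 are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
true_means = np.array([0.8, 0.4, 0.5, 0.9])   # expected reward of each partner/arm
counts = np.ones(4)                            # one optimistic initial pull per arm
rewards = np.ones(4)

for t in range(10_000):
    if rng.random() < 0.1:                     # explore with probability 0.1
        arm = rng.integers(4)
    else:                                      # otherwise exploit the current best arm
        arm = int(np.argmax(rewards / counts))
    r = float(rng.random() < true_means[arm])  # Bernoulli reward
    counts[arm] += 1
    rewards[arm] += r

print("empirical means:", np.round(rewards / counts, 3))
print("pull counts    :", counts.astype(int))
```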
Fair Multi Armed Bandits

Notion of fairness:

If partner a is truly better than partner b, then the probability that the algorithm assigns partner a should be at least the probability that it assigns partner b.

A simple strategy (ε-greedy): pick the current best arm with probability 0.9 and an arm uniformly at random with probability 0.1.

[Figure: four arms with empirical means 0.5, 0.35, 0.6, 0.38 and overlapping confidence intervals]

IDEA: Sample uniformly from those arms whose confidence interval overlaps with the current winner's interval
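A minimal sketch of that idea: maintain a confidence interval per arm and, at each round, sample uniformly from the arms whose upper confidence bound reaches the current best arm's lower confidence bound. The interval width and horizon are illustrative choices, not taken from a specific paper's analysis.

```python
import numpy as np

rng = np.random.default_rng(0)
true_means = np.array([0.8, 0.4, 0.5, 0.9])
counts = np.ones(4)
rewards = (rng.random(4) < true_means).astype(float)   # one initial pull per arm

for t in range(10_000):
    means = rewards / counts
    width = np.sqrt(2 * np.log(t + 2) / counts)        # confidence-interval half-width
    lower, upper = means - width, means + width
    best = int(np.argmax(means))
    # Candidate set: arms whose interval overlaps the current winner's interval.
    candidates = np.flatnonzero(upper >= lower[best])
    arm = int(rng.choice(candidates))                    # sample uniformly among them
    r = float(rng.random() < true_means[arm])
    counts[arm] += 1
    rewards[arm] += r

print("empirical means:", np.round(rewards / counts, 3))
print("pull counts    :", counts.astype(int))
```

Arms that are clearly worse eventually fall out of the candidate set, while arms that cannot yet be distinguished from the winner are treated equally, matching the fairness notion above.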
● Fair PCA
● Fair LogReg
● Fair MAB
[Diagram: original loss function → perturbed loss function (fairness constraint over the model space) → model]
Thank you!

https://cerai.iitm.ac.in

[email protected]
