Chapter5
Chapter5
Christophe Ley
Statistical Modelling
Outline
Responsible AI
Goal from the OECD : “AI that is innovative and trustworthy and
respects human rights and democratic values”
Quality questions
Pure prediction enjoys the benefit of being able to exploit all patterns,
possibly involving many covariates and their mutual interactions,
present in the dataset in order to get the best prediction. On the
contrary, significance testing generally focuses on verifying whether
one covariate has a sufficiently large effect on the outcome.
Outline
Interpretable covariates
Comprehensible aggregation
Decision trees and linear models are preferred from this perspective
over Artificial neural networks.
The first category are “interpretable models”, while the second covers
“post-hoc techniques”, where interpretability is regarded as an
addition to a model that was constructed in advance.
Model-agnosticism
Not every method from category (ii) is able to extend any model. For
instance, guided backpropagation is a method that is limited to the
convolutional neural network model. This method makes clever use of
the model’s structure to provide a qualitative explanation, hence is
tailored to take advantage of the neural network’s structure, and can’t
be applied to other machine learning models.
Outline
Outline
Concrete example
Data about the details of patients who have tested positive for
COVID-19 along with their demographic details, age, sex, symptoms,
and outcome. Using this data an ML model predicts the likelihood of a
patient recovering given the age, sex and symptoms of the patient.
Outline
Single PDP
Multiple PDPs