1 Introduction
There have been many recent efforts to quantify the importance of features to a model [19, 4, 1, 15,
12, 13]. Many of these determine importance by estimating the Shapley value of a game
designed to assign importance to sets of features [4, 12, 5, 13, 14, 25]. These Shapley-value-based
feature importance methods are used widely in practice [2].
At the same time, there have been increasing concerns that these game theoretic values may not
completely capture human or technical notions of feature importance [10, 21, 24]. A particularly
salient issue is that users have misconceptions about what Shapley values represent and what actionable information can be gleaned from them [8]. Complex non-linear models, and models built on correlated features, do not have Shapley values that can be interpreted as the effect of a direct intervention [10], e.g., one in which increasing a variable's value changes the model outcome in a predictable
way. The goal of this work is to quantify the extent of these concerns and provide a theoretical
foundation for understanding the limits of Shapley values.
In this work, we introduce Shapley Residuals, vector-valued objects that capture a specific type of
quantitative information lost by Shapley values. Shapley residuals can be associated with individual features as well as with sets of features.
Table 1: The KernelSHAP game for Example 1: explaining the input (1, 1, 1) to f(x1, x2, x3) = x1 + 2 x2 x3, where the xi are i.i.d. N(0, 1) features.

S               Hypercube Coordinate   v(S) Definition                      v(S) Value
∅               (0,0,0)                E[f(x)]                              0
{x1}            (1,0,0)                E[f(x) | x1 = 1]                     1
{x2}            (0,1,0)                E[f(x) | x2 = 1]                     0
{x3}            (0,0,1)                E[f(x) | x3 = 1]                     0
{x1, x2}        (1,1,0)                E[f(x) | x1 = 1, x2 = 1]             1
{x1, x3}        (1,0,1)                E[f(x) | x1 = 1, x3 = 1]             1
{x2, x3}        (0,1,1)                E[f(x) | x2 = 1, x3 = 1]             2
{x1, x2, x3}    (1,1,1)                E[f(x) | x1 = 1, x2 = 1, x3 = 1]     3
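For concreteness, the value function in Table 1 can be estimated by Monte Carlo as below. This is an illustrative sketch of ours, not the implementation used for the experiments; the sample size is arbitrary.

```python
import itertools
import numpy as np

rng = np.random.default_rng(0)

def f(x):
    # Example 1: f(x1, x2, x3) = x1 + 2 * x2 * x3
    return x[:, 0] + 2 * x[:, 1] * x[:, 2]

def interventional_game(f, x_explain, background):
    """Monte Carlo estimate of v(S) = E[f(x_S, X_Sbar)] for every coalition S."""
    d = background.shape[1]
    v = {}
    for r in range(d + 1):
        for S in itertools.combinations(range(d), r):
            samples = background.copy()
            if S:   # fix the features in S at the explained input, resample the rest
                samples[:, list(S)] = x_explain[list(S)]
            v[S] = float(f(samples).mean())
    return v

background = rng.standard_normal((200_000, 3))          # i.i.d. N(0, 1) features
game = interventional_game(f, np.array([1.0, 1.0, 1.0]), background)
for S, val in game.items():
    print(S, round(val, 2))   # matches the v(S) column of Table 1 up to sampling noise
```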
In the second scenario, consider a data generating distribution where α controls the correlation between two features in X and a regression target y:

    (X, y) ∼ ( N((0, 0), [[1, α], [α, 1]]), ⟨X, (3, 1)⟩ ).
We examine a regression model f(x1, x2) = β1 x1 + β2 x2 determined via linear least squares. Assume access to infinitely many i.i.d. samples from (X, y), so that β = (3, 1). Suppose a practitioner wanted to explain the output of f(1, 1) = β1 + β2, this time using Conditional Expectation SHAP [24]. The Shapley values are β1 + α(β2 − β1)/2 for x1 and β2 + α(β1 − β2)/2 for x2. When α ≈ 0, the Shapley values correspond to the model weights β1, β2, and support a (valid) interventional interpretation that changing x1 yields a larger change to the output of f than does x2. However, if α ≈ 1, the Shapley values do not support this interpretation. A practitioner employing Shapley values alone lacks the information to distinguish these scenarios. Shapley residuals provide useful diagnostic information; the norms of the residuals for x1 and x2 are exactly linearly proportional to α.
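For completeness, these values follow directly from the conditional-expectation game on the two features (a short derivation consistent with the expressions above):

    v(∅) = 0,    v({x1}) = E[β1 x1 + β2 x2 | x1 = 1] = β1 + αβ2,
    v({x2}) = β2 + αβ1,    v({x1, x2}) = β1 + β2,

so the Shapley value of x1 is ½[(v({x1}) − v(∅)) + (v({x1, x2}) − v({x2}))] = β1 + α(β2 − β1)/2, and symmetrically for x2.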
In these simple scenarios, it is clear that Shapley residuals capture, respectively, variable interactions
and mismatches between dependent features in the data and independent variables in the model. As
we show in Section 6, these observations apply to real-world scenarios as well.
In summary, we:
• introduce Shapley residuals (Section 3), which characterize the limits of Shapley values as
explanatory mechanisms for cooperative games,
• study the properties of Shapley residuals both in general and in the context of existing formulations for explanatory games (Sections 3, 4 and 5),
• show via a number of experiments that Shapley residuals capture meaningful information
for model explanations in realistic scenarios (Section 6),
• discuss the limitations of Shapley residuals themselves (Section 7).
2 Background
In this section, we begin by setting up the mathematical definitions and background we’ll need for the
rest of the paper. To help illustrate these ideas, we'll use the running example from the introduction, the function f(x1, x2, x3) = x1 + 2 x2 x3; we refer to this as Example 1. We begin by describing Shapley values
and cooperative games.
Games. A cooperative game consists of d players and a value function v : 2^[d] → R, where [d] ≜ {1, . . . , d}. The quantity v(S) represents the value of the game for a coalition of players S ∈ N ≜ 2^[d]. Without loss of generality we will assume that v(∅) = 0, and that we can identify the game with v. Let the space of games be denoted by G.
The Shapley value is a way to fairly allocate the value of the grand coalition v([d]) among the players.
Definition 1 (Shapley values [20]). The Shapley values φi(v), i ∈ [d], are the unique values satisfying the properties

Efficiency: Σ_{i=1}^d φi(v) = v([d]).
Dummy: If v(S ∪ {i}) = v(S) for all S ⊂ [d] \ {i}, then φi(v) = 0.
Symmetry: If v(S ∪ {i}) = v(S ∪ {j}) for all S ⊂ [d] \ {i, j}, then φi(v) = φj(v).
Given a model f(x1, x2, ..., xd), the features from 1 to d can be considered players in a game in which the payoff v is some measure of the importance or influence of a subset of features. The Shapley value φi(v) can then be viewed as a fairly attributed "influence" of i on the outcome v([d]). In KernelSHAP, for instance, a function's prediction on a certain input given a data distribution is modeled as a game, as shown in Table 1.
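As a sanity check (again our own illustrative snippet rather than the paper's code), applying the classical Shapley formula to the game of Table 1 gives equal values for all three features, which sum to v([d]) = 3 as the efficiency property requires:

```python
import itertools
from math import factorial

def shapley_values(v, d):
    """Classical Shapley formula: phi_i = sum over S of |S|!(d-|S|-1)!/d! * [v(S ∪ {i}) − v(S)]."""
    phi = [0.0] * d
    for i in range(d):
        others = [j for j in range(d) if j != i]
        for r in range(d):
            for S in itertools.combinations(others, r):
                weight = factorial(len(S)) * factorial(d - len(S) - 1) / factorial(d)
                phi[i] += weight * (v[tuple(sorted(S + (i,)))] - v[S])
    return phi

# Exact game values from Table 1; coalitions are sorted tuples of feature indices.
v = {(): 0, (0,): 1, (1,): 0, (2,): 0,
     (0, 1): 1, (0, 2): 1, (1, 2): 2, (0, 1, 2): 3}
print(shapley_values(v, 3))   # [1.0, 1.0, 1.0]; they sum to v([d]) = 3 (efficiency)
```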
It will be useful for us to visualize a game as a function over the vertices of a d-dimensional hypercube. Each coordinate corresponds to the presence or absence of a certain player, and each vertex corresponds to a subset of players. Specifically, we can think of the set N as the d-dimensional hypercube G = (V = N, E), with each vertex labeled by a set S ⊆ [d] and edges between sets S and S ∪ {i} for all i ∈ [d] \ S. We depict this interpretation in Figure 1(a).
Figure 1: Visualizing the game and the gradient of the game corresponding to Example 1. (a) Graphical representation of v. (b) Graphical representation of ∇v.
Gradients on the hypercube. Let R^V be the space of functions from V to R and let R^E be the space of functions from E to R. In particular, the game v is an element of R^V.
The differential operator ∇ : R^V → R^E is then defined as ∇v(S, S ∪ {i}) = v(S ∪ {i}) − v(S) for any v ∈ R^V. Essentially, ∇ is a discrete gradient operator on G, mapping functions on vertices to functions on edges (see Figure 1(b)).
We will also define a partial gradient ∇i : R^V → R^E:

    ∇i u(S, S ∪ {j}) = u(S ∪ {j}) − u(S)  if i = j,  and 0 otherwise.

Intuitively, ∇i evaluates a gradient for edges corresponding to the insertion of i, and takes the value 0 everywhere else. On the hypercube, only edges along the ith axis take a nonzero value under ∇i v. See the Edge Space portion of Figure 2(a) for an illustration of this procedure on the running example.
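The hypercube picture translates directly into a small matrix implementation. The sketch below is ours, written densely for readability (a sparse matrix is preferable for larger d); it builds ∇ and ∇i as edge-by-vertex operators and applies them to the Example 1 game from Table 1.

```python
import itertools
import numpy as np

def hypercube(d):
    """Vertices (coalitions as frozensets) and edges (S, S ∪ {i}, i) of the d-dimensional cube."""
    vertices = [frozenset(S) for r in range(d + 1)
                for S in itertools.combinations(range(d), r)]
    index = {S: k for k, S in enumerate(vertices)}
    edges = [(S, S | {i}, i) for S in vertices for i in range(d) if i not in S]
    return vertices, index, edges

def gradient_operator(d, player=None):
    """Matrix form of the discrete gradient ∇ (or of the partial gradient ∇_i if `player` is given)."""
    vertices, index, edges = hypercube(d)
    D = np.zeros((len(edges), len(vertices)))
    for row, (S, T, i) in enumerate(edges):
        if player is None or i == player:
            D[row, index[T]] = 1.0   # +v(S ∪ {i})
            D[row, index[S]] = -1.0  # -v(S)
    return D, vertices, edges

# The Example 1 game of Table 1 as a vector over the vertices of the 3-cube.
D, vertices, _ = gradient_operator(3)
game = {frozenset(): 0, frozenset({0}): 1, frozenset({1}): 0, frozenset({2}): 0,
        frozenset({0, 1}): 1, frozenset({0, 2}): 1, frozenset({1, 2}): 2,
        frozenset({0, 1, 2}): 3}
v = np.array([game[S] for S in vertices])
print(D @ v)                               # ∇v: one finite difference per edge
D2, _, _ = gradient_operator(3, player=1)
print(D2 @ v)                              # ∇_2 v: nonzero only on edges inserting x2
```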
A geometric interpretation of Shapley values dates back at least to Kleinberg and Weiss [9], showing
they can be expressed in terms of projections from the space of games to the space of cooperative
games with independently contributing players. A key advance was made by Stern and Tettenhorst
[22], building on earlier work by Candogan et al. [3] who proposed viewing the game as a scalar
function defined on the hypercube and studying its discrete gradient. To understand this advance, we
first introduce a special class of games.
Inessential games. Let I denote the space of games v such that for all S ⊆ [d], v(S) = Σ_{i∈S} v({i}). I is called the space of inessential games. Intuitively, an inessential game is one in which the player interactions are simple and additive: every player adds a fixed value v({i}) to a
coalition S independent of the composition of S. Inessentiality is a key feature of what makes Shapley
values attractive for feature importance – if each contribution is fixed and combines additively, we
have a natural interpretation for how much each feature contributes to the overall model output.
Specifically, if a game is inessential, it then follows that the Shapley value for player i is v({i}). In
our running example using KernelSHAP, this is E[f (x)|xi = 1], the contribution (averaged over
other variables) of the variable xi .
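A direct check of this definition (an illustrative snippet of ours) contrasts the Example 1 game of Table 1 with the game obtained for f1(x1, x2, x3) = x1 + x2 + x3 at the same input, whose KernelSHAP value function is simply v(S) = |S|:

```python
import itertools

def is_inessential(v, d, tol=1e-9):
    """Check the defining property of I: v(S) equals the sum of v({i}) over i in S, for every S."""
    return all(abs(v[S] - sum(v[(i,)] for i in S)) <= tol
               for r in range(2, d + 1) for S in itertools.combinations(range(d), r))

# Example 1 game (Table 1) vs. the game for f1(x1, x2, x3) = x1 + x2 + x3 at the same input.
v_example1 = {(): 0, (0,): 1, (1,): 0, (2,): 0, (0, 1): 1, (0, 2): 1, (1, 2): 2, (0, 1, 2): 3}
v_additive = {(): 0, (0,): 1, (1,): 1, (2,): 1, (0, 1): 2, (0, 2): 2, (1, 2): 2, (0, 1, 2): 3}
print(is_inessential(v_example1, 3))  # False: v({x2, x3}) = 2 != v({x2}) + v({x3}) = 0
print(is_inessential(v_additive, 3))  # True: every coalition's value is the sum of its singletons
```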
In general though, a game might not be inessential. The key insight of Stern and Tettenhorst [22] was
to express inessentiality of games in terms of gradients on the hypercube.
Proposition 1 ([22, Prop 3.3]). The game v is inessential if and only if for each i ∈ [d] there exists vi ∈ R^V such that ∇i v = ∇vi.
The main result by Stern and Tettenhorst [22] is a decomposition of an arbitrary game v into games that are "close to being inessential" and allow extraction of Shapley values. If v is not inessential, we cannot be sure to find vi such that ∇i v = ∇vi, but we can find the "closest" such vi as the solution to the least squares problem

    vi = argmin_{x ∈ R^V, x(∅) = 0} ‖∇x − ∇i v‖.
Theorem 1 (Stern and Tettenhorst [22]). Given a game v, let vi be defined as above. Then

1. Σ_{i∈[d]} vi = v
2. If v(S ∪ {i}) = v(S) for all S ⊂ [d], then vi = 0
3. For any α, α′ ∈ R and games v, v′, (αv + α′v′)i = α vi + α′ v′i
4. If π is a permutation of [d] and πv is the game πv(S) = v(π(S)), then (πv)i = vπ(i)

Consider the mapping φ(v)(S) = Σ_{i∈S} vi([d]). The above result implies this is a Shapley mapping and therefore φi(v) = vi([d]) are the Shapley values of v. We illustrate the construction in Figure 2(a).
3 Shapley Residuals
The inessentiality of a game is inextricably linked to the meaningfulness of Shapley values for the
reasons given above. The idea we explore now is the converse: can the degree to which a game is not
inessential provide insights into where Shapley values are not able to capture feature influence?
By the fundamental theorem of linear algebra, we can write

    ∇i v = ∇vi + ri,

where ri is orthogonal to ∇vi. This allows us to interpret ri (a vector with one value for each edge of the hypercube) as a measure of deviation from inessentiality, because by Proposition 1, this vector is identically 0 if and only if the game is inessential.
We can generalize these ideas further to subsets of players. We begin with a generalized notion of
inessentiality:
Definition 2. The game v is inessential relative to S if v(C) = v(S) + v(C \ S) for all C such that S ⊂ C ⊆ [d].
That is, each coalition C containing S obtains a value equal to that of the subcoalition S and the subcoalition C \ S working separately; in this sense, inessentiality with respect to S speaks to the lack of interactions between S and its complement. In addition, inessentiality relative to a single player i is the same as inessentiality relative to the singleton set {i}.
Next, we generalize the notion of a partial derivative.

Definition 3. For a subset S ⊂ [d], let ∇S : R^V → R^E be the operator ∇S = Σ_{i∈S} ∇i, i.e.,

    ∇S u(C, C ∪ {j}) = ∇u(C, C ∪ {j})  if j ∈ S,  and 0 otherwise.
To understand the limits of Shapley values, we propose to quantify the degree of deviation from
inessentiality with the following definition:
Definition 4 (Shapley Residuals). We call ri = ∇i v − ∇vi the Shapley Residual of player i. Analogously, rS = Σ_{i∈S} ri is the Shapley Residual of set S.
Shapley Residuals are a novel diagnostic tool for feature importance, and enjoy a number of relevant
properties.
Proposition 3. If v is inessential, then v is inessential relative to every i ∈ [d] and every subset S ⊂ [d]. If v is inessential with respect to each of the players i, j, . . . , z, then v is inessential relative to the set {i, j, . . . , z}.
The proof of this proposition is in the appendix. The following corollaries are straightforward.
Corollary 1. v is inessential iff ri = 0 for each i ∈ [d].
Corollary 2. v is inessential relative to S iff rS = Σ_{i∈S} ri = 0.
This allows us to interpret Σ_{i∈[d]} ‖ri‖2 as the deviation from inessentiality of v and ‖Σ_{i∈S} ri‖2 as the deviation from inessentiality of v relative to S.
Figure 2: (a) The decomposition of a game proposed by Stern and Tettenhorst [22]. (b) The construction of Shapley residuals.
In this paper we will focus on the computation and evaluation of residuals with respect to individual players, i.e., rS for S = {i}. Figure 2(b) illustrates the construction of residuals. Algorithm 1 describes how to compute residuals.¹
Algorithm 1 Exactly calculate the ith Shapley value and Shapley residual of v
Compute ∇i v
Solve vi = argmin_{x ∈ R^V} ‖∇i v − ∇x‖₂²
Compute ∇vi
Return Shapley residual ri = ∇i v − ∇vi
Return Shapley value φi = vi(S) − vi(∅), where S is the set of all players
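The following is a minimal dense implementation of Algorithm 1, our own sketch for small d rather than the paper's code; for larger player counts the sparse, well-conditioned structure makes an iterative solver such as scipy.sparse.linalg.lsqr the better choice, as noted in the discussion of limitations.

```python
import itertools
import numpy as np

def shapley_value_and_residual(v, d, i):
    """Algorithm 1 (illustrative sketch): exact Shapley value and residual of player i.
    `v` maps sorted coalition tuples to values. Dense least squares; fine for small d."""
    vertices = [S for r in range(d + 1) for S in itertools.combinations(range(d), r)]
    idx = {S: k for k, S in enumerate(vertices)}
    edges = [(S, tuple(sorted(S + (j,))), j) for S in vertices for j in range(d) if j not in S]

    # The full gradient operator ∇ as a matrix, and ∇_i v as a vector over the edges.
    D = np.zeros((len(edges), len(vertices)))
    grad_i_v = np.zeros(len(edges))
    for row, (S, T, j) in enumerate(edges):
        D[row, idx[T]], D[row, idx[S]] = 1.0, -1.0
        if j == i:
            grad_i_v[row] = v[T] - v[S]

    # v_i = argmin_x ||∇_i v - ∇x||_2; constants lie in the null space of ∇ (footnote 1),
    # so the x(∅) = 0 constraint can be dropped and recovered by differencing below.
    v_i, *_ = np.linalg.lstsq(D, grad_i_v, rcond=None)
    residual = grad_i_v - D @ v_i                      # r_i = ∇_i v - ∇v_i
    phi_i = v_i[idx[tuple(range(d))]] - v_i[idx[()]]   # φ_i = v_i([d]) - v_i(∅)
    return phi_i, residual

# Example 1 game from Table 1: the residual vanishes for x1 but not for x2 or x3.
game = {(): 0, (0,): 1, (1,): 0, (2,): 0, (0, 1): 1, (0, 2): 1, (1, 2): 2, (0, 1, 2): 3}
for i in range(3):
    phi, r = shapley_value_and_residual(game, 3, i)
    print(f"phi_{i+1} = {phi:.2f}, ||r_{i+1}|| = {np.linalg.norm(r):.2f}")
```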
We have established that the norm of the residual ri characterizes the degree to which the value
function v is not inessential with respect to the player i. We now show how to interpret this when
attributing feature importance via Shapley values for two popular methods. As has been noted, the
different methods for Shapley value-based explanation (whether local or global) all reduce to a specific
choice for the game v, at which point the Shapley values of v are estimated and returned [24, 10, 16].
The definitions of Shapley sampling values [23], as well as SHAP values [12], are derived from defining v as the conditional expected model output on a data point when only the features in S are known: v^Cond_{f,x}(S) = E[f(X) | XS = xS]. We call this Conditional Expectation SHAP after Sundararajan and Najmi [24].
The Interventional SHAP value function, which defines KernelSHAP, is derived from defining v by taking an expectation of f over the joint distribution of S̄ while fixing the feature values from S: v^Int_{f,x}(S) = E[f([xS, X_S̄])]. Notably, the two values are the same if the features in S̄ are independent from those in S.
We will show that the residual rS captures the degree to which interactions between the features in S
and its complement arise in the model or in the data, depending on which form of Shapley-based
feature importance is used to define the value function v.
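The difference between the two value functions can be seen in a few lines (our sketch, using the correlated-Gaussian setting from the introduction): for the coalition {x1}, the interventional value ignores the correlation α while the conditional value does not, and the two coincide when α = 0.

```python
import numpy as np

rng = np.random.default_rng(0)
beta = np.array([3.0, 1.0])               # model weights from the second scenario
f = lambda X: X @ beta                    # f(x1, x2) = 3*x1 + x2
x = np.array([1.0, 1.0])                  # the input being explained

for alpha in (0.0, 0.8):
    # Interventional value of S = {x1}: fix x1 = 1, draw x2 from its marginal N(0, 1).
    X = rng.multivariate_normal([0, 0], [[1, alpha], [alpha, 1]], size=500_000)
    v_int = f(np.column_stack([np.full(len(X), x[0]), X[:, 1]])).mean()
    # Conditional value of S = {x1}: for this Gaussian, E[x2 | x1 = 1] = alpha exactly.
    v_cond = beta[0] + alpha * beta[1]
    print(alpha, round(v_int, 2), round(v_cond, 2))
# alpha = 0.0 -> both are about 3.0; alpha = 0.8 -> v_int stays near 3.0 while v_cond = 3.8
```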
¹We can take an unconstrained minimum here and subtract vi(∅) at the end because adding a constant value to v does not change ∇v.
4.1 Inessentiality and Interactions
Interventional SHAP. Recall the problem of explaining two models where f1(x1, x2, x3) = x1 + x2 + x3 and f2(x1, x2, x3) = x1 + 2 x2 x3. Note that in the first model all three variables contribute independently to the model output, whereas in the second model the variables x2 and x3 interact in their contribution. We can compute the associated residuals r1, r2, r3 and their norms for these two models. For the first model, all residuals are identically zero. In the second, however, if x2 and x3 are nonzero for a certain input, they will have nonzero residuals. In other words, the residual
captures feature interactions in the model. Our first result, which we prove in the appendix, shows
that this intuition can be made precise.
Lemma 1. Let f : X = {X1, X2, ..., Xd} → Y be a multivariate function. Suppose f can be decomposed as f(x) = g(xS) + h(x_S̄), for some functions g : {Xj : j ∈ S} → Y and h : {Xj : j ∉ S} → Y. Let z = {z1, z2, ..., zn} ∈ X. Then v^Int_{f,z} is relatively inessential with respect to the set S.
This is important because if the model really does decompose additively for a certain variable
i, the practitioner understands what to expect when variable i is perturbed. The Interventional
Shapley residuals thus quantify the extent to which the SHAP values must be augmented with more
information to capture interaction effects in the model.
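As a concrete check, the game in Table 1 comes from a model that decomposes as g(x1) = x1 plus h(x2, x3) = 2 x2 x3, and its values are indeed relatively inessential with respect to S = {x1}:

    v({x1, x2}) = 1 = v({x1}) + v({x2}),
    v({x1, x3}) = 1 = v({x1}) + v({x3}),
    v({x1, x2, x3}) = 3 = v({x1}) + v({x2, x3}),

so every coalition containing x1 splits additively, as Definition 2 requires; r1 is therefore zero, while r2 and r3 are not (since v({x2, x3}) = 2 ≠ v({x2}) + v({x3}) = 0).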
Conditional Expectation SHAP. Just as the residual for Interventional SHAP can be thought of as detecting feature interactions in a model, the residuals of Conditional Expectation SHAP can detect feature interactions in the data. Let X ∼ N([0, 0]^T, Σ) for Σ = [[1, α], [α, 1]], and let Y = f(X) = β^T X (note that ordinary least squares will recover f in the limit of infinite data). Given an input (x1, x2), the SHAP values of f are φ1 = β1 x1 + α(β2 x1 − β1 x2)/2 and φ2 = β2 x2 + α(β1 x2 − β2 x1)/2. That is to say, they depend linearly on the correlation between the two variables. In particular, consider explaining the input [1, 1] to the function x1 + 3x2; the SHAP values are φ1 = 1 + α and φ2 = 3 − α, and the residual norms are both 2α. Notably, as the interaction between variables increases in the data (measured by α), the residual increases and the SHAP values deviate further and further from the coefficients of the actual model. We can make this intuition precise.
Lemma 2. Let f : X = {X1, X2, ..., Xd} → Y be a multivariate function. Suppose f can be decomposed as f(x) = g(xS) + h(x_S̄), for some functions g : {Xj : j ∈ S} → Y and h : {Xj : j ∉ S} → Y. Let z = {z1, z2, ..., zn} ∈ X. Suppose further that all Xj with j ∈ S are distributed independently from all Xj with j ∉ S. Then v^Cond_{f,z} is relatively inessential with respect to the set S.
The residual on Conditional Expectation SHAP thus quantifies the extent to which the SHAP values can be interpreted as interventional: depending on the causal structure of the data, correlated features could imply that perturbing a feature i results in the perturbation of a different feature as well.
Inspecting Shapley residuals in practice. Shapley residuals are vectors in the same space as
gradients, and are generally high-dimensional entities; a full study of their properties remains an
important topic for future work. The characterization in this section shows that the norm of the
residual vectors captures important limitations of Shapley values. Thus, our experiments use the
scaled norm of the residual vectors, defined to be the norm of the residual vector divided by the norm of the discrete gradient vector. This normalization makes residuals easier to compare across experiments.
5 Comparison with the Shapley-Taylor Interaction Index
The Shapley-Taylor interaction index [25] attributes the value of the grand coalition to subsets S of size at most k; the terms with |S| < k represent a discrete Taylor series around v(∅). When |S| = k, I^k_S is defined similarly to the Shapley value: a discrete derivative averaged over permutations.
Our residuals capture fundamentally different information about interactions than Shapley-Taylor. Consider some subset S for which |S| < k. Our residual rS is 0 when the marginal value of adding S to a coalition W is constant across the coalitions W with W ∩ S = ∅; in other words, it is about the presence or absence of interactions of S with other variables. The Shapley-Taylor interaction index I^k_S, on the other hand, is 0 when v(S) can be inferred from the values of v(W) for W ⊂ S. The Taylor terms of the Shapley-Taylor explanation thus capture information about how the players in S interact with each other when no other variables are involved. For instance, if for a certain game v({i}) + v({j}) = v({i, j}), this means that the term v({i, j}) provides no interaction information about the two players, and I^k_{{i,j}} for explanation sizes k > 2 will be 0.
However, the Taylor indices for a coalition S say nothing about whether the variables within S interact once a player outside of S is involved. Consider a three-player game between a, b, and c, where v({a}) + v({b}) = v({a, b}) and v({a}) + v({c}) = v({a, c}); this would make I^k_{{a,b}} and I^k_{{a,c}} equal to 0, implying that a and b do not interact, and a and c do not interact. But it could be that v is not relatively inessential with respect to a. If v({a, b, c}) − v({b, c}) ≠ v({a, c}) − v({c}), then a's relative contribution with respect to c changes once b is involved. This constitutes an interaction between a and c that is not described by the Shapley-Taylor index for {a, c}, but is rather captured by the third-order interaction of {a, b, c}. In this scenario, our residuals would show ra ≠ 0, alerting us to the fact that a interacts with {b, c}; additionally, they would show r{a,b} ≠ 0, alerting us to the fact that {a, b} interacts with c.
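A tiny numeric illustration of such a game (the specific payoffs below are ours, chosen only to satisfy the constraints above):

```python
import itertools

# A game on players a=0, b=1, c=2 with additive pairs but a third-order interaction.
v = {(): 0.0, (0,): 1.0, (1,): 1.0, (2,): 1.0,
     (0, 1): 2.0, (0, 2): 2.0, (1, 2): 2.0, (0, 1, 2): 4.0}

# Pairwise additivity holds, so the Shapley-Taylor terms for {a,b} and {a,c} report no interaction.
print(v[(0, 1)] == v[(0,)] + v[(1,)], v[(0, 2)] == v[(0,)] + v[(2,)])   # True True

# But a's marginal contribution is not constant: adding a to {b, c} is worth more than adding
# it to {c} alone, so v is not inessential relative to a and the residual r_a is nonzero.
marginals = {S: v[tuple(sorted(S + (0,)))] - v[S]
             for r in range(3) for S in itertools.combinations((1, 2), r)}
print(marginals)   # {(): 1.0, (1,): 1.0, (2,): 1.0, (1, 2): 2.0}
```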
In general, since ri captures information about all of i's interactions, we can state the following connection between the two Shapley extensions:
Lemma 3. Given a subset S with |S| < k, if there exists i ∈ S such that ri = 0, then the Shapley-Taylor index I^k_S = 0.
We have focused our attention on Shapley-Taylor interaction indices because of their proposed use for
explanations. It should be noted that they (as well as the Shapley interaction index proposed in [13])
are special cases of a general class of interaction indices investigated in a long line of work starting with [17] and surveyed in [6] (including the Grabisch-Roubens [7] Shapley and Banzhaf interaction
indices). All of these differ from Shapley residuals – the latter are meant to represent the information
lost when computing singleton Shapley values, not their higher-dimensional extensions, which are
based on a different notion of a derivative.
6 Experiments
Having theoretically justified Shapley residuals in previous sections, we now focus on illustrating
what these residuals can help us understand about models on a real-world dataset. Throughout, we
use our own implementation of KernelSHAP to calculate the exact Shapley values and residuals (see
Algorithm 1 in Section 3).² Some additional experiments can be found in the appendix.
On comparisons to other feature importance methods. We note that Shapley residuals are not a feature importance evaluation method, nor are they an "explanation method" in and of themselves.
Rather, they are a quantification of the (valuable) information lost by Shapley values. A direct
comparison of different feature influence evaluation methods makes sense when there is a clear
objective to compare against. Such an objective doesn’t really exist here. Rather, we choose to
provide an internal validation that lays out the mathematical foundation on which the method rests.
This allows a user to decide the context in which to employ one method or another. For example, as
we discussed in Section 5, Shapley-Taylor indices and Shapley residuals appear to capture different
kinds of interactions that are potentially of interest to a user. There is no meaningful way to compare
them in a vacuum because one is not "better" than another.
Variable Interactions in Occupancy Detection. Consider the Shapley values and residuals for an occupancy detection dataset³ (20,560 instances) used to predict whether an office room is occupied.
The 7 attributes include a date stamp for an hour and day of the week. A decision tree model
with maximum depth 3 is trained on 75% of the data using the features light and hour.

²Code is provided in the supplementary material.
³https://archive.ics.uci.edu/ml/datasets/Occupancy+Detection+

Figure 3: Shapley values and residuals on a decision tree for the Occupancy Detection task. (a) KernelSHAP values. (b) KernelSHAP scaled residual norms. (c) KernelSHAP sampling for (10, 320).

When
evaluated on the remaining test set, the ROC-AUC for this decision tree is 0.991. We then calculate
the Shapley values and residuals (using 50 randomly sampled background rows from the test set) for
1000 randomly sampled test instances. The results for the variable "light" are shown in Figure 3.
The reason that the cluster of points in the middle has a high residual is illustrated in Figure 3(c).
Calculating the expected prediction while fixing a light value of 320, unlike most other possible
values, results in a mix of low and high predictions. These average to 0.4, while both the overall
expectation and particular prediction for occupancy probability for those points are 0.25.
Figure 4: Geometric representation of the KernelSHAP game for f(10, 320), where arrows to the right indicate inclusion of the light feature and arrows down indicate inclusion of the hour feature. Vertices: E[f(H, L)] = .24, E[f(10, L)] = .25, E[f(H, 320)] = .40, f(10, 320) = .01; edge differences: +.01, +.16, −.39, −.24.
Specifically, the KernelSHAP game for f(H, L) = P(occupant = T) with L = 320 and H = 10 is shown in Figure 4. L = 320 is a positive indicator of occupancy if H is unknown (+.16) but a "negative" indicator of occupancy if H is known to be 10 (−.24), due to the interactions in the model in this area of the feature space. The light Shapley value is close to 0 for points in this range, then, because it is the average of a positive and a negative number, not because it is of "low importance"; this non-inessentiality of the feature is what the residual captures.
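A sketch of how the game in Figure 4 can be assembled (ours; `model` and `background` stand in for the fitted depth-3 tree and the 50 background rows, and the column order (hour, light) is an assumption). For two features, the light Shapley value is half the sum of the two light-edge differences, and the residual norm works out to half their absolute difference.

```python
import numpy as np

def kernelshap_game_2d(predict, x, background):
    """Interventional (KernelSHAP) game for a two-feature model, v(S) = E[f(x_S, X_Sbar)],
    estimated by averaging over background rows. Illustrative sketch, not the paper's code."""
    v = {}
    for S in [(), (0,), (1,), (0, 1)]:
        samples = background.copy()
        for j in S:
            samples[:, j] = x[j]            # intervene: fix the features in S at the explained input
        v[S] = float(predict(samples).mean())
    return v

# Hypothetical usage: `model` is the depth-3 decision tree over (hour, light) and `background`
# holds the 50 sampled background rows; both names (and the column order) are assumptions.
# v = kernelshap_game_2d(lambda Z: model.predict_proba(Z)[:, 1], np.array([10.0, 320.0]), background)
# a = v[(1,)] - v[()]        # light edge with hour unknown:  about +.16 in Figure 4
# b = v[(0, 1)] - v[(0,)]    # light edge with hour = 10:     about -.24 in Figure 4
# phi_light     = 0.5 * (a + b)       # about -0.04: a positive and a negative edge cancel
# residual_norm = 0.5 * abs(a - b)    # about 0.20: the interaction the Shapley value discards
```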
7 Limitations
The exponential size of the hypercube limits the number of variables for which Shapley residuals can be practically computed to 20 to 30 (with corresponding vectors of length between a million and a billion elements). Since the derivative operator is sparse and
well-conditioned, the least squares problem is efficiently solved by the LSQR method [18]. Still, in
future work, we hope to efficiently identify whether a particular residual is nonzero, and approximate
properties of residuals which capture the entirety of non-linear interactions of a particular feature.
8 Acknowledgements
This research was funded in part by the NSF under grants DMR 1928882, IIS 1955162, and IIS
1956286, and by the DARPA SD2 program.
References
[1] Philip Adler, Casey Falk, Sorelle A Friedler, Tionney Nix, Gabriel Rybeck, Carlos Scheidegger,
Brandon Smith, and Suresh Venkatasubramanian. Auditing black-box models for indirect
influence. Knowledge and Information Systems, 54(1):95–122, 2018.
[2] Umang Bhatt, Alice Xiang, Shubham Sharma, Adrian Weller, Ankur Taly, Yunhan Jia, Joydeep
Ghosh, Ruchir Puri, José MF Moura, and Peter Eckersley. Explainable machine learning
in deployment. In Proceedings of the 2020 Conference on Fairness, Accountability, and
Transparency, pages 648–657, 2020.
[3] Ozan Candogan, Ishai Menache, Asuman Ozdaglar, and Pablo A Parrilo. Flows and decomposi-
tions of games: Harmonic and potential games. Mathematics of Operations Research, 36(3):
474–503, 2011.
[4] Anupam Datta, Shayak Sen, and Yair Zick. Algorithmic transparency via quantitative input
influence: Theory and experiments with learning systems. In 2016 IEEE symposium on security
and privacy (SP), pages 598–617. IEEE, 2016.
[5] Christopher Frye, Ilya Feige, and Colin Rowat. Asymmetric Shapley values: incorporating
causal knowledge into model-agnostic explainability. arXiv preprint arXiv:1910.06358, 2019.
[6] Katsushige Fujimoto, Ivan Kojadinovic, and Jean-Luc Marichal. Axiomatic characterizations
of probabilistic and cardinal-probabilistic interaction indices. Games and Economic Behavior,
55(1):72–99, 2006. ISSN 0899-8256. doi: https://doi.org/10.1016/j.geb.2005.03.002. URL
https://www.sciencedirect.com/science/article/pii/S0899825605000278.
[7] Michel Grabisch and Marc Roubens. An axiomatic approach to the concept of interaction
among players in cooperative games. International Journal of game theory, 28(4):547–565,
1999.
[8] Harmanpreet Kaur, Harsha Nori, Samuel Jenkins, Rich Caruana, Hanna Wallach, and Jennifer
Wortman Vaughan. Interpreting interpretability: Understanding data scientists’ use of inter-
pretability tools for machine learning. In Proceedings of the 2020 CHI Conference on Human
Factors in Computing Systems, CHI ’20, pages 1–14, New York, NY, USA, 2020. Association
for Computing Machinery. ISBN 9781450367080. doi: 10.1145/3313831.3376219. URL
https://doi.org/10.1145/3313831.3376219.
[9] Norman L Kleinberg and Jeffrey H Weiss. Weak values, the core, and new axioms for the
Shapley value. Mathematical Social Sciences, 12(1):21–30, 1986.
[10] I Elizabeth Kumar, Suresh Venkatasubramanian, Carlos Scheidegger, and Sorelle Friedler.
Problems with Shapley-value-based explanations as feature importance measures. arXiv preprint
arXiv:2002.11097, 2020. 37th International Conference on Machine Learning, to appear.
[11] Scott M Lundberg. shap: A game theoretic approach to explain the output of any
machine learning model, 2021. URL https://github.com/slundberg/shap/blob/
9ae91cce0e010189625c38a267b9520bd89f4b04/shap/explainers/_kernel.py#
L281-L374.
[12] Scott M Lundberg and Su-In Lee. A unified approach to interpreting model predictions. In
Advances in neural information processing systems, pages 4765–4774, 2017.
[13] Scott M Lundberg, Gabriel G Erion, and Su-In Lee. Consistent individualized feature attribution
for tree ensembles. arXiv preprint arXiv:1802.03888, 2018.
[14] Scott M Lundberg, Bala Nair, Monica S Vavilala, Mayumi Horibe, Michael J Eisses, Trevor
Adams, David E Liston, Daniel King-Wai Low, Shu-Fang Newman, Jerry Kim, et al. Explain-
able machine-learning predictions for the prevention of hypoxaemia during surgery. Nature
biomedical engineering, 2(10):749–760, 2018.
[15] Charles Marx, Richard Phillips, Sorelle Friedler, Carlos Scheidegger, and Suresh Venkatasubra-
manian. Disentangling influence: Using disentangled representations to audit model predictions.
In Advances in Neural Information Processing Systems, pages 4498–4508, 2019.
[16] Luke Merrick and Ankur Taly. The explanation game: Explaining machine learning models
with cooperative game theory, 2019.
[17] Guillermo Owen. Multilinear extensions of games. Management Science, 18(5):P64–P79, 1972.
ISSN 00251909, 15265501. URL http://www.jstor.org/stable/2661445.
[18] Christopher C Paige and Michael A Saunders. LSQR: An algorithm for sparse linear equations
and sparse least squares. ACM Transactions on Mathematical Software (TOMS), 8(1):43–71,
1982.
[19] Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1135–1144, 2016.
[20] Lloyd S Shapley. A value for n-person games. Technical report, Rand Corp Santa Monica CA,
1952.
[21] Dylan Slack, Sophie Hilgard, Emily Jia, Sameer Singh, and Himabindu Lakkaraju. Fooling
LIME and SHAP: Adversarial attacks on post hoc explanation methods. In Proceedings of the
AAAI/ACM Conference on AI, Ethics, and Society, pages 180–186, 2020.
[22] Ari Stern and Alexander Tettenhorst. Hodge decomposition and the Shapley value of a cooperative game. Games and Economic Behavior, 113:186–198, 2019.
[23] Erik Štrumbelj and Igor Kononenko. Explaining prediction models and individual predictions
with feature contributions. Knowledge and information systems, 41(3):647–665, 2014.
[24] Mukund Sundararajan and Amir Najmi. The many Shapley values for model explanation. arXiv
preprint arXiv:1908.08474, 2019. 37th International Conference on Machine Learning, to
appear.
[25] Mukund Sundararajan, Kedar Dhamdhere, and Ashish Agarwal. The Shapley Taylor interaction
index. In Hal Daume III and Aarti Singh, editors, Proceedings of the 37th International
Conference on Machine Learning, volume 119 of Proceedings of Machine Learning Research,
pages 9259–9268. PMLR, 13–18 Jul 2020. URL http://proceedings.mlr.press/v119/
sundararajan20a.html.