Measuring Development - The UNDP's Human Development Index (Journal of International Development, Vol. 5, Issue 2) (1993)
Measuring Development - The UNDP's Human Development Index (Journal of International Development, Vol. 5, Issue 2) (1993)
2, 183-192 (1993)
Abstract: This paper critically examines the UNDP’s Human Development Index
(HDI). We conclude that its treatment of income is inappropriate; the lack of year-to-
year comparability is undesirable; it is robust with respect to measurement error; and
that its contribution to the assessment of development levels differs markedly among
country groups.
1 INTRODUCTION
The UNDP’s composite Human Development Index (HDI), based on life expect-
ancy, educational attainment and material living standards, is an attempt to reorient
the assessment of development levels away from income to more broadly based
measures. Whilst we are sympathetic to this attempt, we believe there are a number
of shortcomings in the construction and presentation of the HDI (for critiques of a
more ideological nature see Desai (1991), Rao (1991) and Hopkins (1991)).
Section 2 presents the index and details of its construction, leading in Section 3 to
an analysis of the income and HDI disparities between countries. In Section 4 we
examine the UNDP’s claim that the HDI ranks countries very differently to per-
capita income and the possibility that such differences as there are, are simply the
result of measurement error. Section 5 examines a number of possible refinements
to the index and concluding remarks are made in Section 6 .
09.54-1748/93/020183-10$10.00
1993 by John Wiley & Sons, Ltd.
184 M . McGillivray and H . White
where Ijj is the ith indicator of human deprivation in country j (i = 1, 2 and 3 and j
- 1, . . ., 1). In the 1990 Report deprivation was calculated from life expectancy
-
( X l j )in 1987, adult literacy (X2,) in 1985 and the logarithm of purchasing power
adjusted G D P per capita (X3,) in 1987. Each variable is scaled from zero to one by
the formula:
where LIT is the adult literacy rate in 1985 and YRS mean years of schooling
received per person aged 25 years and over in 1980. The second refinement was in
the treatment of income. The observed range of GDPs per capita were divided in
multiples of the poverty line y * :
where Yj is country j’s real G D P per capita and y* is a poverty line real G D P per
capita of $US4829. The 1990 HDI formulation by comparison was:
’Our comments are, for the moment, confined to the ‘basic’ HDI only, and therefore ignore innovations
such as the gender-sensitive, distribution-adjusted and intertemporal HDIs (UNDP, 1991, Technical
Notes 3 to 5 ) .
The UNDP’s Human Development Index 185
Unlike the 1990 formulation, the 1991 formulation therefore allows Xjli to rise
marginally after exceeding the poverty line.
The 1990 Report makes two claims in particular for the HDI: (1) that the disparity
between countries is less for the HDI than it is for income per capita; and (2) that it
ranks countries very differently to income measures. In this section we consider the
first of these and the treatment of income in constructing the HDI.
The 1990 Report contains a graph headed ‘Ranking of countries’ GNP per capita
and HDI’. This graph is reproduced as Figure 1.2It is simply a graph of HDI and
GDP per capita scores (indexed so that the highest value equals 100 in our
example) each sorted separately in ascending order. There is no information on
how country rankings compare by the two criteria. The graph is accompanied in the
Report by the following statement:
The chart shows two separate distributions of countries. The upper curve
shows their ranking according to the human development index while the
lower curve shows their ranking according to GNP per capita. The two curves
reveal that the disparity among countries is much greater in income than in
human development. There is no automatic link between the level of per
capita income in a country and the level of its development (UNDP, 1990:
15).
This last statement cannot be verified from the graph, since the two indices are
ranked separately. But we can see that there are many countries with relatively
low per-capita income levels compared to a few (the top 20 per cent or so) with
much higher income-by contrast the HDI rises steadily over the range of coun-
tries. Thus it is correct that disparities in the HDI are less than those in GDP per
capita.
What is the source of this disparity? There are two potential sources. First, the
inclusion of life expectation and literacy. Second, the treatment of income in con-
structing the HDI. Physiological limits on life expectancy and a 100 per cent ceiling
on literacy restrict the disparity that may be displayed by these variables, so that
their inclusion in the HDI will clearly reduce disparity (though whether or not they
affect ranking will depend on their correlation with per-capita income).
There are two adjustments made to income in the 1990 Report. The first is to
take logs. A log transformation of income is appealing to economists, since it
captures diminishing marginal utility of income-that is, successive increments in
income give successively smaller increments in utility. Such a transformation ‘closes
the gap’ at the top end of the income scale, so that the disparity between countries
in log income per capita is much less than that in income per capita-and indeed
* W e have used the purchasing power parity (PPP) GDP per capita for this figure. The HDR uses GNP
per capita-which increases the disparity. We prefer the former measure as both the use of domestic,
rather than national, product and applying PPP exchange rates give a more accurate income-based
measure of development.
186 M . McCillivray and H . White
COUNTRY
less than that in the HDI.3 So it is not only the inclusion of health and education
indicators that explains the more equitable spread of the HDI compared to per-
capita income, it is also the use of the log transformation on income itself.
Yet the 1990 Report does not stop with the use of logs. It goes on to impose a
ceiling above which increments in income are assumed to have no value to develop-
ment whatsoever. U p to the cut-off of $4861 the log value of GDP per capita is used
in calculating the index. At any higher income level the log of $4861 is used. This
appears a difficult position to support-would the authors of the Report accept a
marginal rate of taxation of 100 per cent above an income level of $4861 since this
apparently will not damage their welfare?
The 1991 Report conceded that the zero weight on income above the poverty
threshold was ‘too drastic’, and proposed a new treatment in which income below
the threshold was taken at its actual value and above that successive roots (square
root, etc., as shown in equation 4) were taken for each multiple of the poverty
level. In practice this new treatment makes little difference: the cut-off is practically
the horizontal one of the previous year. The first $4829 is valued at its actual value
and the second $4829 at only $139-the 100 per cent tax rate has been replaced
with one of 97 per cent!
The 1991 HDR defends its methodology in Technical Notes 1: reporting the rank
correlation of the HDI with the HDI constructed with different treatments of
income. These correlations are all very high-the lowest being 0.994 if unlogged
income per capita is used with no cut-off. Such high correlations are to be expected:
if the different components of the index are highly correlated then excluding one of
’The Gini coefficient calculated using the log of PPP GDP per capita is only 0.07. compared 0.34 for the
HDI. That using (unlogged) PPP GDP per capita is 0.53 (still rather less than those usually reported that
do not use PPPs, which are typically 0.65-0.70).
The UNDP’s Human Development Index 187
them (even if only over a range of countries, as the capping procedure does) will
not greatly alter the ran king^.^ Two conclusions may therefore be drawn from the
UNDP’s own analysis of the different treatment of income (but which are not
drawn by UNDP themselves): (1) they add support to the position that income is a
good proxy for the other variables; and (2) if a simple treatment yields the same
result as a more complex one then why not adopt the simple treatment?
We conclude that the capping of income should be abandoned-the use of log
GDP per capita is a theoretically appealing and practically transparent alternative.
The UNDP has claimed that the ‘human development index ranks countries very
differently from the way GNP per capita ranks them’ (UNDP, 1990: 14). A number
of previous studies have assessed predecessors of the HDI on this criterion. For
example, Larson and Wilford (1979), using simple correlation coefficients, argued
that the ‘Physical Quality of Life Index’ (a composite of life expectancy, adult literacy
and infant mortality) was redundant in that it ranked countries in a ‘statistically
indistinguishable fashion’ to GNP per capita (Larson and Wilford, 1979: 583).
Similar exercises have been conducted by Hicks and Streeten (1979), who con-
sidered a ‘Development Index’ (a composite of 18 social and economic indicator^)^,
and McGillivray (1991), who looked at the 1990 HDI.
A problem with these studies is ambiguity over the extent of statistical associa-
tion, as measured by a correlation coefficient, that actually determines one variable
redundant with respect to another.6 An arbitrary threshold must be specified which
differentiates redundancy from non-redundancy. In this paper we opt for two
threshold values: 0.90 and 0.70. If the rank correlation coefficient between the HDI
and an existing indicator (in this case GNP per capita) is not significantly less than
0.90 we call this ‘Level 1’ redundancy. If it is not significantly less than 0.70 we call
this ‘Level 2’ redundancy. Clearly, the second of these tests is the more difficult for
the HDI to pass in terms of not being redundant.
Why have we chosen these levels? We believe that 0.90 is sufficiently high to say
that if two variables have a correlation of this order of magnitude then it is difficult
to claim that one is imparting additional information to that given by the first. The
lower threshold of 0.70 is used by existing studies, which imply that a new index is
redundant if the majority of its variation is accounted for by an existing indicator.
This can be formally interpreted as a test that the coefficient of variation be not less
than 0.50, which we convert to a correlation coefficient of 0.70.
Such a high coefficient is also not surprising, since there are over 100 countries below the threshold,
and their ranks change very little (only the denominator of their income term changes). On the other
hand, for those countries whose income is more than three times the threshold the Spearman’s is only
0.74. This rises to 0.95 for those countries whose per-capita GDP is two to three times the threshold.
’For further details of this index. see McGranahan et al. (1972) and for the PQLI see Morris (1979).
Larson and Wilford, using a sample of 150 countries, concluded that the PQLI was redundant vis-u-vis
GNP per capita on the basis of a rank correlation coefficient of 0.776. McGillivray drew the same
conclusion concerning the 1990 HDI and GNP per capita for a sample of 119 countries on the basis of a
rank correlation coefficient of 0.889.
188 M . McGillivray and H . White
A related question concerns the redundancy of the HDI vis-h-vis its own
components. It would be overly demanding to correlate the HDI with each of its
components, since this correlates the HDI partially with itself. Therefore, we
calculated a 'restricted HDI' based on two of its three components only, and
correlated these values with the excluded component. This not only tests the
redundancy of the index vis-a-vis the excluded component, both with and with-
out this component, but also the composition of the HDI. If the component is
correlated at the chosen threshold(s), then its inclusion provides little additional
insight.
Correlation coefficients for the total sample of 160 countries for 1991 and for
subsamples of low, medium and high human development (LHD, MHD and HHD
respectively) are given in Table 1.'
Table 1. Rank correlation coefficients between HDI. its components and GNP per capita
(1991).
'Results were also calculated for 1990 and using industrial and developing country subsamples. These
results, which support the conclusions here. may be found in McGillivray and White (1992).
The U N D P s Human Development Index 189
considering subgroupings rather than all countries. This is unsurprising in that small
variations in rankings have a greater effect in smaller samples. But is it possible that
this result arises from variations in rank that may be explained solely by measure-
ment error?
To investigate this question we constructed ‘true’ values of the HDI by assuming
that the reported value of each of the different indicators differs from the true value
by some random error.8 We then compared the true values with the observed
values for the HDI and each of its components, to determine how much difference
in ranking could arise simply from measurement error.
The results shown in Table 2 assume a 10 per cent error on all variables for all
groups. The changes in rank are not very large-the largest is 18 places. The
overwhelming number of cases move less than five places-resulting in rank
Ranking changes
Maximum 18 14 6 22
Minimum 18 12 -5 -24
Distribution
-16 and below 2 0 0 4
-15 to -6 25 9 0 18
-5 to 5 108 138 160 110
6 to 15 24 13 0 24
16 and above 1 0 0 24
Standard deviation 5.83 3.94 1.31 6.57
Rank correlation
between ‘true’ and
observed values 0.992 9.996 1 .ooo 0.990
~~ ~~~~
LE is life expectancy, EA is educational attainment, and GDP PPP is GDP per capita.
correlation coefficients between true and observed levels of over 0.99 for every
single indicator and the HDI. The HDI thus appears to be robust with respect to
measurement error, a finding that was reinforced by repeating the analysis with
errors of up to k 15 per cent and by calculating the coefficients at the subsample
level (see McGillivray and White, 1992). The greater differences in ranking
observed within subsamples do not therefore simply reflect the effects of measure-
ment error.
The HDI is least redundant, and therefore makes its greatest contribution to assess-
ment of inter-country development levels, when used to compare broadly similar
groups of countries. By contrast, if used to compare all countries it adds little new
information to that provided by per capita income or any of the index’s components
‘ A more detailed account of the procedure may be found in McGillivray and White (1992).
190 M . McGillivray and H . White
(a concept of relative poverty) does not, in our view, offset the advantages to be
gained from having a comparable time-series. We would recommend that maxima
and minima be set for each variable that may be used across a range of reports.’
6 CONCLUSION
We have reviewed the usefulness of the UNDP’s HDI from a number of perspect-
ives. In terms of assessing development levels, we find the HDI to be largely
redundant vis-h-vis GNP per capita at varying degrees for various country samples.
It tends to make its largest contribution when looking at country subsamples
classified according to observed human development levels. The HDI itself and
these conclusions are shown to be robust in the face of possible measurement error.
Attempts to differentiate across the whole range have proved unsuccessful, and we
suspect this will remain the case. Thus, we would suggest the index be given a more
limited application than some of the grandiose claims in the Human Development
Reports. We also make a number of suggestions on improving the index, regarding
weights, composition and comparability.
ACKNOWLEDGEMENTS
An earlier version of this paper was presented at the ESRC Development Eco-
nomics Study Group Annual Conference, Leicester, March 1992. The authors
would like to thank the conference participants, especially Graham Pyatt, for
helpful comments and suggestions. Thanks are also due to participants at a Queen
Elizabeth House, Oxford, seminar, especially Sudhir Anand, Robert Cassen and
Frances Stewart.
REFERENCES
For adult literacy the maximum of 100 per cent presents itself. For the other variables maxima should
be set above the conceivably attainable values over the next, say, 10 years (e.g. 85 for life expectancy).
This will mean that n o country may score the maximum of unity, but this secms no particular loss. The
setting of minima is also problematic-if the lowest current value is used a country may fall below that in
a subsequent period. One alternative is to set minima of zero, the other is to pick values below which it is
believed no country will fall.
192 M . McGillivray and H . White