Lecture 7
Lecture 7
Origin-Destination flows
James P. LeSage1
University of Toledo
Department of Economics
Toledo, OH 43606
[email protected]
and
R. Kelley Pace
LREC Endowed Chair of Real Estate
Department of Finance
E.J. Ourso College of Business Administration
Louisiana State University
Baton Rouge, LA 70803-6308
OFF: (225)-388-6256, FAX: (225)-334-1227
[email protected], www.spatial-statistics.com
1
The authors would like to acknowledge Randall W. Jackson and Wolfgang
Polasek for helpful discussions on this topic.
Abstract
1
The notion that use of distance functions in conventional spatial interac-
tion models effectively capture spatial dependence in the interregional flows
being analyzed has been challenged in recent work by Porojon (2001) for the
case of international trade flows, Lee and Pace (2004) for retail sales and
unpublished work that utilizes both German and Canadian transportation
network flows. The residuals from conventional models were found to ex-
hibit spatial dependence, which could be exploited to improve the precision
of inference as well as prediction accuracy.
A family of successive spatial filtering models is introduced here that
represent an extension of the spatial regression models introduced in Anselin
(1988). Spatial regression models have served as the workhorse in applied
spatial econometric analysis, and the models introduced here should play
an important role in modeling interregional flow matrices. Another focus
of this study is maximum likelihood and Bayesian estimation of the models
introduced here. We demonstrate how simple extensions of widely available
software algorithms for implementing conventional spatial regression models
can be employed to estimate the models set forth here.
2
l(o) o(o) d(o)
1 1 1
.. ..
. 1 .
n 1 n
.. .. ..
. . . (1)
n2 n + 1 n 1
.. .. ..
. . .
n 2 n n
3
matrix into a variable vector.2 This results in a regression model of the type
shown in (3)
y = + Xd d + Xo o + D + (3)
In (3), the explanatory variable matrices Xd , Xo represent n2 by k ma-
trices containing destination and origin characteristics respectively and the
associated k by 1 parameter vectors are d and o . The vector D denotes
the vectorized origin-destination distance matrix, and , are scalar pa-
rameters. For now we assume N (0, 2 In2 ), but generalizations will be
taken up later.
4
represents a region. We can create a typical n by n first-order contiguity or
m nearest neighbors weight matrix W that reflects relations between the n
destinations/origin regions. This can be repeated using In W to create an
n2 by n2 row-standardized spatial weight matrix that we label Wo , shown
in (4), where 0 represents an n by n matrix of zeros.
W 0 ... 0
..
0 W 0 .
Wo = .. .. (4)
. 0 . 0
0 ... 0 W
Using this matrix to form a spatial lag of the dependent variable, Wo y,
(where Wo = In W with W row-standardized), we capture origin-based
spatial dependence relations using an average of flows from neighbors to each
origin region to each of the destinations. Intuitively, it seems plausible that
forces leading to flows from any origin to a particular destination region may
create similar flows from neighbors to this origin to the same destination.
This is what the spatial lag Wo y captures.
As an example, consider a single row i of the spatial lag vector Wo y
that represents flows from the origin state/region of Florida to the desti-
nation state/region of Washington. First-order contiguous neighbors to the
origin Florida are Alabama and Georgia, and neighbors to the destination
Washington are Oregon and Idaho. The spatial lag Wo y would represent an
average of the flows from Alabama and Georgia (neighbors to the origin) to
the destination state Washington.
A similar interpretation applies to other rows of the spatial lag Wo y.
For example when examining flows from the origin state of Alabama to
the destination state of Washington, the spatial lag Wo y would represent
an average of the flows from Florida, Georgia, Mississippi and Tennessee
(neighbors to the origin) to the destination state Washington.
A second type of spatial dependence that could arise in the gravity model
would be destination-based dependence. Intuitively, it seems plausible
that forces leading to flows from an origin state to a destination state may
create similar flows to nearby or neighboring destinations.
A spatial weight matrix that we label Wd can be constructed to capture
this type of dependence using W In , producing an n2 by n2 spatial weight
matrix that captures connectivity relations between the flows from an origin
state to neighbors of the destination state.
To provide an example of this we consider four regions located in a row
as presented in Table 1.
5
Table 1: Location of 4 Regions in Space
6
of the destination state. One motivation for this matrix product might be
a spatial filtering perspective. We might envision a spatial autoregressive
model of the type shown in (7) based on successive filtering. We transform or
filter the dependent variable successively by (In2 1 Wo ), and (In2 2 Wd ).
The motivation is that we are removing destination dependence first and
subsequently origin dependence, or vice-versa.
y = 1 Wo y + 2 Wd y 1 2 Ww y + + Xd d + Xo o + D + (8)
Using our example of flows from the origin state of Florida to the desti-
nation state of Washington, the spatial lag vector Ww y represent an average
of: flows from Alabama and Georgia (neighbors to the origin state) to Idaho
(a neighbor to the destination state), and flows from Alabama and Geor-
gia (neighbors to the origin state) to Oregon (a neighbor to the destination
state). In the case of our other example based on flows from the origin state
of Alabama to the destination state of Washington, the spatial lag vector
Ww y represent an average of: flows from Florida, Georgia, Mississippi and
Tennessee (neighbors to the origin state) to Idaho (a neighbor to the des-
tination state) and flows from Florida, Georgia, Mississippi and Tennessee
(neighbors to the origin state) to Oregon (a neighbor to the destination
state).
Note, the implementation of this does not require the actual formation of
the n2 by n2 matrices Wo , Wd , or Ww . Given arbitrary, conformable matrices
A, B, C, (C 0 A)vec(B) = vec(ABC) (Horn and Johnson, 1991, p. 255,
Lemma 4.3.1). Since Wo y = (I W )~(Y ), then Wo y = vec(W Y ). Similarly,
Wd = vec(Y W 0 ), and Ww = vec(W Y W 0 ). These expressions also aid in the
interpretation of origin-destination dependence. The algebra of Kronecker
products can be used to form moment matrices without dealing directly
with n2 by n2 matrices. For example, Xd0 i Xoj equals ( Xi )0 (Xj ) =
P P
Xi Xj . Also, the moment sub-matrices involving only origin variables
or destination variables are very simple (Xd0 i Xdj = Xo0 i Xoj = nXi0 Xj ).
In concluding we note that spatial lags involving the disturbance process
could also be constructed using weight matrices Wo , Wd and the product Ww .
This would allow for a model where spatial dependence arises in the error
7
terms of the model. In the successive filtering case this would take the form
in (9).
y = + Xd d + Xo o + D + u (9)
u = (In2 1 Wo )(In2 2 Wd )u +
y = 1 Wo y + 2 Wd y + 3 Ww y + + Xd d + Xo o + D + (10)
8
6. The restriction: 1 = 2 , 3 = 21 = 22 , produces another single
weight matrix model based on Wo + Wd + Ww . This reflects a lack
of separability between the impacts of origin, destination and origin-
destination interaction effects in favor of a cumulative impact.
7. The restriction: 3 = 0, leads to a model with separable origin and
destination autoregressive dependence embodied in the two weight ma-
trices Wo and Wd , while ruling out dependence between neighbors of
the origin and destination locations that would be captured by Ww .
8. The restriction: 3 = 1 2 results in a successive filtering or model
involving both origin Wo , and destination Wd dependence as well as
product separable interaction Ww , constrained to reflect the filter:
(In2 1 Wo )(In2 2 Wd ) = (In2 + 1 Wo + 2 Wd 1 2 Ww ).
9. No restrictions produces the ninth member of the family of models
based on an unrestricted variant of the filter: (In2 1 Wo )(In2
2 Wd ) = (In2 + 1 Wo + 2 Wd + 3 Ww )
Each of the single spatial weight matrix model specifications in 1) to 6)
would obey the usual properties of row-normalized weight matrices, allowing
use of existing algorithms for maximum likelihood (Pace and Barry, 1997),
Bayesian (LeSage, 1997) or generalized method of moments estimation esti-
mation (Kelejian and Prucha, 1999).
We note that specifications 1) to 6) based on single weight matrices are
also amenable to variants of spatial regression models of the type shown in
(11) to (13), which we label SAR, SEM and SAC models, respectively. In
these equations, we use Wj to denote the single spatial weight matrix.3
y = Wj y + + Xd d + Xo o + D + (11)
y = + Xd d + Xo o + D + u (12)
u = Wj u +
y = Wj y + + Xd d + Xo o + D + u (13)
u = Wj u +
We note that use of conventional algorithms for maximum likelihood,
Bayesian or generalized method of moments estimation of the spatial econo-
metric origin-destination interregional flow models becomes difficult as the
3
There are widely available algorithms for estimation of these alternative specifications,
e.g., the spatial econometrics toolbox, www.spatial-econometrics.com and spatial statistics
toolbox, www.spatial-statistics.com.
9
number of observations increases. For example, use of an origin-destination
flow matrix for the sample of approximately 3,100 US counties would re-
sult in sparse spatial weight matrices of dimension n2 by n2 where n2 =
9, 610, 000. Maximum likelihood and Bayesian estimation both require cal-
culation of the logged determinant for the n2 by n2 matrix (In2 Wj ).
While specialized approaches to calculating log-determinants of very large
matrices have been proposed by Pace and LeSage (2004) and Smirnov and
Anselin (2001), it turns out there are much more efficient approaches that
can exploit the special structure of matrices like Wd = In W , Wo = W In
and Ww = Wo Wd = W W . We turn attention to this topic in the next
section.
10
number, would reduce the magnitude of the estimation residuals to a neg-
ligible level. The log-determinant term serves as a penalty to prevent such
pathological transformations from obtaining an advantage in estimation.
Consequently, the likelihood is invariant to such scalings.
The log-determinant of the transformation is the trace of the matrix
logarithm of the transformation, and the Taylor series expansion of this has
a simple form for the positive definite matrix transformation In2 Wj ,
shown in (15).
t
X tr(Wjt )
ln |In2 Wj | = tr (ln(In2 Wj )) = (15)
t=1
t
For the case of destination or origin weight matrices, Wd = In W or
Wo = W In , which we designate Wk , k = o, d,
11
(In2 1 Wo )(In2 2 Wd ) = (In2 1 (In W ))(In2 2 (W In ))
= In2 1 (In W ) 2 (W In ) (18)
+1 2 (W W )
Wf = 1 (In W ) + 2 (W In ) + 3 (W W ) (20)
The case of tr(Wf ) where t = 1 is immediate, and equals zero since
tr(W ) = 0. The case of tr(Wf2 ) is slightly more challenging as shown in
(21).
For the quadratic, there are 9 possible terms and 6 of these are unique.
Note, tr(Wf2 ) is the highest order term associated with W . Extrapolating,
computations of tr(Wft ) only require computing tr(W t ) based on the n by
n weight matrix W , a much less demanding task. Individual terms have the
form in (22).
i1 j2 k3 tr Woi Wdj Wwk = i1 j2 k3 tr(W (i+k) )tr(W (j+k) ) (22)
12
Given a table of tr(W t ) for t = 1 m, each term involves the multiplica-
tion of five scalars. However, there are 3m terms, and this becomes difficult
for large m. Other than computing tr(W t ), none of these computations are
dependent upon n, and so it takes just as long for a problems with many
origins and destinations as for smaller problems. For small n, calculating
exact tr(W t ) requires little time. For large n, calculating tr(W t ) can be
approximate as in Barry and Pace (1999) who show how to do this with an
O(n) algorithm.
Given the m moments and the conditions on W , it becomes easy to
compute an relatively short interval containing the log-determinant as shown
in (23).
m tr(W t )
X Xm tr(W t )
X tr(W m)
f f f
ln |In2 Wf | + (23)
t=1
t t=1
t t=m+1
t
Pace and LeSage (2002) show how the moments tr(Wft ) must monotonically
decline for t > 1, and this sets up the bounds. The interval is narrow
provided (1 + 2 + 3 )m+1 /(m + 1) is reasonably small. A requirement for
stability is that 1 + 2 + 3 < 1, making this a reasonable presumption.
Summarizing, we derived a family of nine model specifications that em-
phasize different spatial connectivity relations between origin and destina-
tion regions. Since members of the family of specifications reflect models
based on parameter restrictions, these can be easily tested to draw infer-
ences regarding the nature of spatial dependence in any applied problem.
Potential computational problems that might plague estimation for models
involving n2 observations on origin-destination flows were eliminated by re-
ducing the troublesome logged determinant calculation to one involving only
traces of n by n matrices. As already noted, successive filtering of the type
described here could also be applied to the disturbance process, producing
a family of nine models of the type we have labelled SEM, or to both the
dependent variable and disturbance vectors resulting in nine more models
of the type we labelled SAC.
13
over from the period covering 1985 to 1990 and flows for the period 1995 to
2000 were used.4 The sample was restricted to the 48 contiguous states plus
the District of Columbia resulting in n = 49 and n2 = 2, 401 observations.
The growth rates for flows of population within each state were set to zero
to emphasize flows between states which should exhibit the type of spatial
dependence of interest here.
Another benefit of the growth rates transformation is alleviation of the
problem noted earlier that arises with flows that are very large within re-
gions relative to many zero values for interregional flows. Figure 1 shows
a histogram of the annualized growth rates in the flows alongside a normal
probably density plot. From the figure we see some evidence of fat tails
reflecting more extremely large or small growth rates than one would expect
in a normal distribution. We also see the impact of setting 49 within-region
flows to zero values. An approach to dealing with the fat-tailed nature of
the distribution of flows during estimation will be illustrated in Section 5.
Explanatory variables for the matrices Xo , Xd for each state were taken
from the 1990 Census, with the exception of the unemployment rate variable,
which was constructed as the ratio of state-level unemployment rates in 1995
to 1990. These variables are documented in Table 2.
The motivation for including the age variables near retirement, and re-
tired is that these should exert an impact on migration decisions. Apriori,
we would expect that retired would increase flows from the origin, whereas
near retirement should decrease flows from both destinations and origins.
We note that an increase in flows is indicated by a positive coefficient esti-
mate and a decrease by a negative estimate.
Population that lived in another state in 1985 might increase flows at
both the origin and destination as this is an indicator of population mobility.
The effect of foreign born population seems unknown apriori, depending on
the mobility of this population relative to the average.
Population holding graduate and professional degrees should be the most
mobile, leading to increased flows at both the origin and destination, whereas
persons with less than ninth grade education should be less mobile. The
impact of associate degrees, college degrees, and sales jobs is less clear.
Rents and unemployment rates should increase flows at the origin and
decrease flows at the destination, whereas per capita income should decrease
flows at the origin and increase flows at the destination. The area variable
4
Available on the internet State-to-State Migration Flows: 1995 to 2000 Cen-
sus 2000 Special Reports. The data are based on a sample. For information
on confidentiality protection, sampling error, nonsampling error, and definitions, see
http://www.census.gov/prod/cen2000/doc/sf3.pdf.
14
was included to control for the impact of variation in the size of the states
on migration growth rates.
In addition to the variables included in the matrices Xo , Xd , the log of
distance from each origin to each destination was included in the model,
along with a constant term.
The family of nine model specifications described in Section 3 were es-
timated using maximum likelihood methods with a numerical hessian ap-
proach used to compute estimates of dispersion and tstatistics. The log-
likelihood function values for the family of nine models are shown in Table 3,
ordered from high to low, along with a likelihood ratio (LR) test of the re-
strictions imposed by each model versus the unrestricted model. It is clear
from the table that the Models 7, 8 and 9 based on the filtering specification
that contains separate spatial weight matrices for the origin and destination
provide a significantly higher likelihood than Models 2 through 6 that use a
single spatial weight matrix. There is also a noticeable drop in the likelihood
values when going from models 5 and 6 to models models 2, 3 and 4. We
note that models 5 and 6 are based on single weight matrices constructed
by summing information from both origin and destination weight matrices,
whereas to models 2, 3 and 4, are based on only and origin or only destina-
tion, or only the interaction weight matrices. This would seem to support
the notion that both origin and destination dependence/connectivity infor-
mation are important.
The LR tests indicate that the Model 7 restriction 3 = 0 does not
significantly reduce the likelihood function value. This restriction eliminates
the weight matrix Ww reflecting connectivity between neighbors to the origin
and neighbors to the destination. We might interpret this result as indicating
these relations are relatively unimportant in explaining growth rates in the
migration flows over our time period. The parameter estimates for the
unrestricted model are presented in Table 4, where we see that the parameter
3 is not significantly different from zero, consistent with the LR test results.
The LR test result for Model 8 based on the restriction that 3 = 1 2 ,
versus the unrestricted model rejects this restriction as consistent with the
sample data, at the 95 percent level.
Finally, it is clear that least-squares which ignores spatial dependence in
the growth rates of the migration flows and assumes these are independent
produces a much lower likelihood function value.
Turning to the parameter estimates from least-squares and the unre-
stricted spatial Model 9, shown in Table 4, we see estimates for 1 = 0.313
and 2 = 0.280, indicating spatial dependence of equal importance between:
neighbors to the origin and the destination, and neighbors to the destina-
15
tion and the origin. As indicated above, the estimate for 3 = 0.0072 is
not significantly different from zero, allowing us to infer that dependence
between neighbors to the origin and neighbors to the destination specified
by the weight matrix Ww is not important.
Turning attention to the parameter estimates, we see that distance is
positive and significant in the least-squares model, but insignificant in the
spatial model. (Distance was insignificant in all 8 spatial models.) Typically,
regression-based gravity models produce a negative influence of distance
on the flows, which seems intuitively appealing, whereas this is not the
case here. We note that if the true data-generating process was in fact a
model containing a spatial lag, then least-squares estimates are biased and
inconsistent (see LeSage and Pace, 2004).
From Table 4, we see many cases where least-squares estimates (in ab-
solute value terms) are larger than those from the spatial model, which is
typical of least-squares, since it attributes variation assigned to the spatial
lags of the dependent variable by the spatial model to explanatory variables.
For example, spatial model estimates for: D nearretirement, D diffstate,
D rents, D associate O college, O gradprof O rents, and O unemp, take on
values around one-half those from least-squares, while D unemployment is
an exception.
Three of the variables exhibit a change from significantly different from
zero to insignificant between the spatial and least-squares models: D sales,
O sales, and distance are all insignificant in the spatial model. Three other
variables change in the level of significance: D college, D unemp, have a
higher level of significance in the spatial model and O unemp has a lower
level of significance.
The estimates from the spatial filtering model indicate that: higher rents,
higher unemployment and more associate degrees lead to higher growth rates
in origin flows, but lower destination flows, as might be expected. Per capita
incomes also have the expected positive impact on destination flows, with
an insignificant impact on origin flows.
Persons near retirement (aged 60 to 64) reduces flows at both the ori-
gin and destination, as do college graduates. Persons with graduate and
professional degrees increase flows at both the origin and destination as do
persons that lived in a different state in 1985 than in 1990. This suggest
these groups reflect higher than average mobility, which seems plausible. In
contrast, foreign-born population reduces flows at both the origin and desti-
nation, suggesting lower mobility. Retired persons (aged 65 to 74) increase
flows only at the origin, having an insignificant impact on the destination.
Finally, area exerts a positive impact on flows at both the origin and
16
destination suggesting that states with larger physical areas exhibit higher
growth rates in migration flows when controlling for other factors.
17
assume that:
Where we note the presence of the logged determinant term as in the case
of maximum likelihood estimation. We can rely on the same algorithms
for rapidly evaluating this expression in the context of Bayesian MCMC
estimation as in maximum likelihood. Sampling for the parameters i , i =
5
See LeSage (2004), pp. 232-233 for the exact expressions needed here.
18
1, 2, 3 is accomplished using expressions similar to (25) in a Metropolis-
Hastings algorithm based on a tuned normal random-walk proposal.
Table 5 presents the posterior means and highest posterior density (HPD)
intervals based on 0.05 and 0.95 percentiles for the parameters of the most
general spatial filtering model. Maximum likelihood estimates are also in-
cluded in the table to facilitate comparison. We see 19 parameters whose
posterior mean is different from zero based on the 0.95 HPD intervals, which
contrasts with 18 such parameters from maximum likelihood estimation us-
ing the 95 percent level of significance. Differences arise for four destination
characteristics: D sales, D foreignborn, D grade9 and D associate; and two
origin characteristics: O nearretirement, and O sales.
There are also differences in the magnitudes of the parameter estimates,
even when both Bayesian and maximum likelihood estimates are signifi-
cantly different from zero. For example, the MCMC estimate of D sales is
twice as large as maximum likelihood, whereas maximum likelihood esti-
mates for O college and O gradprof are about twice those from the robust
Bayesian model. and O rents is twice as large.
Finally, we see evidence of stronger spatial dependence in larger posterior
mean estimates for the parameters 1 and 2 . We note that the 0.05 and
0.95 HPD intervals for these two parameters do not include the maximum
likelihood estimates, suggesting a substantial increase in spatial dependence
when we account for non-constant variance.
Turning attention to the variance scalar estimates, these provide a diag-
nostic for observations (or OD pairs) that do not conform well to the model
relationship. Large estimates for these variances suggest an outlier or aber-
rant observation. Of the 2,401 observations, 2,233 of the posterior mean
values for the scalars vi had values of 3 or less, and 2,354 values of 6 or less.
There were 17 vi estimates whose posterior means exceeded a value of 10,
indicative of exceptionally high or low migration growth rates that could not
be explained by the models origin and destination variables/characteristics
or the spatial autoregressive dependence structure.
Table 6 shows the origin-destination state pairs for the 17 cases where the
posterior mean vi values exceeded 10 along with the mean vi estimate. These
observations would be downweighted by the inverse of the vi values during
MCMC estimation of the robust model. This is in contrast to the maximum
likelihood estimation procedure where all observations are assigned equal
weight. This accounts for the differences between the Bayesian and max-
imum likelihood estimates. From the table, we see that origin-destination
state pairs identified as outliers conform to intuition, reflecting the growth
rates in migration flows from mostly small states to other small states. This
19
is where we would expect to see large variances in the growth rates of mi-
gration flows over the 1985-90 and 1995-2000 periods, likely due to volatility
that arises in growth rates calculated on the basis of small levels of flows.
6 Conclusions
We set forth a method for incorporating spatial autoregressive structures in
conventional regression-based gravity models. This extension allows a family
of conventional spatial regression models that explicitly model the spatial
dependence structure between cross-sectional observations to be employed
in modeling origin-destination flows.
The approach introduced here allows for application of conventional spa-
tial regression algorithms for estimation and inference in the case of small
samples. In addition, we provide a solution for more realistic cases where
large samples involving flows between US counties numbering nearly 10 mil-
lion observations can be estimated in a matter of seconds. This requires only
slight modification to existing algorithms. Much of what we have learned
about maximum likelihood and Bayesian estimation of spatial regression
models can be immediately applied to origin-destination flow modeling.
As an extension to conventional spatial autoregressive and spatial error
models, we introduce a family of models that subsume these conventional
models as a special case. Simple tests of parameter restrictions that produce
varying specifications for spatial dependence can be carried out, resolving
contentious model specification issues that often arise.
Three special issues that arise in origin-destination flow modeling were
discussed: 1) the fat-tailed nature of the distribution of the vectorized flow
matrix; 2) the presence of numerous zeros for large flow matrices reflecting a
lack of interaction between numerous regions in the sample; and 3) the pres-
ence of large flows on the diagonal of the flow matrix reflecting a large degree
of intra-regional connectivity. Solutions to some of these problems may be
possible by drawing on past work from conventional spatial econometrics.
As an illustration, robust Bayesian Markov Chain Monte Carlo estimates
for the model introduced in this study were presented. These estimates ac-
commodate the fat-tailed nature of the distribution of the vectorized flow
matrix.
20
7 References
Besag, J. E., York, J.C. and Mollie, A. (1991). Bayesian image
restoration, with two applications in spatial statistics (with discus-
sion), Annals of the Institute of Statistical Mathematics, Vol. 43, pp.
1-59.
21
Lee,Ming-Long and R. Kelley Pace (2005) Spatial Distribution of
Retail Sales, Journal of Real Estate Finance and Economics, Volume
31, number 1, pp. 53-69.
Pace, R.K, and J.P. LeSage (2004) Techniques for Improved Approx-
imation of the Determinant Term in the Spatial Likelihood Function,
Computational Statistics and Data Analysis, 2004, Volume 45, pp.
179-196.
Sen, Ashish and Tony E. Smith (1995), Gravity Models of Spatial In-
teraction Behavior, Heidelberg: Springer-Verlag.
22
Table 2: Explanatory variables used in the model
23
Table 4: Estimates from least-squares and the unrestricted spatial Model 9
24
Table 5: Estimates from maximum likelihood and robust Bayesian versions
of spatial Model 9
25
Table 6: Origin-Destination pairs for variance scalar estimates greater than
10
26
450
Histogram
Normal plot
400
350
300
250
27
Frequency
200
150
100
50
0
0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3
Figure 1: Distribution of Annualized Migration Growth Rates