Polya Urn Model
Polya Urn Model
The Polya urn model1 is useful in survival analysis situations in which a system undergoes
repeated stresses, but the stresses are not independent of each other. For example, in the study of
the spread of a contagious disease in a closed population, the more infected people there are in
the population, the larger the likelihood that a non-infected person will become infected; i.e., the
more shocks affecting the individual. Polya’s urn model has often been applied to the spread of
contagious diseases.2
Consider an urn initially containing red balls and m white balls. We randomly draw one ball
from the urn, note its color, and replace the ball, along with another ball of the same color. We
repeat this process n times. We want to know the probability distribution for the count of the
number of red balls drawn from the urn in n trials.
Assume that the result holds true for n. Now we draw one more ball from the urn. The
probability that, after the (n+1)st draw, we will have drawn a total of k red balls is given by
P X n1 k P red on n 1 draw | X n k 1 P X n k 1
st
P white on n 1
draw | X n k P X n k .
st
Now, if k – 1 red balls were drawn in n draws, then at the (n+1)st draw, the urn contains
+ k – 1 red balls, and m + n – k + 1 white balls, so that the probability of obtaining a red ball is
k 1
. On the other hand, if k red balls were drawn on the first n draws, then at the (n+1)st
mn
draw, the urn contains + k red balls and m + n – k white balls, so that the probability of
mnk
obtaining a white ball is .
mn
Hence, the probability that k red balls are drawn in n + 1 draws is
k 1 mnk
P X n 1 k P X n k 1 P X n k .
mn mn
1
Eggenberger, F. and Polya, G. (1923). Über die Statistik verketteter Vorgänge. Zeitschrift für
Angewandte Mathematik and Mechanik 1, 297-289.
2
Lundberg, O. (1964). On Random Processes and Their Application to Sickness and Accident
Statistics, 2nd Edition, Almqvist and Wiksells, Stockholm.