Lecture 16: Parametric Survival Models: James J. Dignam
Lecture 16: Parametric Survival Models: James J. Dignam
James J. Dignam
h i (t ) = ψ(x i )h 0 (t ) (1)
η i = β1 x 1i + β2 x 2i + · · · + βp x pi (2)
Pp
so that i =1 β j x j i . In matrix notation, η i = βT x i .
The general proportional hazards model then becomes
h i (t ) = exp(β1 x 1i + β2 x 2i + · · · + βp x pi )h 0 (t ) (3)
equivalently,
h i (t )
log( ) = β1 x 1i + β2 x 2i + · · · + βp x pi (4)
h 0 (t )
the proportional hazards model may be regarded as a linear
model for the (natural) logarithm of the hazard ratio.
J. Dignam (UChicago) Lecture 16 Mar. 5, 2020 4 / 49
Proportional hazards (continued)
Weibull distribution:
We now make the additional assumption that the survival times for
the individual in Group I have a Weibull distribution W (λ, γ),
If the lines are not particularly straight, the Weibull model may not
be appropriate. However, it the two curves can be taken to be
parallel, this would mean that the proportional hazards model is
valid, and can use a model (with different hazard or even without
specifying h0 (t )) is appropriate.
Table 1: Survival times (in months) of women with tumors that were
negatively or positively stained for Helix promatia HPA. Censored times are
labeled with an asterisk. From Collett book (Leatham &Brooks Lancet 1987)
Negative staining Positive staining
23 5 68
47 8 71
69 10 76*
70* 13 105*
71* 18 107*
100* 24 109*
101* 26 113
148 26 116*
181 31 118
198* 35 143
208* 40 154*
212* 41 162*
224* 48 188*
50 212*
59 217*
61 225*
. use p r o g n o s i s _ b r e a s t _ c a n c e r . d t a
. s t s e t time s t a t u s
. s t s graph , by ( s t a i n )
stain = 1 stain = 2
. s t s generate s u r v f = s , by ( s t a i n )
. generate cumhazard = − l o g ( s u r v f )
. generate l t i m e = l o g ( t i m e )
2 3 4 5
ltime
W e i b u l l r e g r e s s i o n −− l o g r e l a t i v e − hazard form
No . o f s u b j e c t s = 45 Number o f obs = 45
No . o f f a i l u r e s = 26
Time a t r i s k = 4331
LR c h i 2 ( 1 ) = 4.14
Log l i k e l i h o o d =
− 60.883962 Prob > c h i 2 = 0.0418
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
_ t | Haz . R a t i o Std . E r r . z P> | z | [95% Conf . I n t e r v a l ]
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
2. stain | 2.545372 1.271665 1.87 0.061 .9560751 6.776579
_cons | .0041365 .0037257 − 6.09 0.000 .0007079 .0241707
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
/ l n _ p | − .0646417 .1673746 − 0.39 0.699 − .3926898 .2634064
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
p | .9374033 .1568975 .6752382 1.301355
1/p | 1.066777 .1785513 .7684296 1.480959
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
The default produced by STATA with coefficients expressed as hazard ratios. This form gives estimates
for e β , λ, γ
Option "nohr" express coefficients on the log relative hazard scale, and gives estimates for β, log(λ), γ
NOTE In Stata output: cons = λ, p = γ
Quantiles
The median and other percentiles of the survival time distributions
in the two groups can be estimated from the values of β̂, λ̂ and γ̂.
The estimated p th percentile for those with negative staining is
given by
1 100
t̂ p = { log( )}1/γ̂ (8)
100 λ̂ 100 −p
The estimated p th percentile for those with positive staining is given
by
1 100
t̂ p ={ log( )}1/γ̂ (9)
100
e β̂ λ̂ 100 − p
. s t r e g i . s t a i n , d i s t ( e x p o n e n t i a l ) nolog
f a i l u r e _d : status
analysis time _t : time
E x p o n e n t i a l r e g r e s s i o n −− l o g r e l a t i v e − hazard form
No . o f s u b j e c t s = 45 Number o f obs = 45
No . o f f a i l u r e s = 26
Time a t r i s k = 4331
LR c h i 2 ( 1 ) = 4.36
Log l i k e l i h o o d = − 60.960708 Prob > c h i 2 = 0.0369
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
_ t | Haz . R a t i o Std . E r r . z P> | z | [95% Conf . I n t e r v a l ]
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
2. stain | 2.589922 1.28878 1.91 0.056 .9766015 6.868405
_cons | .0030266 − 12.97 .0013536
0.000 .0012598 .0072716
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
Note that β for stain effect (H R = 2.59) not much different from
Weibull (H R = 2.52)
scale parameter (rate of failure) not much different either at
0.0030266. What is this number and how does it relate to staining?
Note that the simple incidence rates are reproduced by the model
parameters λ and the coefficient for treatment
J. Dignam (UChicago) Lecture 16 Mar. 5, 2020 20 / 49
Fit Exponential Model for Breast Cancer Data
Plotting the curves
. s t c u r v e , s u r v i v a l a t 1 ( s t a i n =1) a t 2 ( s t a i n =2)
Exponential regression
1
.8
Survival
.6
.4
.2
stain=1 stain=2
h i (t ) = exp(β1 x 1i + β2 x 2i + · · · + βp x pi )h 0 (t ) (10)
From the form of (11), the survival time of the i th individual in the study
has a Weibull distribution W (exp(βT x i )λ, γ)
The survivor function corresponding to the hazard function in (11) is
Treatment of hypernephroma
In a study carried out at the University of Oklahoma Health Sciences
Center, data were obtained on the survival times of 36 patients with a
maglignant tumor in the kidney, or hypernephroma.
. use treatment_of_hypernephroma . d t a
. l i s t i n 1/10
+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−+
| nephre~y age time status |
|−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−|
1. | 0 1 9 1 |
2. | 0 1 6 1 |
3. | 0 1 21 1 |
4. | 0 2 15 1 |
5. | 0 2 8 1 |
|−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−|
6. | 0 2 17 1 |
7. | 0 3 12 1 |
8. | 1 1 104 0 |
9. | 1 1 9 1 |
10. | 1 1 56 1 |
+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−+
Let Ag e _2 be the indicator for age being in the range 60-70, and
Ag e _3 be the indicator for age being in the range >70. Then a
Weibull proportional hazard model could be specified to be
f a i l u r e _d : status
analysis time _t : time
W e i b u l l r e g r e s s i o n −− l o g r e l a t i v e − hazard form
No . o f s u b j e c t s = 36 Number o f obs = 36
No . o f f a i l u r e s = 32
Time a t r i s k = 1340
LR c h i 2 ( 3 ) = 17.13
Log l i k e l i h o o d =
− 43.87881 Prob > c h i 2 = 0.0007
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
_ t | Haz . R a t i o Std . E r r . z P> | z | [95% Conf . I n t e r v a l ]
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
nephrectomy | .1919802 .1006891 − 3.15 0.002 .0686785 .5366509
|
age |
2 | 1.085593 .4363889 0.20 0.838 .4937408 2.386903
3 | 5.218136 3.088109 2.79 0.005 1.635956 16.64406
|
_cons | .0170522 .0131496 − 5.28 0.000 .0037617 .0772992
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
/ ln_p | .3438972 .1411602 2.44 0.015 .0672284 .620566
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
p | 1.410434 .199097 1.06954 1.859981
1/p | .7090018 .1000828 .53764 .9349817
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
f a i l u r e _d : status
analysis time _t : time
W e i b u l l r e g r e s s i o n −− l o g r e l a t i v e − hazard form
No . o f s u b j e c t s = 36 Number o f obs = 36
No . o f f a i l u r e s = 32
Time a t r i s k = 1340
LR c h i 2 ( 5 ) = 21.82
Log l i k e l i h o o d = − 41.532133 Prob > c h i 2 = 0.0006
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
_ t | Haz . R a t i o Std . E r r . z P> | z | [95% Conf . I n t e r v a l ]
−−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
_Iage_2 | .9183851 .7500492 − 0.10 0.917 .1852869 4.552028
_Iage_3 | 1.121983 1.297394 0.10 0.921 .1163344 10.82093
nephrectomy | .0875388 .0632624 − 3.37 0.001 .0212351 .3608657
_IageXnephr_2 | 1.128947 1.061025 0.13 0.897 .1789303 7.123004
_IageXnephr_3 | 12.65381 16.80464 1.91 0.056 .9371311 170.8609
_cons | .0187577 .0159487 − 4.68 0.000 .0035436 .0992918
−−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
/ ln_p | .4407051 .1457169 3.02 0.002 .1551052 .7263049
−−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
p | 1.553802 .2264153 1.167781 2.067427
1/p | .6435825 .0937809 .483693 .8563251
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
. d i c h i 2 t a i l ( 2 , 4.693354)
.0956866
Interaction weak, not really needed
S i (t ) = S 0 (t φxi ) (16)
t Np = e β t Sp (19)
100 100
S i (t ) = S 0 (t e βxi ) (20)
. use p r o g n o s i s _ b r e a s t _ c a n c e r . d t a
. s t s e t time s t a t u s
. s t s l i s t , by ( s t a i n )
f a i l u r e _d : status
analysis time _t : time
. clear
P~negat~n P~ p o s i t ~n
1. 47 13
2. 69 26
3. 148 35
4. 181 48
5. end
. graph twoway ( s c a t t e r P e r c e n t i l e _ p o s i t i v e _ s t a i n P e r c e n t i l e _ n e g a t i v e _ s t a i n ) / / /
> ( l f i t Percentile_positive_stain Percentile_negative_stain ) / / /
> , xlabel (0(50)200) ylabel (0(20)60)
60
40
20
0
The points fall on a reasonably straight line roughly through the origin, suggesting that the accelerated
failure time model would not be inappropriate. (However, this conclusion must be regarded with some
caution due to the limited number of points in the graph.)
The slope of the line (a rough estimate for e β is smaller than 1, suggesting that for women whose
tumors were positively stained, the disease process is speeded up relative to those whose tumors were
negatively stained. E.g., the median survival time for those HLA+ tumors is shorter compared to the
median survival time for women with HLA- tumors.
J. Dignam (UChicago) Lecture 16 Mar. 5, 2020 42 / 49
Prognosis for women with breast cancer : Weibull
accelerated failure time model
S 0 (t ) = exp(−λt γ ) (21)
. s t r e g s t a i n , d i s t r i b u t i o n ( w e i b u l l ) nolog t i m e
. . .
W e i b u l l r e g r e s s i o n −− a c c e l e r a t e d f a i l u r e − t i m e form
No . o f s u b j e c t s = 45 Number o f obs = 45
No . o f f a i l u r e s = 26
Time a t r i s k = 4331
LR c h i 2 ( 1 ) = 4.14
Log l i k e l i h o o d =
− 60.883962 Prob > c h i 2 = 0.0418
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
_t | Coef . Std . E r r . z P> | z | [95% Conf . I n t e r v a l ]
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
s t a i n | − .9966647 .5440936 − 1.83 0.067 − 2.063069 .0697391
_cons | 5.854364 .4988778 11.74 0.000 4.876581 6.832146
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
/ l n _ p | − .0646417 .1673746 − 0.39 0.699 − .3926898 .2634064
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
p | .9374033 .1568975 .6752382 1.301355
1/p | 1.066777 .1785513 .7684296 1.480959
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
λ̂ = 0.00413652
The estimated S(t ) for the negative staining group (baseline):
Ŝ 0 (t ) = exp(−λ̂t γ̂ ) (23)
Ŝ i (t ) = Ŝ 0 (t exp(β̂x i )), thus the estimated survivor function for
positive staining group (xi = 1) is given by
Ŝ pos (t ) = Ŝ 0 (t e −.9966647 ) = Ŝ 0 (.3691085t ) (24)
f a i l u r e _d : status
analysis time _t : time
W e i b u l l r e g r e s s i o n −− a c c e l e r a t e d f a i l u r e − t i m e form
No . o f s u b j e c t s = 45 Number o f obs = 45
No . o f f a i l u r e s = 26
Time a t r i s k = 4331
LR c h i 2 ( 1 ) = 4.14
Log l i k e l i h o o d = − 60.883962 Prob > c h i 2 = 0.0418
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
_ t | Time R a t i o Std . E r r . z P> | z | [95% Conf . I n t e r v a l ]
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
stain1 | .3691085 .2008296 − 1.83 0.067 .1270635 1.072228
_cons | 348.753 173.9851 11.74 0.000 131.1814 927.1788
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
/ l n _ p | − .0646417 .1673746 − 0.39 0.699 − .3926898 .2634064
−−−−−−−−−−−−−+−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
p | .9374033 .1568975 .6752382 1.301355
1/p | 1.066777 .1785513 .7684296 1.480959
−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−
stain1 = 0 stain1 = 1
S i (t ) = S 0 (t exp(β1 x 1i + β2 x 2i + · · · + βp x pi )) (25)