Professional Documents
Culture Documents
405 (4,5) PDF
405 (4,5) PDF
Mahbub Latif
Institute of Statistical Research and Training (ISRT)
University Dhaka, Dhaka 1000, Bangladesh
(mlatif@isrt.ac.bd)
October 2023
𝜕 2 ℓ(𝜽)
ℐ(𝜽) = 𝐸[ − ] (5)
𝜕𝜽 𝜕𝜽′
• Observed information matrix
𝜕 ℓ(𝜽) 2
𝐼(𝜽)̂ = − ∣ (6)
𝜕𝜽 𝜕𝜽′ 𝜽=𝜽̂
• Data: {(𝑡𝑖 , 𝛿𝑖 ), 𝑖 = 1, … , 𝑛}
• 𝑡𝑖 is a sample realization of 𝑇𝑖̃ = min(𝑇𝑖 , 𝐶𝑖 )
• lifetime 𝑇𝑖 follows a distribution with pdf 𝑓(𝑡𝑖 ; 𝜽) and the corresponding survivor
function 𝑆(𝑡𝑖 ; 𝜽)
• censoring time 𝐶𝑖 could be random or fixed depending on the censoring
mechanism
• 𝛿𝑖 = 𝐼(𝑇𝑖 ≤ 𝐶𝑖 ), censoring indicator
{(𝑡𝑖 , 𝛿𝑖 ), 𝑖 = 1, … , 𝑛}
• Likelihood function
𝑛
𝛿 1−𝛿𝑖
𝐿(𝜽) = ∏ [𝑓(𝑡𝑖 ; 𝜽] 𝑖 [𝑆(𝑡𝑖 ; 𝜽)]
𝑖=1
𝑛
𝛿 1−𝛿𝑖
= ∏ [𝑆(𝑡𝑖 ; 𝜽) ℎ(𝑡𝑖 ; 𝜽)] 𝑖 [𝑆(𝑡𝑖 ; 𝜽)]
𝑖=1
𝑛
𝛿𝑖
= ∏ [𝑆(𝑡𝑖 ; 𝜽] [ℎ(𝑡𝑖 ; 𝜽)] (7)
𝑖=1
𝑡(1) ⩽ ⋯ ⩽ 𝑡(𝑟)
• The remaining (𝑛 − 𝑟) observations are considered as censored at 𝑡(𝑟)
• Likelihood function
𝑟 𝑛−𝑟
𝐿(𝜽) = [ ∏ 𝑓(𝑡(𝑖) ; 𝜽)] [𝑆(𝑡(𝑟) ; 𝜽)] (8)
𝑖=1
• The exponential distribution is the first lifetime model for which statistical
methodology were extensively developed
• Exact tests can be developed for exponential distribution for certain type of
censoring mechanism
• Exponential distribution assume constant hazard and its use is limited for
analyzing real life problems
• Hazard function
• Survivor function
• 𝑝𝑡ℎ quantile
• Log-likelihood function
𝑛
ℓ(𝜃) = ∑ [ − (𝑡𝑖 /𝜃) − 𝛿𝑖 log(𝜃)]
𝑖=1
1
= − ∑ 𝑡𝑖 − 𝑟 log(𝜃) (13)
𝜃 𝑖
• MLE
𝜕ℓ(𝜃) 1 𝑟
∣ =0 ⇒ ∑ 𝑡𝑖 − = 0
𝜕𝜃 𝜃2̂ 𝑖 𝜃̂
𝜃=𝜃 ̂
𝑛
𝜃 ̂ = ∑ 𝑡𝑖 /𝑟 (14)
𝑖=1
𝜕 2 ℓ(𝜃)
𝐼(𝜃) = −
𝜕𝜃2
𝜕 1 𝑟
= − [ 2 ∑ 𝑡𝑖 − ]
𝜕𝜃 𝜃 𝑖 𝜃
2 𝑟
= 3
∑ 𝑡𝑖 − 2 (15)
𝜃 𝑖 𝜃
̂ asymptotic distribution
• Using 𝜃’s
𝜃 − 𝜃̂ 𝜃 − 𝜃̂
𝑍1 = = √ ∼ 𝒩(0, 1)
̂ −1/2
[𝐼(𝜃)] 𝜃/̂ 𝑟
• For a small sample, 𝑍1 does not approximate the standard normal distribution
very accurately
• For a sample with a small number of uncensored observations, ℓ(𝜃) tends to be
asymmeteric
−1/3
• MLE ̂
𝜙 ̂ = 𝜃−1/3 = [∑𝑖 𝑡𝑖 /𝑟]
• Observed information matrix
3𝑟 9𝑟
𝐼1 (𝜙) = 6𝜙 ∑ 𝑡𝑖 + 2
⇒ 𝐼1 (𝜙)̂ =
𝑖
𝜙 𝜙2̂
• Pivotal quantity
𝜙 − 𝜙̂ 𝜙 − 𝜙̂
𝑍2 = = √ ∼ 𝒩(0, 1)
̂
[𝐼1 (𝜙)]
−1/2
𝜙/̂ 9𝑟
• Likelihood ratio statistic Λ(𝜃) = 2ℓ(𝜃)̂ − 2ℓ(𝜃) can also be used to obtain
confidence interval for 𝜃, where
= −𝑟 − 𝑟 log(𝜃)̂ (19)
• LRT statistic
Λ(𝜃) = 2ℓ(𝜃)̂ − 2ℓ(𝜃)
̂
= 2𝑟[(𝜃/𝜃) ̂
− 1 + log(𝜃/𝜃)]
• Two-sided (1 − 𝛼)100% confidence intervals are obtained as the set of 𝜃 values
that satisfy
Λ(𝜃) ≤ 𝜒2(1),1−𝛼
• Confidence interval for the parameter 𝜃 can also be used to obtain confidence
intervals for a monotone function of 𝜃, such as
𝐿(Data) ⩽ 𝜃 ⩽ 𝑈 (Data),
𝐿(Data) ⩽ 𝜃 ⩽ 𝑈 (Data)
𝑡0 /𝑈 (Data) ⩽ (𝑡0 /𝜃) ⩽ 𝑡0 /𝐿(Data)
−𝑡0 /𝐿(Data) ⩽ (−𝑡0 /𝜃) ⩽ −𝑡0 /𝑈 (Data)
exp ( − 𝑡0 /𝐿(Data)) ⩽ 𝑆(𝑡0 ; 𝜃) ⩽ exp ( − 𝑡0 /𝑈 (Data))
• MLE
∑𝑖 𝑡𝑖 308
𝜃̂ = = = 44.0
𝑟 7
• 95% confidence interval of 𝜃 (Method I)
√
̂ −1/2 ⇒ 𝜃 ̂ ± 𝑧 (𝜃/̂ 𝑟)
𝜃 ̂ ± 𝑧.975 [𝐼(𝜃)] .975
√
⇒ 44.0 ± (1.96)(44.0/ 7)
⇒ 11.4 ⩽ 𝜃 ⩽ 76.6
• 95% CI for 𝜃 can be obtained from the set of 𝜃’s such that
• 95% CI for 𝜃
22.8 to 102.4
Λ 3
0
20 40 60 80 100
θ
95% CI
method lower upper
Normal approx. 11.40 76.60
Sprott 22.69 103.03
LRT 22.80 102.40
• Sprott’s method
2
(𝜙 ̂ − 𝜙) ̂ ̂
𝑊2 (𝜃) = [ ] = (𝜃−1/3 − 𝜃−1/3 )2 𝐼1 (𝜃−1/3 )
̂ −1/2
[𝐼1 (𝜙)]
• LRT
𝑊3 (𝜃) = 2ℓ(𝜃)̂ − 2ℓ(𝜃) = 2𝑟[(𝜃/𝜃)
̂ ̂
− 1 − log (𝜃/𝜃)]
10.0
Wj(θ) 7.5
5.0
2.5
0.0
0 25 50 75 100 125
θ
Figure 1: Comparisons between normal approximations and LRT
• (1 − 𝛼)100% CI for 𝜃
−1/2
𝜃 ̂ ± 𝑧1−𝛼/2 [𝐼(𝜃)]
̂ (22)
• Exact confidence interval for 𝜃 can be derived for uncensored and Type II
censored samples, define
𝑊1 = 𝑛𝑡(1)
𝑊2 = (𝑛 − 1)(𝑡(2) − 𝑡(1) )
⋮
𝑊𝑟 = (𝑛 − 𝑟 + 1)(𝑡(𝑟) − 𝑡(𝑟−1) )
• In general
𝑊𝑖 = (𝑛 − 𝑖 + 1)(𝑡(𝑖) − 𝑡(𝑖−1) ) 𝑖 = 1, … , 𝑟
where
𝜕(𝑔1−1 (𝑤1 , … , 𝑤𝑟 ), … , 𝑔𝑟−1 (𝑤1 , … , 𝑤𝑟 ))
𝐽=
𝜕(𝑤1 , … , 𝑤𝑟 )
𝜕𝑔1−1 (𝑤1 ,…,𝑤𝑟 ) 𝜕𝑔2−1 (𝑤1 ,…,𝑤𝑟 ) 𝜕𝑔𝑟−1 (𝑤1 ,…,𝑤𝑟 )
…
⎡ −1 𝜕𝑤1 𝜕𝑤1 𝜕𝑤1 ⎤
⎢ 𝜕𝑔1 (𝑤1 ,…,𝑤𝑟 ) 𝜕𝑔2−1 (𝑤1 ,…,𝑤𝑟 ) −1
𝜕𝑔𝑟 (𝑤1 ,…,𝑤𝑟 ) ⎥
=⎢ 𝜕𝑤2 𝜕𝑤2 … 𝜕𝑤2
⎥
⎢ −1 ⋮ ⋮ ⋱ ⋮ ⎥
𝜕𝑔1 (𝑤1 ,…,𝑤𝑟 ) 𝜕𝑔2−1 (𝑤1 ,…,𝑤𝑟 ) 𝜕𝑔𝑟−1 (𝑤1 ,…,𝑤𝑟 )
⎣ 𝜕𝑤𝑟 𝜕𝑤𝑟 … 𝜕𝑤𝑟 ⎦
𝑤1
𝑤1 = 𝑔1 (𝑡(1) , … , 𝑡(𝑟) ) = 𝑛𝑡(1) ⇒ 𝑡(1) = 𝑔1−1 (𝑤1 , … , 𝑤𝑟 ) =
𝑛
𝑤2 = 𝑔2 (𝑡(1) , … , 𝑡(𝑟) ) = (𝑛 − 1)(𝑡(2) − 𝑡(1) )
𝑤2 𝑤
⇒ 𝑡(2) = 𝑔2−1 (𝑤1 , … , 𝑤𝑟 ) = + 1
𝑛−1 𝑛
⋮
𝑤𝑟 = 𝑔𝑟 (𝑡(1) , … , 𝑡(𝑟) ) = (𝑛 − 𝑟 + 1)(𝑡(𝑟) − 𝑡(𝑟−1) )
𝑤𝑟 𝑤2 𝑤
⇒ 𝑡(𝑟) = 𝑔𝑟−1 (𝑤1 , … , 𝑤𝑟 ) = +⋯+ + 1
𝑛−𝑟+1 𝑛−1 𝑛
𝑇0 = ∑ 𝑊𝑖 = ∑ 𝑡(𝑖) + (𝑛 − 𝑟)𝑡(𝑟)
𝑖 𝑖
𝑀𝑋 (𝑡) = 𝐸[𝑒𝑡𝑋 ]
1
⎧ 1−𝜃𝑡 for 𝑋 ∼ Exp(𝜃)
{
{
{ 1 𝑟
= ⎨( 1−𝜃𝑡 ) for 𝑋 ∼ Gamma(𝑟, 𝜃)
{
{ 𝑟/2
{( 1 ) for 𝑋 ∼ 𝜒2(𝑟)
⎩ 1−2𝑡
2𝑇0
• Since 𝜃 ∼ 𝜒2(2𝑟) , we can write
2𝑇0
𝑃 (𝜒2(2𝑟), 𝛼/2 ≤ ≤ 𝜒2(2𝑟), 1−𝛼/2 ) = 1 − 𝛼
𝜃
2𝑇0 2𝑇0
2
≤𝜃≤ 2
(27)
𝜒(2𝑟), 1−𝛼/2 𝜒(2𝑟), 𝛼/2
• Homework
• Obtain 95% exact and approximate confidence intervals for 𝜃
• Let 𝑇𝑖𝑗 be the lifetime corresponding to the 𝑗𝑡ℎ subject of the 𝑖𝑡ℎ group
(𝑖 = 1, … , 𝑚; 𝑗 = 1, … , 𝑛𝑖 )
• Assume 𝑇𝑖𝑗 ∼ Exp(𝜃𝑖 ) and the null hypothesis of interest
𝐻0 ∶ 𝜃 1 = ⋯ = 𝜃 𝑚
• Data:
{(𝑡𝑖𝑗 , 𝛿𝑖𝑗 ), 𝑖 = 1, … , 𝑚; 𝑗 = 1, … , 𝑛𝑖 }
𝜕ℓ(𝜽) 𝑇𝑖 𝑟𝑖
∣ =0 ⇒ − = 0 ⇒ 𝜃𝑖̂ = (𝑇𝑖 /𝑟𝑖 )
𝜕𝜃𝑖 ̂
2
𝜃𝑖 𝜃𝑖̂
𝜃𝑖 =𝜃𝑖̂
• Under
𝐻0 ∶ 𝜃1 = ⋯ = 𝜃𝑚 = 𝜃 (say)
log-likelihood function
Λ = 2ℓ(𝜽)̂ − 2ℓ(𝜃)̃
= 2 ∑ [ − (𝑇𝑖 /𝜃𝑖̂ ) − 𝑟𝑖 log(𝜃𝑖̂ ) + (𝑇𝑖 /𝜃)̃ + 𝑟𝑖 log(𝜃)]
̃
𝑖
= 2 ∑ [ − 𝑟𝑖 log(𝜃𝑖̂ ) + 𝑟𝑖 log(𝜃)]
̃
𝑖
• Homework
• Test 𝐻0 ∶ 𝜃1 = ⋯ = 𝜃4
𝑇𝑖 ∼ Exp(𝜃𝑖 ), 𝑖 = 1, 2
log 𝜃1̂ − log 𝜃2̂ = log(𝜃1̂ /𝜃2̂ ) ∼ 𝒩( log(𝜃1 /𝜃2 ), (𝑟1−1 + 𝑟2−1 )) (31)
• Null hypothesis
• Test statistic
log(𝜃1̂ /𝜃2̂ ) − log(𝜃1 /𝜃2 )
𝑍= 1/2
∼ 𝒩(0, 1)
(𝑟1−1 + 𝑟2−1 )
• LRT statistics can also be used to obtain confidence interval of (𝜃1 /𝜃2 ),
consider the null hypothesis for 𝑎 > 0
𝐻0 ∶ 𝜃1 = 𝑎𝜃2
• MLE under 𝐻0
𝑇1 + 𝑎𝑇2 𝑇 + 𝑎𝑇2
𝜃2̃ = ⇒ 𝜃1̃ = 𝑎𝜃2̃ = 1
𝑎(𝑟1 + 𝑟2 ) (𝑟1 + 𝑟2 )
where
ℓ(𝜃1̂ , 𝜃2̂ ) = − ∑ [(𝑇𝑖 /𝜃𝑖̂ ) + 𝑟𝑖 log(𝜃𝑖̂ )]
𝑖
where
ℓ(𝜃1̂ , 𝜃2̂ ) = −(𝑟1 + 𝑟2 ) − 𝑟1 log(𝜃1̂ ) − 𝑟2 log(𝜃2̂ )
𝑇 𝑇
ℓ(𝜃1̃ , 𝜃2̃ ) = − 1 − 2 − 𝑟1 log(𝜃1̃ ) − 𝑟2 log(𝜃2̃ )
𝜃̃
1 𝜃̃ 2
= −(𝑟1 + 𝑟2 ) − 𝑟1 log(𝜃1̃ ) − 𝑟2 log(𝜃2̃ )
• (1 − 𝛼)100% confidence interval for (𝜃1 /𝜃2 ) can be construed from the values
𝑎 that satisfy
• For type I censored sample, LRT statistics based confidence interval for (𝜃1 /𝜃2 )
(34) is more accurate than that of normal approximation (32) for small samples
• Unrestricted MLEs
𝑇1 𝑇
𝜃1̂ = = 70 and 𝜃2̂ = 2 = 54
𝑟1 𝑟2
−0.617 < log(𝜃1 /𝜃2 ) < 1.136 ⇒ 0.54 < (𝜃1 /𝜃2 ) < 3.114
𝐻0 ∶ 𝜃1 = 𝑎𝜃2
• Estimates under 𝐻0
𝑇1 + 𝑎𝑇2 𝑇 + 𝑎𝑇2
𝜃2̃ = ⇒ 𝜃1̃ = 𝑎𝜃2̃ = 1
𝑎(𝑟1 + 𝑟2 ) (𝑟1 + 𝑟2 )
3
Λ(a)
0
0 1 2 3
a
• 95% of (𝜃1 /𝜃2 ) is the range of values of 𝑎 = (𝜃1 /𝜃2 ), such that
which is
0.56 < 𝑎 < 3.15 ⇒ 0.56 < (𝜃1 /𝜃2 ) < 3.15
Table 3: 95% confidence interval of (𝜃1 /𝜃2 ) for Type I censored sample
95% CI
method lower upper
Normal approximation 0.54 3.114
LRT 0.56 3.150
• Likelihood function
2
𝐿(𝜃1 , 𝜃2 ) = ∏(1/𝜃𝑖 )𝑟𝑖 𝑒− ∑𝑗 𝑡(𝑖𝑗) /𝜃𝑖 𝑒−(𝑛𝑖 −𝑟𝑖 )𝑡(𝑖𝑟𝑖 ) /𝜃𝑖
𝑖=1
2
= ∏(1/𝜃𝑖 )𝑟𝑖 𝑒−(1/𝜃𝑖 )[ ∑𝑗 𝑡(𝑖𝑗) −(𝑛𝑖 −𝑟𝑖 )𝑡(𝑖𝑟𝑖 ) ]
𝑖=1
2
= ∏(1/𝜃𝑖 )𝑟𝑖 𝑒−𝑇𝑖 /𝜃𝑖
𝑖=1
2𝑇𝑖 2𝑟 𝜃 ̂
= 𝑖 𝑖 ∼ 𝜒2(2𝑟𝑖 )
𝜃𝑖 𝜃𝑖
• We can write
𝜃1̂ 𝜃2
𝑃 (𝐹(2𝑟1 ,2𝑟2 ),𝛼/2 ≤ ≤ 𝐹(2𝑟1 ,2𝑟2 ),1−𝛼/2 ) = 1 − 𝛼
𝜃̂ 𝜃
2 1
−𝑛𝑘̂ 𝑛𝑡 ̄
+ 2 =0 (39)
𝛼̂ 𝛼̂
−𝑛𝜓(𝑘)̂ − 𝑛 log 𝛼̂ + 𝑛 log 𝑡 ̃ = 0 (40)
−𝑛𝑘̂ 𝑛𝑡 ̄ 𝑡̄
+ 2 = 0 ⇒ 𝛼̂ = (41)
𝛼̂ 𝛼̂ 𝑘̂
• Score functions
𝜕ℓ(𝛼, 𝑘) −𝑛𝑘 𝑛𝑡 ̄
𝑈1 (𝛼, 𝑘) = = + 2
𝜕𝛼 𝛼 𝛼
𝜕ℓ(𝛼, 𝑘)
𝑈2 (𝛼, 𝑘) = = −𝑛𝜓(𝑘) − 𝑛 log 𝛼 + 𝑛 log 𝑡 ̃
𝜕𝑘
• Score vector
𝑈1 (𝛼, 𝑘)
𝑈 (𝛼, 𝑘) = [ ]
𝑈2 (𝛼, 𝑘)
• Hessian matrix
𝐻11 (𝛼, 𝑘) 𝐻12 (𝛼, 𝑘)
𝐻(𝛼, 𝑘) = [ ]
𝐻21 (𝛼, 𝑘) 𝐻22 (𝛼, 𝑘)
• Score vector and information matrix are function of parameters and data
• The estimate 𝜽(2) can be obtained by using 𝜽(1) as input in equation (44)
• Repeating the procedure of evaluating the equation (44) using the current
estimate, the following sequence of estimates can be obtained
{𝜽(𝑗) , 𝑗 = 3, 4, 5, … }
• Convergence criteria are defined on the basis of two successive values of the
parameters estimates
′
• 𝜽(𝑗) = (𝛼,̂ 𝑘)̂ is considered as MLE if one of the following criteria is satisfied
• ∣𝜽(𝑗) − 𝜽(𝑗−1) ∣ is very small
• ∣ℓ(𝜽(𝑗) ) − ℓ(𝜽(𝑗−1) )∣ is very small
• 𝑈 (𝜽𝑗 ) ≈ 𝟎
′
• Estimated variance-variance matrix of the MLE (𝛼,̂ 𝑘)̂ can be obtained from
the inverse of the negative of the hessian matrix evaluated at the MLE
−1
̂ (𝛼)
Var
̂ ̂
̂𝑘 = −[𝐻(𝛼,̂ 𝑘)]
• Since 𝛼 > 0 and 𝑘 > 0, confidence intervals based on the sampling distribution
of log 𝛼̂ and log 𝑘̂ ensure non-negative lower limit of the confidence interval
Following are survival time of 20 male rates that were exposed to a high level of
radiation
> rtime
## 152 152 115 109 137 88 94 77 160 165
## 125 40 128 123 136 101 62 153 83 69
−100.25
−100.50
−100.75
logL
−101.00
−101.25
−101.50
7 8 9 10 11
k
Figure 2: Plot of log-likelihood function against shape parameter 𝑘 for different values of scale
parameter 𝛼
−100.5
−101.0
logL
−101.5
−102.0
10.0 12.5 15.0
α
Figure 3: Plot of log-likelihood function against scale parameter 𝛼 for different values of shape
parameter 𝑘
10
9
k
7
11 12 13 14 15
α
Figure 4: Contour plot of log-likelihood function of gamma distribution for rat survival data
8.9
8.8
k
8.7
8.6
12.7 12.8 12.9 13.0
α
Figure 5: Contour plot of log-likelihood function of gamma distribution for rat survival data
• Initial values of the parameters can be selected from the exploratory plots of
log-likelihood function
• The log-likelihood function must provide finite values with the initial values of
the parameters
• As an example, assume the initial values 𝛼0 = 12.9 and 𝑘0 = 8.8, and the
corresponding value of log-likelihood function
> gamma_loglk(par = c(12.9, 8.8), time = rtime)
## [1] -100.48
• Since both sets of initial values van provide finite values of log-likelihood
function, we can use either of these two as an initial value for optimizing the
log-likelihood function using the R function optim()
9
k
11 12 13 14 15
α
Figure 6: Contour plot of 𝑟(𝛼, 𝑘) for rat survival data
• Approximate (1 − 𝑝)100% confidence region for (𝛼, 𝑘)′ can be obtained from
the set of points (𝛼, 𝑘) satisfying
Λ(𝛼, 𝑘) ≤ 𝜒2(2),1−𝑝
k
8
10 15 20
α
Figure 7: 95% confidence region of (𝛼, 𝑘) for rat survival data
𝐻0 ∶ 𝑘 = 𝑘 0
𝛼(𝑘
̃ 0 ) = arg max ℓ(𝛼, 𝑘0 ) under 𝐻0 (51)
𝛼∈Θ𝛼
Λk
0
0 5 10 15 20
k0
𝐻0 ∶ 𝛼 = 𝛼 0
Λα
0
0 10 20 30
α0
• For 𝛼
no-transformation ∶ 4.84 ≤ 𝛼 ≤ 20.95
log-transformation ∶ 6.91 ≤ 𝛼 ≤ 24.08
LRT ∶ 7.37 ≤ 𝛼 ≤ 25.9
• For 𝑘
no-transformation ∶ 3.46 ≤ 𝑘 ≤ 14.13
log-transformation ∶ 4.79 ≤ 𝑘 ≤ 16.14
LRT ∶ 4.54 ≤ 𝑘 ≤ 15.28
152 0 125 1
152 0 40 1
115 1 128 1
109 1 123 1
137 1 136 1
88 1 101 1
94 1 62 1
77 1 153 0
160 0 83 1
165 0 69 1
Λα
0
0 20 40 60
α0
Λk
0
0 5 10 15
k0
• 95% CI for 𝛼
no-transformation ∶ 4.44 ≤ 𝛼 ≤ 38.17
log-transformation ∶ 9.65 ≤ 𝛼 ≤ 47.02
LRT ∶ 10.7 ≤ 𝛼 ≤ 52.48
• 95% CI for 𝑘
no-transformation ∶ 3.46 ≤ 𝑘 ≤ 14.13
log-transformation ∶ 4.79 ≤ 𝑘 ≤ 16.14
LRT ∶ 2.78 ≤ 𝑘 ≤ 10.9
• Estimate of median
Mahbub Latif
Institute of Statistical Research and Training (ISRT)
University Dhaka, Dhaka 1000, Bangladesh
(mlatif@isrt.ac.bd)
November 2023
• 𝑦𝑖 = log 𝑡𝑖
• 𝑓0 (𝑧) = −𝑑𝑆0 (𝑧)/𝑑𝑧 → pdf
𝑛
• 𝑟 = ∑𝑖=1 𝛿𝑖
𝑛
𝜕ℓ(𝑢, 𝑏) 𝜕ℓ (𝑧 , 𝛿 ) 𝜕𝑧
=∑ 𝑖 𝑖 𝑖 × 𝑖
𝜕𝑢 𝑖=1
𝜕𝑧𝑖 𝜕𝑢
𝑛
𝜕ℓ𝑖 (𝑧𝑖 , 𝛿𝑖 ) −1
=∑ ×( )
𝑖=1
𝜕𝑧𝑖 𝑏
𝑛
ℓ(𝑢, 𝑏) = −𝑟 log 𝑏 + ∑ [𝛿𝑖 log 𝑓0 (𝑧𝑖 ) + (1 − 𝛿𝑖 ) log 𝑆0 (𝑧𝑖 )]
𝑖=1
𝑛
𝜕ℓ(𝑢, 𝑏) 𝑟 𝜕ℓ (𝑧 , 𝛿 ) 𝜕𝑧
=− +∑ 𝑖 𝑖 𝑖 × 𝑖
𝜕𝑏 𝑏 𝑖=1 𝜕𝑧𝑖 𝜕𝑏
𝑛
𝑟 𝜕ℓ (𝑧 , 𝛿 ) −𝑧
= − + ∑ 𝑖 𝑖 𝑖 × ( 𝑖)
𝑏 𝑖=1 𝜕𝑧𝑖 𝑏
𝜕 2 ℓ(𝑢, 𝑏) 𝜕 𝜕ℓ(𝑢, 𝑏)
= [ ]
𝜕𝑢2 𝜕𝑢 𝜕𝑢
1 𝑛 𝜕 2 log 𝑓0 (𝑧𝑖 ) 𝜕 2 log 𝑆0 (𝑧𝑖 )
= ∑ [𝛿 + (1 − 𝛿 ) ]
𝑏2 𝑖=1 𝑖 𝜕𝑧𝑖2 𝑖
𝜕𝑧𝑖2
𝜕ℓ(𝑢,𝑏)
𝜕𝑢
𝑈 (𝑢, 𝑏) = [ ]
𝜕ℓ(𝑢,𝑏)
𝜕𝑏
𝜕 2 ℓ(𝑢,𝑏) 𝜕 2 ℓ(𝑢,𝑏)
2
𝐼(𝑢, 𝑏) = −𝐻(𝑢, 𝑏) = − [ 𝜕 2𝜕𝑢
ℓ(𝑢,𝑏)
𝜕𝑢 𝜕𝑏
𝜕 2 ℓ(𝑢,𝑏)
]
𝜕𝑏 𝜕𝑢 𝜕𝑏2
• MLE
• Variance-covariance matrix
−1
var(𝑢,̂ 𝑏)̂ = [𝐼(𝑢,̂ 𝑏)]
̂ = 𝑉̂
• Sampling distribution
𝑢̂ −1
𝑢 ̂
( ̂ ) ∼ 𝒩2 ([ ] , [𝐼(𝑢,̂ 𝑏)] )
𝑏 𝑏
• For a large 𝑛, (𝑢,̂ 𝑏)̂ ′ follows a bivariate normal distribution with mean (𝑢, 𝑏)′
and variance matrix 𝑉 ̂
• Standard error of 𝑢̂ and 𝑏̂ can be obtained from the diagonal elements of 𝑉 ̂
𝑢̂ − 𝑢 𝑏̂ − 𝑏 log 𝑏̂ − log 𝑏
𝑍1 = , 𝑍2 = , 𝑍2′ =
𝑠𝑒(𝑢)̂ 𝑠𝑒(𝑏)̂ 𝑠𝑒(log 𝑏)̂
𝑢̂ ± 𝑧1−𝑝/2 𝑠𝑒(𝑢)̂
𝑏̂ ± 𝑧1−𝑝/2 𝑠𝑒(𝑏)̂
𝑏̂ exp{±𝑧1−𝑝/2 𝑠𝑒(log 𝑏)}
̂
̂ 𝑝
𝑦𝑝̂ = 𝑢̂ + 𝑏𝑤
̂ + 𝑤𝑝2 𝑉22
𝑠𝑒(𝑦𝑝̂ ) = √𝑉11 ̂ + 2𝑤𝑝 𝑉12
̂
• Pivotal quantity
𝑦𝑝̂ − 𝑦𝑝
𝑍𝑝 = ∼ 𝒩(0, 1)
𝑠𝑒(𝑦𝑝̂ )
• (1 − 𝑞)100% confidence interval for 𝑦𝑝
where
𝜕𝑔 𝜕𝑔 𝜕𝑔 ′
𝐚= =( ,…, )
𝜕𝜽 𝜕𝜃1 𝜕𝜃𝑘
Theorem
Let 𝑇1𝑛 , … , 𝑇𝑘𝑛 be statistics and 𝑔𝑖 (𝑥1 , … , 𝑥𝑘 ), 𝑖 = 1, … , 𝑝, be functions, all of whose
first derivatives exist.
The joint distribution of
𝑝
√
{ 𝑛[𝑔𝑖 (𝑇1𝑛 , … , 𝑇𝑘𝑛 ) − 𝑔𝑖 (𝜃1 , … , 𝜃𝑘 )]}
𝑖=1
𝜓̂ − 𝜓
𝑍0 = ∼ 𝒩(0, 1)
𝑠𝑒(𝜓)̂
• Since 𝜓 ̂ is a function of 𝑢̂ and 𝑏,̂ using the statement of Theorem of the slide
[23], we can write
1/2
𝑠𝑒(𝜓)̂ = [𝐚′ 𝑉 ̂ 𝐚] ,
where 𝑉 ̂ is the covariance matrix of (𝑢,̂ 𝑏)̂ ′ and
𝜕𝜓 𝜕𝜓 ′
𝐚=( , ) = ( − 1/𝑏,̂ −𝜓/̂ 𝑏)
̂
𝜕𝑢 𝜕𝑏 𝑢=𝑢,𝑏=
̂ 𝑏̂
𝑆0 (𝜓 ̂ − 𝑧1−𝑝/2 𝑠𝑒(𝜓))
̂ ≤ 𝑆(𝑦0 ) ≤ 𝑆0 (𝜓 ̂ + 𝑧1−𝑝/2 𝑠𝑒(𝜓))
̂
• MLEs
(𝑢,̂ 𝑏)̂ ′ = arg max ℓ(𝑢, 𝑏) unrestricted
(𝑢,𝑏)′ ∈Θ
Λ1 (𝑢0 ) ≤ 𝜒2(1),1−𝑝
̂ 1 (𝑢0 ) ≤ 𝜒2(1),1−𝑝 }
{𝑢0 ∶ 𝐼(𝑢0 ≤ 𝑢)Λ
̂ 1 (𝑢0 ) ≤ 𝜒2(1),1−𝑝 }
{𝑢0 ∶ 𝐼(𝑢0 ≥ 𝑢)Λ
• Obtain the expression of likelihood ratio test statistic based confidence interval
for the scale parameter 𝑏
𝑦𝑝 = 𝑢 + 𝑤𝑝 𝑏, 𝑤ℎ𝑒𝑟𝑒 𝑤𝑝 = 𝑆0−1 (1 − 𝑝)
𝐻0 ∶ 𝑦𝑝 = 𝑦𝑝0
3. 𝑢̃ = 𝑦𝑝0 − 𝑤𝑝 𝑏̃
• (1 − 𝑞)100% Confidence interval for 𝑦𝑝 can be obtained from the set of 𝑦𝑝0
values such that
Λ(𝑦𝑝0 ) ≤ 𝜒2(1),1−𝑞
Mahbub Latif Lifetime Data Analysis November 2023 33 / 185
Likelihood ratio procedures for survival quantity
𝐻0 ∶ 𝑆(𝑦0 ) = 𝑠0
• The same likelihood ratio statistic (8) can be used to test the hypothesis
• The (1 − 𝑝)100% confidence interval for 𝑆(𝑦0 ) can be defined as the set of 𝑠0
values such that
• The likelihood ratio procedure can provide quite accurate confidence intervals
when the number of failures is about 20 or more
• Two-sided intervals perform better than one-sided intervals as the former giving
more closer to nominal coverage than the other
• Censored sample
{(𝑡𝑖 , 𝛿𝑖 ), 𝑖 = 1, … , 𝑛}
• Define
𝑦𝑖 = log 𝑡𝑖 and 𝑧𝑖 = (𝑦𝑖 − 𝑢)/𝑏
𝑆0 (𝑧) = exp(−𝑒𝑧 )
𝑑
𝑓0 (𝑧) = − 𝑆 (𝑧) = exp(𝑧 − 𝑒𝑧 )
𝑑𝑧 0
𝑛
ℓ(𝑢, 𝑏) = −𝑟 log 𝑏 + ∑ [𝛿𝑖 log 𝑓0 (𝑧𝑖 ) + (1 − 𝛿𝑖 ) log 𝑆0 (𝑧𝑖 )]
𝑖=1
• 𝑟 = ∑ 𝑖 𝛿𝑖
• General expressions
• MLEs 𝑢̂ and 𝑏̂
𝜕ℓ(𝑢, 𝑏)
∣ =0
𝜕𝑢 𝑢=𝑢,̂ 𝑏=𝑏̂
𝑛 𝑛
1
− [𝑟 − ∑ 𝑒 ] = 0 ⇒ 𝑟 = ∑ 𝑒𝑧𝑖̂
𝑧𝑖̂
(16)
𝑏̂ 𝑖=1 𝑖=1
• ̂ 𝑏̂
𝑧𝑖̂ = (𝑦𝑖 − 𝑢)/
𝑟 1 𝑛
= − − ∑ 𝑧𝑖 [𝛿𝑖 (1 − 𝑒𝑧𝑖 ) − (1 − 𝛿𝑖 )𝑒𝑧𝑖 ]
𝑏 𝑏 𝑖=1
𝑟 1 𝑛
= − − ∑ 𝑧𝑖 [𝛿𝑖 − 𝑒𝑧𝑖 ] (17)
𝑏 𝑏 𝑖=1
𝜕ℓ(𝑢, 𝑏)
∣ =0
𝜕𝑏 𝑢=𝑢,̂ 𝑏=𝑏̂
𝑟 1 𝑛 𝑛
− − ∑ 𝑧𝑖̂ (𝛿𝑖 − 𝑒𝑧𝑖̂ ) = 0 ⇒ 𝑟 = − ∑ 𝑧𝑖̂ (𝛿𝑖 − 𝑒𝑧𝑖̂ ) (18)
𝑏̂ 𝑏̂ 𝑖=1 𝑖=1
where
𝜕 2 log 𝑓0 (𝑧) 𝜕 𝑧 𝑧 𝜕 2 log 𝑆0 (𝑧)
= {1 − 𝑒 } = −𝑒 =
𝜕𝑧 2 𝜕𝑧 𝜕𝑧2
𝜕 2 ℓ(𝑢, 𝑏) 1 𝑛 𝑧𝑖̂ 𝑟
∣ = − ∑𝑒 = − (19)
𝜕𝑏 2 ̂ 2
𝑏 𝑖=1 𝑏̂ 2
𝑢=𝑢,̂ 𝑏=𝑏̂
𝑟 2 𝑛 𝑧𝑖 𝑧𝑖 1 𝑛 2 𝑧𝑖
= 2 + 2 ∑ 𝑧𝑖 [𝛿𝑖 (1 − 𝑒 ) − (1 − 𝛿𝑖 )𝑒 ] − 2 ∑ 𝑧𝑖 𝑒
𝑏 𝑏 𝑖=1 𝑏 𝑖=1
𝑟 2 𝑛 𝑧𝑖 1 𝑛 2 𝑧𝑖
= 2 + 2 ∑ 𝑧𝑖 [𝛿𝑖 − 𝑒 ] − 2 ∑ 𝑧𝑖 𝑒
𝑏 𝑏 𝑖=1 𝑏 𝑖=1
𝜕 2 ℓ(𝑢, 𝑏) 𝑟 1 𝑛 2 𝑧𝑖̂
∣ = − − ∑ 𝑧𝑖̂ 𝑒 (20)
𝜕𝑏2 𝑢=𝑢,̂ 𝑏=𝑏̂ 𝑏̂2 𝑏̂2 𝑖=1
Mahbub Latif Lifetime Data Analysis November 2023 50 / 185
𝜕 2 ℓ(𝑢, 𝑏) 1 𝑛 𝜕 log 𝑓0 (𝑧𝑖 ) 𝜕 log 𝑆0 (𝑧𝑖 )
= 2 ∑ [𝛿𝑖 + (1 − 𝛿𝑖 ) ]
𝜕𝑢 𝜕𝑏 𝑏 𝑖=1 𝜕𝑧𝑖 𝜕𝑧𝑖
1 𝑛
= 2 ∑ [𝛿𝑖 (1 − 𝑒𝑧𝑖 ) − (1 − 𝛿𝑖 )𝑒𝑧𝑖 − 𝑧𝑖 𝑒𝑧𝑖 ]
𝑏 𝑖=1
1 𝑛 1 𝑛 𝑛
= 2 ∑ [𝛿𝑖 − 𝑒 − 𝑧𝑖 𝑒 ] = 2 [𝑟 − ∑ 𝑒 − ∑ 𝑧𝑖 𝑒𝑧𝑖 ]
𝑧𝑖 𝑧𝑖 𝑧𝑖
𝑏 𝑖=1 𝑏 𝑖=1 𝑖=1
𝜕 2 ℓ(𝑢, 𝑏) 1 𝑛
∣ = − ∑ 𝑧𝑖̂ 𝑒𝑧𝑖̂ (21)
𝜕𝑢 𝜕𝑏 𝑢=𝑢,̂ 𝑏=𝑏̂ 𝑏̂2 𝑖=1
Mahbub Latif Lifetime Data Analysis November 2023 51 / 185
Covariance matrix
• Hessian matrix at MLEs 𝑢̂ and 𝑏̂
𝑛
1 𝑟 ∑𝑖=1 𝑧𝑖̂ 𝑒𝑧𝑖̂
̂
𝐻(𝑢,̂ 𝑏) = − [ 𝑛 ]
𝑛
𝑏̂2 ∑𝑖=1 𝑧𝑖̂ 𝑒𝑧𝑖̂ 𝑟 + ∑𝑖=1 𝑧𝑖2̂ 𝑒𝑧𝑖̂
• Covariance matrix of (𝛼,̂ 𝛽)̂ ′ (using delta method defined in the slide [24])
where
𝜕𝑔1 (𝑢,𝑏) 𝜕𝑔1 (𝑢,𝑏)
𝑒𝑢̂ 0
𝐺= [ 𝜕𝑔 𝑑𝑢 𝜕𝑏
] =[ ]
2 (𝑢,𝑏) 𝜕𝑔2 (𝑢,𝑏) 0 − 𝑏1̂2
𝜕𝑢 𝜕𝑏
• 𝛼 = 𝑔1 (𝑢, 𝑏) = 𝑒𝑢
• 𝛽 = 𝑔2 (𝑢, 𝑏) = (1/𝑏)
Mahbub Latif Lifetime Data Analysis November 2023 53 / 185
Confidence intervals
𝑢̂ ± 𝑧1−𝑝/2 𝑠𝑒(𝑢)̂
𝑏̂ ± 𝑧1−𝑝/2 𝑠𝑒(𝑏)̂
• MLE of 𝑢 under 𝐻0 ∶ 𝑏 = 𝑏0
𝑛
𝜕ℓ(𝑢, 𝑏0 ) 1
∣ = 0 ⇒ − [𝑟 − ∑ 𝑒(𝑦𝑖 −𝑢)/𝑏
̃ 0
]=0
𝜕𝑢 𝑏0 𝑖=1
𝑢=𝑢̃
1 𝑛 𝑦𝑖 /𝑏0
⇒ 𝑢(𝑏
̃ 0 ) = 𝑏0 log [ ∑ 𝑒 ]
𝑟 𝑖=1
• LRT statistics
Λ(𝑏0 ) = 2ℓ(𝑢,̂ 𝑏)̂ − 2ℓ(𝑢(𝑏
̃ 0 ), 𝑏0 )
• 100(1 − 𝑝)% CI for 𝑏 is defined by the set of 𝑏0 values such that
Λ1 (𝑏0 ) ≤ 𝜒2(1),1−𝑝
• Similarly, confidence interval for 𝑢 can be obtained using the corresponding LRT
statistics (Homework)
𝑦𝑝̂ = 𝑢̂ + 𝑤𝑝 𝑏̂
1
𝑣𝑎𝑟(𝑦𝑝̂ ) = [1 𝑤𝑝 ] 𝑉 ̂ [ ̂ + 𝑉22
] = 𝑉11 ̂ 𝑤𝑝2 + 2𝑉12
̂ 𝑤𝑝
𝑤𝑝
Λ(𝑦𝑝0 ) ≤ 𝜒2(1),1−𝑞
• (1 − 𝑝)100% CI for 𝜓
• Leukemia remission time data were given in Example 1.1.7 and used as an
example for the non-parametric methods (e.g. Kaplan-Meier method) described
in Chapter 3
• Two groups of patients (6MP and placebo), each group has 21 patients, were
followed up to observed either remission or censoring times (in weeks)
drug status n
6-MP 0 12
6-MP 1 9
placebo 1 21
• Each group has 21 subjects, and all subjects of the placebo group were failed
(end of remission) and drug group (6MP) has 12 censored times
• Two separate Weibull distributions are assumed for the failure times of two
treatment groups, e.g.
• 6MP group:
𝑇 ∼ Weibull(𝛼1 , 𝛽1 ), 𝑌 = log 𝑇 ∼ 𝐸𝑉 (𝑢1 , 𝑏1 )
• Placebo group:
• Observed data
{(𝑡𝑖 , 𝛿𝑖 ), 𝑖 = 1, … , 𝑛}
• Log-likelihood function
𝑛
ℓ(𝛼, 𝛽) = ∑ [𝛿𝑖 log 𝑓(𝑡𝑖 ; 𝛼, 𝛽) + (1 − 𝛿𝑖 ) log 𝑆(𝑡𝑖 ; 𝛼, 𝛽)]
𝑖=1
• MLEs
(𝛼,̂ 𝛽)̂ ′ = arg max ℓ(𝛼, 𝛽)
(𝛼,𝛽)′ ∈Θ
• Log-likelihood function
𝑛 𝛿𝑖 1−𝛿𝑖
ℓ𝑒𝑣 (𝑢, 𝑏) = log ∏ [𝑓(𝑦𝑖 ; 𝑢, 𝑏)] [𝑆(𝑦𝑖 ; 𝑢, 𝑏)] (30)
𝑖=1
• MLEs
(𝑢,̂ 𝑏)̂ ′ = arg max ℓ𝑒𝑣 (𝑢, 𝑏)
(𝑢,𝑏)′ ∈Θ
• In formula, response is a Surv object, e.g. to model the variables time and
status
formula = Surv(time, status) ∼ 1
• Lifetime or log-lifetime distributions can be passed to survreg by the argument
dist
• The survreg() function returns estimates of (𝑢, log 𝑏)′ and corresponding
variance matrix
• For making inference about Weibull distribution, followings are required
1. estimate of (𝑢, 𝑏)′ and the corresponding variance matrix
2. estimate of (𝛼, 𝛽)′ and the corresponding variance matrix
• It is important to understand the methods to obtain estimates and the
corresponding variance of (𝑢, 𝑏)′ and (𝛼, 𝛽)′ from the estimates and the
corresponding variance of (𝑢, log 𝑏)′
• Similarly
• Similarly
0
3.0 3.5 4.0 4.5 5.0
u0
Figure 1: Plot of LRT statistic against different null values 𝑢0 and 95% confidence interval for
𝑢 and 𝛼
2.5
0.0
−1.0 −0.5 0.0 0.5
log(b0)
Figure 2: Plot of LRT statistic against different null values 𝑏0 and 95% confidence interval for
log 𝑏 and 𝛽
> tidy(w_sreg_p)
## # A tibble: 2 x 5
## term estimate std.error statistic p.value
## <chr> <dbl> <dbl> <dbl> <dbl>
## 1 (Intercept) 2.25 0.168 13.4 5.72e-41
## 2 Log(scale) -0.315 0.174 -1.82 6.94e- 2
• Note the estimate of 𝑦𝑝̂ depends on the estimate of 𝑢̂ and 𝑏,̂ and the
corresponding variance matrix
• survreg() returns estimate and variance matrix for 𝑢̂ and log 𝑏̂
Table 4: 95% confidence intervals for different quantiles of treatment group (6-MP)
CI for 𝑡𝑝
𝑝 𝑤𝑝 𝑦𝑝̂ 𝑠𝑒(𝑦𝑝̂ ) lower upper
0.25 -1.246 2.599 0.655 3.726 48.559
0.50 -0.367 3.249 0.264 15.357 43.225
0.75 0.327 3.761 0.395 19.822 93.241
4
Λ1
0
1.5 2.0 2.5 3.0
yp
Figure 3: Plot of LRT statistic against different null values 𝑦𝑝0 and 95% confidence interval for
𝑦.25 and 𝑡.25 (6-MP group)
2.5
0.0
2.5 3.0 3.5 4.0
yp
Figure 4: Plot of LRT statistic against different null values 𝑦𝑝0 and 95% confidence interval for
𝑦.5 and 𝑡.5 (6-MP group)
Table 5: 95% confidence intervals of different quantiles using Wald and LRT method
(6-MP group)
Wald LRT
𝑝 lower upper lower upper
0.25 3.726 48.559 6.586 23.058
0.50 15.357 43.225 16.289 51.342
0.75 19.822 93.241 27.522 112.730
Table 6: 95% confidence intervals for different quantiles using Wald and LRT method
(placebo group)
Wald LRT
𝑝 lower upper lower upper
0.25 1.362 10.707 2.031 5.927
0.50 4.499 11.708 5.755 9.488
0.75 8.592 16.863 8.873 16.996
0.75
0.25
0.00
0 10 20 30
remission time (in weeks)
Figure 5: Comparison survival probabilities of between two treatment groups
0.75
survival prob
6−MP
0.50
Placebo
0.25
0.00
0 10 20 30
remission time (in weeks)
Figure 6: Comparison of parametric (Weibull) and non-parametric (step-function) estimates of
survivor function using remission time data
• Obtain Wald and LRT statistics based confidence interval for the survival
probability 𝑆(10)
𝑦−𝜇 𝑦−𝜇
𝑆(𝑦; 𝜇, 𝜎) = 𝑆0 ( ) = 1 − Φ( ) (37)
𝜎 𝜎
𝜕 log 𝑓0 (𝑧)
= −𝑧 (41)
𝜕𝑧
𝜕 2 log 𝑓0 (𝑧)
= −1 (42)
𝜕𝑧 2
𝜕 log 𝑆0 (𝑧) 𝑓 (𝑧)
=− 0 (43)
𝜕𝑧 𝑆0 (𝑧)
2
𝜕 2 log 𝑆0 (𝑧) 𝑧𝑓 (𝑧) 𝑓 (𝑧)
2
= 0 −[ 0 ] (44)
𝜕𝑧 𝑆0 (𝑧) 𝑆0 (𝑧)
• Sampling distribution
where
−1
𝑉 ̂ = [ − 𝐻(𝜇,̂ 𝜎)]
̂
log 𝑡 − 𝜇̂
𝑆(𝑡; 𝜇,̂ 𝜎)̂ = 1 − Φ( )
𝜎̂
= 1 − Φ(𝜓)̂ (47)
where
log 𝑡 − 𝜇̂
𝜓 ̂ = Φ−1 (1 − 𝑆(𝑡; 𝜇,̂ 𝜎))
̂ =
𝜎̂
• Standard error of 𝜓 ̂
𝑠𝑒(𝜓)̂ = √𝐚′ 𝑉 ̂ 𝐚
where
𝐚 = (−1/𝜎,̂ −𝜓/̂ 𝜎)̂ ′
• (1 − 𝛼)100% CI of 𝑆(𝑡)
𝐿<𝜓<𝑈
𝐿 < Φ−1 (1 − 𝑆(𝑡; 𝜇, 𝜎)) < 𝑈
Φ(𝐿) < 1 − 𝑆(𝑡; 𝜇, 𝜎) < Φ(𝑈 )
1 − Φ(𝑈 ) < 𝑆(𝑡; 𝜇, 𝜎) < 1 − Φ(𝐿)
where
𝐿 = 𝜓 ̂ − 𝑧1−𝛼/2 𝑠𝑒(𝜓)̂
𝑈 = 𝜓 ̂ + 𝑧1−𝛼/2 𝑠𝑒(𝜓)̂
𝑦𝑝̂ = 𝜇̂ + 𝜎𝑤
̂ 𝑝
𝑠𝑒(𝑦𝑝̂ ) = √𝐚′ 𝑉 ̂ 𝐚
where
𝐚 = (1, 𝑤𝑝 )′
• Obtain the expressions of Wald-type and LRT based 100(1 − 𝛼)% confidence
intervals of 𝑦𝑝
1 0
𝐺=[ ]
0 exp(𝜎)
Wald LRT
par lower upper lower upper
𝜇 4.942 5.447 5.000 5.400
𝜎 0.676 1.127 0.709 1.109
log 80 − 𝜇̂
𝑆(80; 𝜇,̂ 𝜎)̂ = 1 − Φ( )
𝜎̂
= 0.824
0.75
estimate 0.50
log−normal
0.25
PL
0.00
0 50 100 150
time
Figure 7: Comparison of the estimates of survivor function
‘
Mahbub Latif Lifetime Data Analysis November 2023 119 / 185
Table 8: Estimate and confidence interval of 𝑆(80)
95% Wald-type CI
parameter est lower upper
𝑆(80) 0.824 0.667 0.924
• Obtain LRT based 95% CI for 𝑆(80)
𝑦𝑝̂ = 𝜇̂ + 𝜎𝑤
̂ 𝑝
• 𝑤𝑝 = Φ−1 (𝑝)
95% Wald-type CI
𝑝 𝑤𝑝 𝑦𝑝̂ se lower upper
0.25 -0.674 4.606 0.105
0.50 0.000 5.195 0.129
0.75 0.674 5.783 0.194
(𝛽/𝛼)(𝑡/𝛼)𝛽−1
𝑓(𝑡; 𝛼, 𝛽) = 2
(48)
[1 + (𝑡/𝛼)𝛽 ]
−1
𝑆(𝑡; 𝛼, 𝛽) = [1 + (𝑡/𝛼)𝛽 ] (49)
(𝛽/𝛼)(𝑡/𝛼)𝛽−1
ℎ(𝑡; 𝛼, 𝛽) = (50)
[1 + (𝑡/𝛼)𝛽 ]
• 𝑧 = (𝑦 − 𝑢)/𝑏
𝑦−𝑢
𝑆(𝑦; 𝑢, 𝑏) = 𝑆0 ( ) (55)
𝑏
1
= (56)
1 + exp [(𝑦 − 𝑢)/𝑏]
• Log-likelihood function
𝑛 𝛿𝑖 1−𝛿𝑖
ℓ(𝜇, 𝜎) = log ∏ [(1/𝑏) 𝑓0 (𝑧𝑖 )] [𝑆0 (𝑧𝑖 )]
𝑖=1
𝑛 𝑛
= −𝑟 log 𝑏 + ∑ 𝛿𝑖 log 𝑓0 (𝑧𝑖 ) + ∑(1 − 𝛿𝑖 ) log 𝑆0 (𝑧𝑖 )
𝑖=1 𝑖=1
𝑛
= −𝑟 log 𝑏 + ∑ [𝛿𝑖 {𝑧𝑖 − log(1 + 𝑒𝑧𝑖 )} − log(1 + 𝑒𝑧𝑖 )]
𝑖=1
• Sampling distribution
where
−1
̂
𝑉 ̂ = [ − 𝐻(𝑢,̂ 𝑏)]
𝑦 − 𝑢̂ 1
𝑆0 ( ) = 𝑆(𝑦; 𝑢,̂ 𝑏)̂ = (63)
𝑏̂ ̂ 𝑏]̂
1 + exp [(𝑦 − 𝑢)/
1 − 𝑆(𝑦) 𝑦 − 𝑢̂
log [ ]= = 𝜓 ̂ = 𝑆0−1 (𝑆(𝑦)) (64)
𝑆(𝑦) 𝑏 ̂
• Standard error of 𝜓 ̂
𝐿<𝜓<𝑈
1 − 𝑆(𝑦)
𝐿 < log <𝑈
𝑆(𝑦)
1 − 𝑆(𝑦)
exp(𝐿) < < exp(𝑈 )
𝑆(𝑦)
1 − 𝑆(𝑦)
1 + exp(𝐿) < 1 + < 1 + exp(𝐿)
𝑆(𝑦)
1 1
< 𝑆(𝑦) <
1 + exp(𝑈 ) 1 + exp(𝐿)
where
𝐿 = 𝜓 ̂ − 𝑧1−𝛼/2 𝑠𝑒(𝜓)̂
𝑈 = 𝜓 ̂ + 𝑧1−𝛼/2 𝑠𝑒(𝜓)̂
Mahbub Latif Lifetime Data Analysis November 2023 132 / 185
Estimate of survivor function Lrt based
Λ(𝑠0 ) ≤ 𝜒2(1),1−𝛼
where
1 − 𝑠0
𝑆(𝑦0 ) = 𝑠0 ⇒ 𝑢 = 𝑦0 − 𝑏 log
𝑠0
̂ 𝑝
𝑦𝑝̂ = 𝑢̂ + 𝑏𝑤
𝑠𝑒(𝑦𝑝̂ ) = √𝐚′ 𝑉 ̂ 𝐚
where
𝐚 = (1, 𝑤𝑝 )′
Mahbub Latif Lifetime Data Analysis November 2023 135 / 185
Homework
• Obtain the expressions of Wald-type and LRT based 100(1 − 𝛼)% confidence
intervals of 𝑦𝑝
Wald LRT
dist par est lower upper lower upper
Logistic 𝑢 5.121 4.920 5.321 5.000 5.300
𝑏 0.438 0.326 0.587 0.360 0.559
Gaussian 𝜇 5.195 4.942 5.447 5.000 5.400
𝜎 0.873 0.676 1.127 0.709 1.109
1
𝑆(80; 𝑢,̂ 𝑏)̂ =
̂ 𝑏]̂
1 + exp [(log 80 − 𝑢)/
= 0.844
0.75
estimate 0.50
log−logistic
0.25 log−normal
PL
0.00
0 50 100 150 200 250
time
Figure 8: Comparison of the estimates of survivor function
‘
Mahbub Latif Lifetime Data Analysis November 2023 145 / 185
Table 11: Estimate and corresponding Wald-type confidence interval of the survival
probability 𝑆(80)
95% CI
dist est lower upper
Log-logistic 0.844 0.566 0.957
Log-normal 0.824 0.667 0.924
̂ 𝑝
𝑦𝑝̂ = 𝑢̂ + 𝑏𝑤
𝑝
• 𝑤𝑝 = log 1−𝑝
95% Wald-type CI
dist 𝑝 𝑤𝑝 𝑦𝑝̂ se lower upper
Logistic 0.25 -1.099 4.640 0.143
0.50 0.000 5.121 0.102
0.75 1.099 5.601 0.234
Gaussian 0.25 -0.674 4.826 0.101
0.50 0.000 5.121 0.102
0.75 0.674 5.416 0.177
• Analyze the locomotive control lifetimes using Weibull model and compare the
results
• General expression of the 𝑝𝑡ℎ quantile of the 𝑗𝑡ℎ population takes the form
𝑦𝑗𝑝 = 𝑢𝑗 + 𝑏𝑗 𝑤𝑝 , 𝑗 = 1, … , 𝑚 (68)
• 𝑤𝑝 = 𝑆0−1 (1 − 𝑝)
• When the scales are not equal (i.e. 𝑏1 ≠ 𝑏2 ), the difference between the 𝑝𝑡ℎ
quantiles does depend on the probability 𝑝 wp=p er function
• The ratio of the 𝑝𝑡ℎ quantiles of two lifetime distributions does not depend on
the probability 𝑝 when the corresponding shape parameters are equal (𝛽1 = 𝛽2 )
𝑆1 (𝑦) = 𝑆2 (𝑦)
• Under the assumption of common scale (shape for lifetime) parameter, the null
hypothesis of equality of two distributions can be expressed as
𝑆1 (𝑦 + 𝑢1 − 𝑢2 ) = 𝑆2 (𝑦) (73)
𝑆1 (𝑡(𝛼1 /𝛼2 )) = 𝑆2 (𝑡) (74)
• That is, the survivor functions for 𝑌 are translations of one another by an
amount (𝑢1 − 𝑢2 ) along the 𝑦-axis
𝐻0 ∶ 𝑦1𝑝 = 𝑦2𝑝
(𝑦1𝑝
̂ − 𝑦2𝑝
̂ ) − (𝑦1𝑝 − 𝑦2𝑝 )
𝑍𝑝 = 1/2
∼ 𝒩(0, 1) under 𝐻0
[𝑣𝑎𝑟(𝑦1𝑝
̂ ) + 𝑣𝑎𝑟(𝑦2𝑝 )]
• The statistic 𝑍𝑝 can be used to obtain confidence interval for (𝑦1𝑝 − 𝑦2𝑝 )
• The statistic 𝑍𝑏 can be used to obtain confidence interval for (𝑏1 /𝑏2 )
𝐻0 ∶ 𝑢1 = 𝑢2 , 𝑏1 = 𝑏2
𝐻0 ∶ 𝑢1 = ⋯ = 𝑢𝑚 , 𝑏1 = ⋯ = 𝑏𝑚
𝐻1 ∶ all 𝑢𝑗 ’s are not equal, 𝑏1 = ⋯ = 𝑏𝑚
𝐻0 ∶ 𝑏1 = ⋯ = 𝑏𝑚 = 𝑏 (say) (75)
• Log-likelihood function
𝑚
ℓ(𝑢1 , … , 𝑢𝑚 , 𝑏1 , … , 𝑏𝑚 ) = ∑ ℓ𝑗 (𝑢𝑗 , 𝑏𝑗 ) (76)
𝑗=1
• 𝑟𝑗 = ∑𝑖 𝛿𝑗𝑖
Mahbub Latif Lifetime Data Analysis November 2023 164 / 185
Case 1
• LRT statistic
• MLEs
• (𝑢̂𝑗 , 𝑏̂𝑗 )′ = arg maxΘ ℓ𝑗 (𝑢𝑗 , 𝑏𝑗 ), 𝑗 = 1, … , 𝑚
• 100(1 − 𝛼)% confidence interval of (𝑏1 /𝑏2 ) can be obtained from the range of
𝑎 values that satisfy
Λ(𝑎) ≤ 𝜒2(1),1−𝛼 ,
where the LRT statistic
• Test equality of several location parameters when scale parameters are equal
𝐻0 ∶ 𝑢 1 = ⋯ = 𝑢 𝑚 , 𝑏 1 = ⋯ = 𝑏 𝑚 (78)
𝐻1 ∶ all 𝑢𝑗 ’s are not equal, 𝑏1 = ⋯ = 𝑏𝑚 (79)
• MLEs
• under 𝐻0 , (𝑢⋆ , 𝑏⋆ ) = arg maxΘ ℓ(𝑢, … , 𝑢, 𝑏, … , 𝑏)
• LRT statistic
𝐻0 ∶ 𝑢1 − 𝑢2 = 𝛿, 𝑏1 = 𝑏2 vs 𝐻1 ∶ 𝑢1 − 𝑢2 ≠ 𝛿, 𝑏1 = 𝑏2
• LRT statistic
• 100(1 − 𝛼) confidence interval for (𝑢1 − 𝑢2 ) can be obtained from the set of 𝛿
values that satisfy Λ(𝛿) ≤ 𝜒2(1),1−𝛼
Mahbub Latif Lifetime Data Analysis November 2023 169 / 185
Case 5
• 𝑤𝑝 = 𝑆0−1 (1 − 𝑝)
• LRT statistic
• under 𝐻0
• under 𝐻1
(𝑢̂1 , 𝑢̂2 , 𝑏̂2 , 𝑏̂1 ) = arg max ℓ(𝑢1 , 𝑢2 , 𝑏1 , 𝑏2 )
Θ
• 100(1 − 𝛼) confidence interval for (𝑦1𝑝 − 𝑦2𝑝 ) can be obtained from the set of
Δ values that satisfy Λ(Δ) ≤ 𝜒2(1),1−𝛼
• 𝑢𝑗 = log 𝛼𝑗
• 𝑏𝑗 = 1/𝛽𝑗
Voltage
0.75 28
30
0.50 32
y
34
36
0.25
38
0.00
0 50 100 150
x
Figure 9: Comparison of estimated survivor function
• p-value
𝑃 𝑟(𝜒2(6) ≥ Λ) = 0.185
It does not provide enough evidence to reject the null hypothesis of equality of
the scale parameters.
• Wald-type
comparison
3:5
3:4
2:7
2:6
2:5
2:4
2:3
1:7
1:6
1:5
1:4
1:3
1:2
0
8
^ ^
bj bj′
Figure 10: Estimate and 95% confidence interval of pair-wise comparisons of scale parameters
(𝑏𝑗 /𝑏𝑗′ )
𝐻0 ∶ 𝑢 1 = ⋯ = 𝑢 𝑚 , 𝑏 1 = ⋯ = 𝑏 𝑚
• LRT statistic
• p-value 𝑃 (𝜒2(1) > 80.013) < .001 → There is a strong evidence against the
assumption of equality of 𝑚 location parameters
comparison
3:5
3:4
2:7
2:6
2:5
2:4
2:3
1:7
1:6
1:5
1:4
1:3
1:2
0.0
2.5
5.0
7.5
^ −u
u ^
j j′
Figure 11: Estimate and 95% confidence interval of pair-wise comparisons of location
parameters (𝑢𝑗 − 𝑢𝑗′ )
𝑦𝑗𝑝 = 𝑢𝑗 + 𝑏𝑗 𝑤𝑝 , 𝑗 = 1, … , 𝑚
• 95% confidence interval for the difference of median between groups 1 and 2
𝑦1𝑚
̂ − 𝑦2𝑚
̂ ± 𝑧1−𝛼/2 𝑠𝑒(𝑦1𝑚
̂ − 𝑦2𝑚
̂ )
(6.19 − 5.491) ± (1.96)(1.291)
−1.831 < (𝑦1𝑚 − 𝑦2𝑚 ) < 3.231
comparison
3:5
3:4
2:7
2:6
2:5
2:4
2:3
1:7
1:6
1:5
1:4
1:3
1:2 0.0
2.5
5.0
7.5
y^pj − y^pj′
Figure 12: Estimate and 95% confidence interval of pair-wise comparisons of medians
(𝑦𝑗,.5 − 𝑦𝑗′ ,.5 )
• Analyse the breakdown time data using log-logistic and log-normal distributions
and compare the results with that of Weibull distribution