06. Conditional models in recurrent events

knitr::opts_chunk$set(warning = FALSE, message = FALSE)
suppressWarnings({
  suppressMessages({
    library(revents)
    library(survival)
    library(broom)
    library(dplyr)
    library(frailtyEM)
  })
})

# Load the recurrent event data set (data_layout_1) shipped with revents
data("data_layout_1", package = "revents")
data_layout_1 <- revents::data_layout_1

Cox proportional hazards model (CoxPH)

In a time-to-first-event analysis (i.e. classical Cox regression), the risk set is restricted to each patient's first relapse. With recurrent event data, this risk set is obtained by subsetting to the rows with SEVENT == 1.

cox.relapses <- coxph(Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE + AGE + SEX 
                      + RACE + TIME_SINCE_DIAGNOSIS, ties = "breslow", 
                      data = subset(data_layout_1, SEVENT == 1))

cox.relapses
#> Call:
#> coxph(formula = Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE + 
#>     AGE + SEX + RACE + TIME_SINCE_DIAGNOSIS, data = subset(data_layout_1, 
#>     SEVENT == 1), ties = "breslow")
#> 
#>                           coef exp(coef)  se(coef)      z        p
#> DISEASE_COURSESPMS   -0.412918  0.661717  0.077685 -5.315 1.06e-07
#> AGE                   0.001154  1.001155  0.003051  0.378   0.7053
#> SEXMale               0.096333  1.101125  0.061252  1.573   0.1158
#> RACEWhite             0.052987  1.054416  0.108670  0.488   0.6258
#> TIME_SINCE_DIAGNOSIS  0.002447  1.002450  0.001475  1.659   0.0971
#> 
#> Likelihood ratio test=41.48  on 5 df, p=7.489e-08
#> n= 1313, number of events= 1255

The CoxPH model estimates a statistically significant 34% lower risk of a first relapse in the SPMS group than in the RRMS group (HR = 0.66). Note that this model ignores all events that follow the first one.
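The percentage quoted for a hazard ratio is simply 100 × (1 − HR). As a small worked example, the sketch below recomputes it from the DISEASE_COURSESPMS coefficient printed above; the helper name pct_reduction is hypothetical and introduced only for illustration.

```r
# Hypothetical helper: percentage reduction in hazard implied by a hazard ratio
pct_reduction <- function(hr) 100 * (1 - hr)

# Coefficient for DISEASE_COURSESPMS copied from the CoxPH output above
coef_spms <- -0.412918

hr_spms <- exp(coef_spms)          # hazard ratio, SPMS vs RRMS
pct_spms <- pct_reduction(hr_spms) # percentage reduction in hazard

round(hr_spms, 2)
round(pct_spms)
```

The same helper can be applied to the hazard ratio of any other fitted model to check the percentage reported in the text.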

To account for them, we can use recurrent event approaches:

Andersen-Gill (AG) model

This is a classical extension of the Cox model for recurrent event data. It assumes that, given the covariates, the events of a patient are independent, that all events share a common baseline hazard, and that the hazard ratio is constant over time.

ag.relapses <- coxph(Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE + AGE +
                       SEX + RACE + TIME_SINCE_DIAGNOSIS, data = data_layout_1)

ag.relapses %>% tidy(exponentiate = TRUE, conf.int = TRUE)
#> # A tibble: 5 × 7
#>   term                 estimate std.error statistic  p.value conf.low conf.high
#>   <chr>                   <dbl>     <dbl>     <dbl>    <dbl>    <dbl>     <dbl>
#> 1 DISEASE_COURSESPMS      0.450  0.0486     -16.4   1.12e-60    0.409     0.495
#> 2 AGE                     1.00   0.00176      2.17  2.98e- 2    1.00      1.01 
#> 3 SEXMale                 0.972  0.0355      -0.788 4.31e- 1    0.907     1.04 
#> 4 RACEWhite               1.16   0.0608       2.41  1.61e- 2    1.03      1.30 
#> 5 TIME_SINCE_DIAGNOSIS    1.00   0.000910     1.86  6.32e- 2    1.00      1.00

Adjusting for covariates (none of which are time-dependent in this case) and considering all events, the AG model estimates that the risk of relapse in the SPMS group is 55% lower than in the RRMS group (HR = 0.45).
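Because the AG model assumes a hazard ratio that is constant over time, it can be worth examining that assumption. One possible check, sketched here with cox.zph from the survival package (scaled Schoenfeld residuals), refits the AG model under the new name ag.fit, which is introduced only for this illustration. The output is not reproduced because it was not part of the original analysis.

```r
library(survival)

# Same data set as used throughout this article
data("data_layout_1", package = "revents")

# AG model in counting-process form, as fitted above
ag.fit <- coxph(Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE + AGE +
                  SEX + RACE + TIME_SINCE_DIAGNOSIS, data = data_layout_1)

# Test of proportional hazards per covariate and globally;
# small p-values suggest a hazard ratio that changes over time
ph.check <- cox.zph(ag.fit)
ph.check
```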

The AG model can also be fitted with robust standard errors to account for correlation among the events of the same patient. In coxph this is done by supplying a cluster term that identifies the patient, which implies robust = TRUE (not applied in this example).
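A minimal sketch of that robust variant follows. It assumes that the ID column (the one used in the frailty model later in this article) identifies the patient; the object name ag.robust is hypothetical. The coefficients are the same as in the fit above, only the standard errors change, so no output is reproduced here.

```r
library(survival)

data("data_layout_1", package = "revents")

# AG model with a cluster term on the patient identifier (assumed to be ID);
# cluster() requests the robust (sandwich) variance estimator
ag.robust <- coxph(Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE + AGE +
                     SEX + RACE + TIME_SINCE_DIAGNOSIS + cluster(ID),
                   data = data_layout_1)

# Compare model-based and robust standard errors side by side
summary(ag.robust)$coefficients[, c("coef", "se(coef)", "robust se")]
```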

Prentice-Williams-Peterson (PWP TT/CT) model (overall effect)

The PWP model is useful when the dependence between events needs to be accounted for. It adds a strata term to the model, which accounts for this dependence while still yielding an overall effect.

pwp.tt.relapses <- coxph(Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE + 
                           AGE + SEX + RACE + TIME_SINCE_DIAGNOSIS + 
                           strata(STATUS), data = data_layout_1)

pwp.tt.relapses
#> Call:
#> coxph(formula = Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE + 
#>     AGE + SEX + RACE + TIME_SINCE_DIAGNOSIS + strata(STATUS), 
#>     data = data_layout_1)
#> 
#>                            coef  exp(coef)   se(coef)      z        p
#> DISEASE_COURSESPMS   -3.980e-01  6.716e-01  4.864e-02 -8.184 2.75e-16
#> AGE                   2.340e-03  1.002e+00  1.758e-03  1.331  0.18325
#> SEXMale               4.159e-06  1.000e+00  3.553e-02  0.000  0.99991
#> RACEWhite            -8.101e-02  9.222e-01  6.093e-02 -1.330  0.18368
#> TIME_SINCE_DIAGNOSIS  2.308e-03  1.002e+00  8.847e-04  2.609  0.00908
#> 
#> Likelihood ratio test=86.57  on 5 df, p=< 2.2e-16
#> n= 5092, number of events= 3779

When the analysis is based on restricted risk sets (the risk set for an event only includes patients with the same number of previous events) on a calendar (total) time scale, the PWP model gives an HR of 0.67 for the SPMS group compared with the RRMS group.
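The call above stratifies on STATUS, the event indicator. The restricted risk sets described here correspond instead to stratifying on the number of the event. A minimal sketch of that specification, assuming that SEVENT holds the event order for each patient (as its use for the first-event subset suggests); the object name pwp.tt.sevent is hypothetical, and the estimates will differ from the output shown above, so they are not reproduced here.

```r
library(survival)

data("data_layout_1", package = "revents")

# Number of rows per event number; sparse late strata may be unstable
table(data_layout_1$SEVENT)

# PWP total-time model with a separate baseline hazard per event number
# (assumes SEVENT is the within-patient event order)
pwp.tt.sevent <- coxph(Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE +
                         AGE + SEX + RACE + TIME_SINCE_DIAGNOSIS +
                         strata(SEVENT), data = data_layout_1)

pwp.tt.sevent
```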

Prentice-Williams-Peterson (PWP-GT) model (overall effect)

The PWP model can also account for inter-event dependence on the gap time scale. This model is useful when the time between events is of interest and there is a renewal after each event (i.e. the participant returns to the initial state, as in diseases where immunity does not develop after the first event).

pwp.gt.relapses <- coxph(Surv(TGAP, STATUS) ~ DISEASE_COURSE + AGE + SEX
                         + RACE + TIME_SINCE_DIAGNOSIS +
                           strata(STATUS), data = data_layout_1)

pwp.gt.relapses
#> Call:
#> coxph(formula = Surv(TGAP, STATUS) ~ DISEASE_COURSE + AGE + SEX + 
#>     RACE + TIME_SINCE_DIAGNOSIS + strata(STATUS), data = data_layout_1)
#> 
#>                            coef  exp(coef)   se(coef)      z       p
#> DISEASE_COURSESPMS   -0.4705481  0.6246598  0.0489033 -9.622 < 2e-16
#> AGE                   0.0026107  1.0026141  0.0017573  1.486 0.13737
#> SEXMale               0.0071008  1.0071261  0.0355375  0.200 0.84163
#> RACEWhite            -0.0819665  0.9213029  0.0609020 -1.346 0.17834
#> TIME_SINCE_DIAGNOSIS  0.0024122  1.0024151  0.0008833  2.731 0.00632
#> 
#> Likelihood ratio test=121.9  on 5 df, p=< 2.2e-16
#> n= 5092, number of events= 3779

On the gap time scale, the PWP model gives an HR of 0.62 for the SPMS group compared with the RRMS group.
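To make the difference between the two time scales concrete, the toy example below builds the rows of a single hypothetical patient and derives the gap time as the length of each interval. All values are invented for illustration, and the relation TGAP = TSTOP - TSTART is an assumption about how the gap time column is constructed.

```r
# One hypothetical patient: two relapses, then censoring (invented values)
toy <- data.frame(
  ID     = c(1, 1, 1),
  SEVENT = 1:3,             # event order within the patient
  TSTART = c(0, 30, 75),    # calendar (total) time at the start of the interval
  TSTOP  = c(30, 75, 120),  # calendar (total) time at the end of the interval
  STATUS = c(1, 1, 0)       # 1 = relapse, 0 = censored
)

# Gap time: the clock restarts at 0 after every event
toy$TGAP <- toy$TSTOP - toy$TSTART

toy
```

On the total time scale the second interval runs from 30 to 75, whereas on the gap time scale it is simply an interval of length 45 starting at 0.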

Frailty model

The frailty model introduces a random effect, or frailty term, into the model that induces dependence among the recurrent event times of the same patient.

frailty.relapses <- coxph(Surv(TSTOP, STATUS) ~ DISEASE_COURSE + AGE + SEX
                          + RACE + TIME_SINCE_DIAGNOSIS + frailty(ID),
                          data = data_layout_1)

tidy(frailty.relapses, exponentiate = TRUE, conf.int = TRUE)
#> # A tibble: 6 × 7
#>   term                 estimate std.error  statistic  p.value conf.low conf.high
#>   <chr>                   <dbl>     <dbl>      <dbl>    <dbl>    <dbl>     <dbl>
#> 1 DISEASE_COURSESPMS      0.707  0.0486   50.8       1.02e-12    0.643     0.778
#> 2 AGE                     1.00   0.00175   0.155     6.94e- 1    0.997     1.00 
#> 3 SEXMale                 1.00   0.0355    0.0000366 9.95e- 1    0.933     1.07 
#> 4 RACEWhite               1.01   0.0608    0.0271    8.69e- 1    0.897     1.14 
#> 5 TIME_SINCE_DIAGNOSIS    1.00   0.000913  3.23      7.21e- 2    1.00      1.00 
#> 6 frailty(ID)            NA     NA         0.000590  8.55e- 1   NA        NA

Conditional on the unmeasured heterogeneity and the covariates, the frailty model indicates that the SPMS group had a 29% lower risk of relapse than the RRMS group (HR = 0.71).

To check whether a frailty model is justified in our study, the emfrail function from the frailtyEM package can be used to obtain the frailty variance or Kendall’s tau. A variance close to 0 indicates no heterogeneity between patients, in which case a frailty model would not be justified (https://www.jstatsoft.org/article/view/v090i07).
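A minimal sketch of such a check is given below. The gamma frailty distribution, the counting-process form of the response (as in the AG fit, rather than the Surv(TSTOP, STATUS) form of the coxph frailty fit above) and the object name em.relapses are assumptions made for illustration; no output is reproduced because this fit was not part of the original analysis.

```r
library(survival)
library(frailtyEM)

data("data_layout_1", package = "revents")

# Shared gamma frailty model; cluster(ID) marks the rows of the same patient
# (assumes ID is the patient identifier)
em.relapses <- emfrail(Surv(TSTART, TSTOP, STATUS) ~ DISEASE_COURSE + AGE +
                         SEX + RACE + TIME_SINCE_DIAGNOSIS + cluster(ID),
                       data = data_layout_1,
                       distribution = emfrail_dist(dist = "gamma"))

# The summary reports, among other things, the estimated frailty variance,
# Kendall's tau and tests for the presence of heterogeneity
summary(em.relapses)
```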

David Herman
