Solved – Checking the proportional hazard assumption

cox-model, r, survival

I have a question about the Cox proportional hazards model, in particular the proportionality assumption. I use the cox.zph() function in R to check whether the assumption is satisfied. The global test from cox.zph() has a p-value much greater than 0.05, but one of the covariates has a p-value close to 0.01. In this situation, should I remodel (for example with stratification or a time interaction), or can I say that the assumption is not violated because the global test is not significant? Thank you.

Best Answer

The global test of proportional hazards is not well-calibrated. You haven't controlled for multiple comparisons. It's difficult to gauge the power of the test. $\alpha=0.05$ is probably too lax at most sample sizes, and the test becomes arbitrarily powerful as the sample size grows. It's possible that the covariate you identified is a spurious finding that arises from the natural variability in time-to-event data.
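As a minimal sketch of the multiple-comparisons point, the per-covariate p-values from cox.zph() can be adjusted before any single one is read as evidence. The `veteran` data set (shipped with the survival package), the chosen covariates, and the Holm method are assumptions made for illustration; they are not the asker's model.

```r
library(survival)

# Illustrative fit on the veteran data bundled with survival (not the asker's data)
fit <- coxph(Surv(time, status) ~ trt + karno + age, data = veteran)

# Score tests of proportional hazards: one row per covariate plus a GLOBAL row
zph <- cox.zph(fit)
tab <- zph$table

# Keep only the per-covariate p-values
per_cov <- tab[rownames(tab) != "GLOBAL", "p"]

# Holm adjustment for having looked at several covariates at once
p_holm <- p.adjust(per_cov, method = "holm")
print(cbind(raw = per_cov, holm = p_holm))
```

A covariate whose raw p-value sits near 0.01 may no longer stand out once it is adjusted for the number of covariates tested.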

Even if the hazards were not proportional, altering the model to fit a set of assumptions fundamentally changes the scientific question. As Tukey said, "Far better an approximate answer to the right question, which is often vague, than an exact answer to the wrong question, which can always be made precise." If you fit the Cox model in the presence of non-proportional hazards, what is the net effect? Slightly less power. In fact, you can recover most of that power with robust standard errors (in coxph(), specify robust = TRUE or supply a cluster variable, e.g. cluster = id). In this case the (exponentiated) model coefficient is interpreted as a time-weighted average of the hazard ratio. I do this every single time.
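A short sketch of the robust-variance suggestion, again using the `veteran` data from the survival package as a stand-in (an assumption, not the asker's data). The point estimates are unchanged; only the standard errors differ.

```r
library(survival)

# Same model fitted with the usual and with the robust (sandwich) variance
fit_naive  <- coxph(Surv(time, status) ~ trt + karno, data = veteran)
fit_robust <- coxph(Surv(time, status) ~ trt + karno, data = veteran,
                    robust = TRUE)

# Coefficients are identical; compare the two sets of standard errors
print(cbind(coef      = coef(fit_robust),
            naive_se  = sqrt(diag(fit_naive$var)),
            robust_se = sqrt(diag(fit_robust$var))))
```

With repeated rows per subject, a cluster variable would be supplied instead so that the sandwich estimate groups rows by subject.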

When the actual hazard ratio over time is of interest, there are flexible methods of estimating it. You may create a flexible, polynomial representation of time using basis splines and fit their interaction with the covariate(s) to estimate the hazard ratio as a function of time. The power of the Cox model may be compromised by this. Using a parametric exponential survival model with spline adjustment for time can approximate the semi-parametric inference of the Cox model very well, and is better powered to detect interactions of time with one or more covariates.
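One way to sketch the exponential model with spline adjustment for time is a piecewise-exponential fit: split follow-up into intervals and fit a Poisson GLM with a log-exposure offset. The cut points, the 3-df natural spline, the use of interval midpoints as the time scale, and the `veteran` data are all illustrative assumptions rather than part of the answer.

```r
library(survival)
library(splines)

# Split each subject's follow-up at illustrative cut points (days)
cuts <- c(30, 60, 90, 120, 180, 270, 360, 540)
vet_long <- survSplit(Surv(time, status) ~ ., data = veteran, cut = cuts)

# Time at risk within each interval, and the interval midpoint as time scale
vet_long$exposure <- vet_long$time - vet_long$tstart
vet_long$mid <- (vet_long$tstart + vet_long$time) / 2

# Poisson GLM with log-exposure offset: baseline hazard is a spline in time,
# and the spline-by-karno interaction lets the hazard ratio vary with time
fit_pe <- glm(status ~ ns(mid, df = 3) * karno + offset(log(exposure)),
              family = poisson, data = vet_long)

# Hazard ratio for a 10-point higher Karnofsky score at selected days
grid <- data.frame(mid = c(30, 90, 180, 360), exposure = 1)
log_hr <- predict(fit_pe, newdata = transform(grid, karno = 70)) -
          predict(fit_pe, newdata = transform(grid, karno = 60))
print(data.frame(day = grid$mid, hr_per_10 = exp(log_hr)))
```

Because the offset cancels in the difference of the two predictions, `log_hr` depends only on the karno main effect and its interaction with the time spline.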

Related Solutions

Solved – Handling borderline cases of the proportional hazards assumption

There are several options. As a next step, I'd recommend examining how much the assumption affects your hazard ratio estimates, rather than relying on a test statistic (or even the log-log graphs, which don't tell you the size of the impact). The two I'd initially suggest:

- Explicitly add in a time * covariate interaction to examine how the hazard ratio for your covariate changes over time.
- Add in a "heaviside" interaction term that explicitly models a hazard ratio for an "early" period and a "late" period. This is probably only sensible when you have an a priori cut-off for defining early/late (e.g. we've used it when modelling rescreening times in breast cancer, where there are defined screening provider targets for a 27 month rescreen interval, so we defined the break point for the heaviside function at 27 months).
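Both options can be sketched with the survival package. The `veteran` data, the log(t) form of the interaction, and the 90-day cut-off below are assumptions chosen for illustration; in practice the cut-off should come from subject-matter knowledge, as described above.

```r
library(survival)

# (1) Continuous time * covariate interaction through tt():
#     log HR for karno at time t = coef["karno"] + coef["tt(karno)"] * log(t)
fit_tt <- coxph(Surv(time, status) ~ trt + karno + tt(karno),
                data = veteran,
                tt = function(x, t, ...) x * log(t))
print(coef(fit_tt))

# (2) Heaviside (step) interaction: split follow-up at an a priori cut-off
cut_day <- 90  # hypothetical cut-off, for illustration only
vet2 <- survSplit(Surv(time, status) ~ ., data = veteran, cut = cut_day)
vet2$late <- as.numeric(vet2$tstart >= cut_day)  # 0 = early, 1 = late period

fit_step <- coxph(Surv(tstart, time, status) ~ trt + karno + karno:late,
                  data = vet2)

# Separate hazard ratios for the early and late periods
b <- coef(fit_step)
hr_early <- exp(b[["karno"]])
hr_late  <- exp(b[["karno"]] + b[["karno:late"]])
print(c(early = hr_early, late = hr_late))
```

Comparing `hr_early` and `hr_late` with the single hazard ratio from the unmodified model shows how much the estimate actually moves, which is the impact check suggested above.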

Obviously stratification by the covariate in question isn't an option... since there's only one covariate and it's the one you're principally interested in!

Some of these options are covered in Survival Analysis: A Self-Learning Text.

See also the course notes from the National University of Singapore, which are on the mathematical side.

Solved – Violation of proportional hazard for covariate but not for interaction it’s part of in a Cox Proportional Hazards model

In your model you need to add an interaction term for the infected:

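library(rms)  # provides cph()
# Counting-process Cox model with a treatment-by-time interaction
cph(formula = Surv(start_time, end_time, event) ~ feed_time +
    treatment * clutch1 + treatment:start_time +
    cluster(cage), data = df, x = TRUE, y = TRUE)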

See my own answer here and my blog about this here. Since you have a limited amount of data you can also use the tt() approach although I'm uncertain if it works as expected with the rms::cph wrapper.

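library(survival)  # coxph(), Surv(), tt()
library(splines)   # ns()

# tt() passes the covariate (x) and the event time (t) to the function below
coxph(formula = Surv(lifespan) ~ feed_time +
      treatment * clutch1 + tt(treatment) +
      cluster(cage), data = df, x = TRUE, y = TRUE,
      tt = function(x, t, ...){
        ns(x + t, 2)
      })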

If you stratify on your main variable you won't get an estimate and you can't do an interaction variable with the clutch1 variable. I may have misread your question but just to be sure, stratification can only be used with categorical variables and not continuous. You can categorize continuous variables but I wouldn't recommend that.
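A small sketch of the stratification point, using the `veteran` data from the survival package as an assumed stand-in: the stratifying factor gets its own baseline hazard and therefore no coefficient.

```r
library(survival)

# celltype is categorical; stratifying on it gives each cell type its own
# baseline hazard instead of a hazard ratio
fit_strat <- coxph(Surv(time, status) ~ trt + karno + strata(celltype),
                   data = veteran)

# Only trt and karno have coefficients; there is no estimate for celltype
print(names(coef(fit_strat)))
```

This is why stratifying on the variable of primary interest removes the very estimate you are after.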
