I have a question on the cox proportional hazard model, in particular the proportionality assumption. I use cox.zph() function in R to check whether the proportionality assumption is satisfied. The global test of cox.zph() has p-value much greater than 0.05, but one of the covariates has p-value close to 0.01. In this situation, should I do a remodelling (can be stratification or time interaction), or should i say that the assumption is still not violated since globally its not significant? Thank you
Solved – Checking the proportional hazard assumption
cox-modelrsurvival
Related Question
- Cox Model – Understanding the Cox Proportional Hazards Model
- Solved – coxme proportional hazard assumption
- Survival – Proportionality Assumption in Cox Regression Model: Detailed Explanation
- Solved – How to handle multiple violations of the Schoenfeld residuals assumption in cox proportional hazard models
- Cox Model – Addressing Violation of Proportional Hazard Assumption and Interaction with Time
Best Answer
The global test of proportional hazards is not well-calibrated. You haven't controlled for multiple comparisons. It's difficult to gauge power of the test. $\alpha=0.05$ is probably too lax in most sample sizes. The test is arbitrarily powerful in large sample sizes. It's possible that the covariate you identify is a spurious finding, and that it arises from natural variability in the observation of time-to-event data.
Even if the hazards were not proportional, altering the model to fit a set of assumptions fundamentally changes the scientific question. As Tukey said, "Better an approximate answer to the exact question, rather than an exact answer to the approximate question." If you were to fit the Cox model in the presence of non-proportional hazards, what is the net effect? Slightly less power. In fact, you can recover most of that power with robust standard errors (specify
robust=TRUE
orcluster = ~id
). In this case the interpretation of the (exponentiated) model coefficient is a time-weighted average of the hazard ratio--I do this every single time.When the actual hazard ratio over-time is of interest, there are flexible methods of estimating its value. You may create a flexible, polynomial representation of time using basis splines and fit their interaction with the covariate(s) to estimate a hazard ratio time function. The power of the Cox model may be compromised by this. Using a parametric exponential survival model with spline adjustment for time can approximate the semi-parametric inference of the Cox model very well, and is better powered to detect interactions of time with one or more covariates.