R Survival Analysis – Interpretation of coxph: rsq, cox.zph, Robustness, and Concordance

Tags: cox-model, likelihood, r, survival

I would like an explanation of the following parts of the coxph output (library(survival)):

1) Concordance: what counts as a good concordance?
2) What does "Robust = 2.61 p=0.1" mean next to a logrank p=0.006?
3) What is a good rsq (summary(coxph.model)$rsq)?
4) How should I interpret cox.zph(coxph.model)? For every variable in the multivariable coxph.model, the output has a chisq, df and p value. In addition to the variables in the model, there is a GLOBAL row.

This would be a potential output (I've changed the variable names for confidentiality). I think an interpretation of this particular output would be helpful as guidance, if possible.

Thank you!

Best Answer

Point 1. Quoting from Section 20.10 of Frank Harrell's Regression Modeling Strategies:

The c index [concordance] is the proportion of all pairs of subjects whose survival time can be ordered such that the subject with the higher predicted survival is the one who survived longer.

So a concordance of 0.5 is what you get if a model can't distinguish survival times at all. What's "good" above that depends on the nature of the study.
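As a rough illustration of where the concordance comes from, here is a minimal sketch. It assumes the `lung` dataset that ships with the survival package (not the asker's data) and an arbitrary choice of predictors.

```r
library(survival)

# Illustrative model on the lung data bundled with survival
# (dataset and predictors are assumptions for this sketch).
fit <- coxph(Surv(time, status) ~ age + sex + ph.ecog, data = lung)

# Concordance and its standard error, as shown at the bottom of summary(fit)
cstat <- summary(fit)$concordance
print(cstat)

# The same statistic computed directly from the fitted model
cdirect <- concordance(fit)$concordance
print(cdirect)
```

Values near 0.5 suggest the model ranks survival times no better than chance, and values closer to 1 suggest better discrimination.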

Point 2. Standard significance tests assume that observations are independent. Your use of an id variable indicates that some individuals (or groups of individuals with the same id value) contributed to multiple observations. Perhaps a single individual could experience more than 1 event. The "robust" standard error estimate takes that lack of independence into account, generally leading to wider confidence intervals and higher p-values. As the output from the summary says:

The likelihood ratio and score [logrank] tests assume independence of observations within a cluster, the Wald and robust score tests do not.
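To see the difference in practice, the sketch below fits the same model with and without a cluster term. It assumes the `bladder2` recurrent-event data bundled with the survival package as a stand-in, since the original data are not available.

```r
library(survival)

# bladder2: recurrent bladder tumours, several rows per patient (id).
# The dataset is an assumption used only for illustration.
naive  <- coxph(Surv(start, stop, event) ~ rx + size + number,
                data = bladder2)
robust <- coxph(Surv(start, stop, event) ~ rx + size + number + cluster(id),
                data = bladder2)

# Coefficients are identical; the clustered fit adds a "robust se" column
print(summary(robust)$coefficients[, c("coef", "se(coef)", "robust se")])

# Score (logrank) test, which assumes independent observations
print(summary(robust)$sctest)

# Robust score test, which allows for correlation within each id
print(summary(robust)$robscore)
```

A robust p-value that is clearly larger than the logrank p-value, as in the question, is what you would expect when repeated observations on the same id are correlated.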

Point 3. See the answer to Point 1. What's good depends on the nature of the study. I find concordance and measures of model validation and calibration to be more useful than $R^2$ values. I strongly recommend learning to use the tools in Frank Harrell's rms package to evaluate model quality.

Point 4. This is covered at the end of Chapter 3 (Section 3.5.2, "Score tests") of the main survival vignette.

The cox.zph function checks proportional hazards for a fitted Cox model directly...

The test is reported for individual predictors and for the model as a whole (GLOBAL). A low p-value indicates evidence against proportional hazards (PH). In your case, it looks like age might violate PH. You might fix that by modeling age flexibly with a spline (e.g., rcs() in the rms package), because an incorrectly specified functional form for a continuous predictor can show up as an apparent violation of PH.
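A minimal sketch of this check follows. It assumes the bundled `lung` data and a recent version of survival, where the table has chisq, df and p columns. The spline uses ns() from the base splines package as a stand-in for rcs() from rms, so that the example depends only on packages shipped with R.

```r
library(survival)
library(splines)

# Illustrative model; dataset and predictors are assumptions
fit <- coxph(Surv(time, status) ~ age + sex + ph.karno, data = lung)

# One row per predictor plus a GLOBAL row
zp <- cox.zph(fit)
print(zp$table)

# Possible follow-up: model age flexibly with a natural spline,
# then re-check proportional hazards
fit_spline <- coxph(Surv(time, status) ~ ns(age, df = 3) + sex + ph.karno,
                    data = lung)
zp_spline <- cox.zph(fit_spline)
print(zp_spline$table)
```

Calling plot(zp) draws the scaled Schoenfeld residuals against time for each predictor, which can help show how a coefficient appears to drift.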

Related Solutions

Solved – Cox proportional hazard model and interpretation of coefficients when higher case interaction is involved

A couple of suggestions, not directly related to CoxPH but to interactions and collinearity:

1) When you are getting "crazy" values like these, one possibility is collinearity. This is often a problem when you have interactions. Have you centered all your variables (by subtracting the mean from each)?

2) You can't interpret one interaction among many quite so easily. LT, food and temp2 are all involved in many interactions. So, look at predicted values from different combinations.

3) Check the units of the different variables. When you get crazy parameters, sometimes it's a problem of units (e.g. measuring a human height in millimeters or kilometers)

4) Once you've got that stuff straightened out, I find the easiest way to think of the effects of different interactions (esp. higher level ones) is to graph the predicted values with different combinations of the independent values.
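The suggestions above can be sketched as follows. The example assumes the bundled `lung` data with age and ph.karno standing in for the original LT, food and temp2 variables, which are not available here. The grid values are arbitrary.

```r
library(survival)

# Stand-in data; variable choice is an assumption for illustration
d <- na.omit(lung[, c("time", "status", "age", "ph.karno")])

# Suggestion 1: compare the correlation between a main effect and its
# product term before and after centering
cor_raw      <- cor(d$age, d$age * d$ph.karno)
d$age_c      <- d$age - mean(d$age)
d$karno_c    <- d$ph.karno - mean(d$ph.karno)
cor_centered <- cor(d$age_c, d$age_c * d$karno_c)
print(c(raw = cor_raw, centered = cor_centered))

# Suggestions 2 and 4: read the interaction from predictions over a grid
# of covariate combinations rather than from single coefficients
fit  <- coxph(Surv(time, status) ~ age_c * karno_c, data = d)
grid <- expand.grid(age_c = c(-10, 0, 10), karno_c = c(-20, 0, 10))
grid$rel_hazard <- predict(fit, newdata = grid, type = "risk")
print(grid)
```

Plotting rel_hazard against one variable, with a separate line for each level of the other, gives the kind of graph described in suggestion 4.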

Solved – Meta Analysis of Cox Regression Coefficients

Just supply the beta coefficients and corresponding standard errors to the rma() function. So, your syntax should be like this:

library(metafor)
rma(yi = coef, sei = se, data = dat)

where coef is the name of the variable in dataset dat denoting the coefficients and se is the name of the variable for the corresponding standard errors. The standard errors already include the information about the number of samples (and actually, it's the number of "events", not the sample size, that determines the size of the standard errors).
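To make the role of the standard errors concrete, here is a base R sketch of common-effect inverse-variance pooling. The three coefficients and standard errors are hypothetical values invented for illustration. Note that rma() fits a random-effects model by default, which also estimates between-study variance, so this hand calculation covers only the simpler case.

```r
# Hypothetical log hazard ratios and standard errors from three studies
dat <- data.frame(coef = c(0.30, 0.10, 0.25),
                  se   = c(0.10, 0.20, 0.15))

# Inverse-variance weights: studies with smaller standard errors
# (typically those with more events) count for more
w <- 1 / dat$se^2

# Pooled log hazard ratio and its standard error
pooled    <- sum(w * dat$coef) / sum(w)
pooled_se <- sqrt(1 / sum(w))

# Report on both the log scale and the hazard ratio scale
print(c(logHR = pooled, se = pooled_se, HR = exp(pooled)))
```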
