Solved – High Censoring Rate in Survival Analysis; Much higher survival time among censored patients

censoringsurvival

I am trying to understand censoring in survival analysis and
wondering about how to tell when standard use of censoring breaks down.
In one case, the number of censored patients is fairly high (low death rate), yet the median or mean "survival" time (times of last confirmed observation of the censored patient) among these censored patients with death unconfirmed is nearly twice the equivalent survival times found among the uncensored patients with confirmed dates of death.

In the limiting case of a "wonder-drug" where nearly all of the patients survived beyond the trial close-out date, nearly all patients would be censored and using only the remaining uncensored patients (who may have died in the short time before the drug "kicked-in") might lead to a very poor looking survival curve that belies the activity of the drug.

I understand that censoring is usually a useful method, but it seems a bit counter-intuitive to assess the effectiveness of a drug based on patients for whom the drug is least effective.

Best Answer

I also found the word "censoring" to be confusing when I first started survival analysis.

"Censored" individuals aren't removed from analysis; they're just treated differently from those with events/deaths. If censoring is non-informative, as @swmo discussed, then a censored individual provides information that the event did not occur up to the censoring time. Just doesn't provide the exact time.

A standard survival curve includes the censored patients, noting the censoring times with a mark on the curve at the censoring time. The survival curve only drops at times of (noncensored) events, with a drop given by the ratio of events at that time to the total at risk at that time, including those with later censoring times. So the survival curves for the wonder drug in your example would in fact look quite good, with the few early events leading to small drops in the curve (as the fraction of individuals dying early was small) and then a high survival fraction thereafter.

Also, you're not usually comparing censored to uncensored patients within a single treatment group, as the question seems to suggest. Rather, you're comparing the timing of events in treatment group A to those in treatment group B. So in a test of a poor drug A versus wonder drug B, there would be many events/deaths in group A and few in group B, or at least events would tend to happen earlier in group A.

If most patients in group B are "cured" and they are not otherwise at high risk of death, then the "survival times" of the censored individuals in that group would mostly be determined by the duration of the study. A longer survival time for censored versus non-censored individuals may just mean that the study went on long enough to pick up most of those who were not "cured" by drug B.

Related Question