Solved – Could ROC curve be used for regression problems

machine learning

A receiver operating characteristic curve, or ROC curve, is a graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied.

Could it be used for a regression problem, e.g. housing price prediction? How about bias-variance?

I am trying to train a model that predict the length of cut for operation on patients. I worry about it is reasonable and enough to just using error rate. The indirect evaluation is the recovery of patients after operation.

Could someone please give a hint? Thanks in advance.

Best Answer

In survival regression, a common metric is the "concordance index" that measures how well a regressor ranks pairs of samples. This concordance index is equivalent to the area under the ROC curve, see Why is ROC AUC equivalent to the probability that two randomly-selected samples are correctly ranked?

Note that the concordance index does not care about the absolute values of your predictions. The only thing that matters are the relative values compared to other samples in the dataset. A good concordance does therefore not mean that the regressor is also well calibrated.