I'm wondering which quality measures are available for non-binary classifiers. I've read this article

https://en.wikipedia.org/wiki/Receiver_operating_characteristic

I understand, that the idea of Confusion matrix can be generalized easily. However, such metrics as sensitivity,specificity or precision seem to make sense only for binary classification problems.

So, my question is: are there any developed methods for evaluting the quality of non-binary classifiers?

Thank you.

## Best Answer

I'm not sure whether you're still interested in this topic or not, but I became curious about it and have read a little on the subject. I've decided to share the following information to store the findings for myself and with hope that this might be helpful for others as well.

It seems that there are quite a number of quality measures, which can be applied to various models, non-binary classification included. Some of them are mentioned on this page and in this paper. Another interesting paper, criticizing AUC measure can be found here. Please note that my answer might be a little more general, as I haven't restricted my search to classification problems.