Solved – p-value test for comparing classifiers

MATLABp-valuet-test

I have to use p-value test to compare two classifiers. I am getting the error vectors of 2 hypothesis and I am using their difference as an input in ttest function of MATLAB

[H,p] = ttest(error,'Tail','both');

I am getting H as 0 and p as 0.34
As far as I read, this means that we cannot reject null hypothesis in this case. I am not sure
I don't know how to interpret that? Can anybody help?
And again what will one tailed test do in this case?

Best Answer

The $t$-test is not a very good choice to compare classifiers, since their performance is not normally distributed. I suggest considering the Wilcoxon signed-ranks test instead.

For a thorough read on this topic please refer to this paper (available here):

Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." The Journal of Machine Learning Research 7 (2006): 1-30.

The above paper lists multiple arguments against using the $t$-test for classifier comparison.