My question is not Matlab specific but more theoretical.
I'm currently using boosting to create a two class classifier and my week classifiers are trees. While I have a fairly large number of training examples in both classes, most of them are in one single class. I have the intuition that this difference in the number of examples in each class for the training set would deviate the resulting classifier from a "fair" one, towards one that benefits the class with more examples.
Am I right? what are the accepted ways to cope with this issue?
Thanks in advance!
Best Answer