When building a decision tree, suppose that there are two attributes that have the same maximum information gain.
Will there be any difference between choosing any of the two attributes to be a tree node? Or are there any other factors that I have to consider in order to decide which attribute should I choose?
Best Answer
You could look ahead at the information gain of the remaining attributes after a split and select based on that. In general though, if you're using information gain as your splitting criterion, it will be the only thing to look at.