Solved – Positive correlation and negative regressor coefficient sign

correlationmultiple regressionregression coefficients

Is it possible to obtain a positive correlation between a regressor and a response (+0,43) and, after that, obtain a negative coefficient in the fitted regression model for this regressor?

I'm not talking about changes in the sign of the regressor among some models. The coefficient sign always remains.

Could the remaining variables of the fitted model influence the changing of the sign?

Best Answer

Both @Henry, and @JDav are pointing you in the right direction (+1 to each). However, I'm very visual and it helps me if I can see how this works. In that respect, here's a quick plot in which the first variable is confounded with group membership. If the groups are ignored, the correlation coefficient is positive (as can be seen in the figure), but in a multiple regression, $\beta_{x1}=-1$, albeit with different intercepts for the three groups. enter image description here
As further food for thought, when all variables are categorical (instead of continuous as in this case) the phenomenon of reversing the apparent relationship upon inclusion of other variables is known as Simpson's paradox. Since it's ultimately quite similar, it may help to read about that as well. It is discussed on CV here.

Related Question