Solved – OLS results dependent on scaling of independent variable

least squarespython

I have the following python dataframe (5 rows out of nearly 15,000):

   N0_YLDF        MAT       MAP
0  6.286333  11.669069  548.8765
1  6.317000  11.669069  548.8765
2  6.324889  11.516454  531.5035
3  6.320667  11.516454  531.5035
4  6.325556  11.516454  531.5035

I run OLS using following formula:
N0_YLDF ~ MAT + MAP

Here, MAP stands for mean annual precipitation. The coefficient of y-intercept for MAP is 0.0079. However, when I divide MAP by 100, the coefficient increases to 0.79

Does this mean that I should normalize my independent variables?

Best Answer

@Alecos gave a very thorough mathematical answer.

More intuitively, suppose you regress weight on height. You measure weight in pounds and height in inches. Then you are told you should have measured weight in kilos and height in centimeters. Many of the numbers in the regression results will change, but the meaning will stay exactly the same.

So, in answer to your question: No. The fact you stated doesn't mean anything; it is an automatic consequence of what you did. Whether you should standardize your variables is a good question, but the changes you noted give no help in answering it.

Related Question