Solved – Multicollinearity in WLS regression

heteroscedasticitymulticollinearityregressionweighted-regression

I run a weighted-least squares regression to account for the heteroscedasticity in my data. When I examine the Pearson correlations between all predictor variables, I can't detect high collinearity. However, when I apply WLS with the inverse of the variance as my weights and then screen the variance inflation factors of the regression, I find VIFs far above 10.

My question is: Does WLS always suffer from multicollinearity in my case as the weighting with the inverse of the variance necessarily leads to higher dependencies of the predictors?

Best Answer

It's a great question, because it concerns an important and fundamental issue about multiple regression. The answer is that weighting can change the VIF (a measure of collinearity among the regressors) by arbitrary amounts up or down (with a lower limit of $1$).

To see why that is, consider a model with two regressors $X_1$ and $X_2$. (We do not have to consider the response variable: it plays no role in computing the VIF.) Here is a scatterplot of 50 observations, with the symbol areas proportional to the weights:

Figure

If those weights were not applied, clearly the correlation would be strongly positive, because the points line up closely along a positively sloping line. Indeed, their (unweighted) VIF is $13.54$: large enough to drive one to investigate this model carefully for effects of collinearity. (Rules of thumb assert that VIFs above $5$ or $10$ begin to be of concern.)

The weights, though, throw the correlation in altogether a different direction. The points with the heaviest weights tend to be negatively correlated: some towards the upper left, others toward the lower right. These cause the weighted correlation to be nearly zero. Indeed, the VIF for these weighted data is merely $1.44$: low and benign.

The procedure works in reverse: we could weight the data to perform an OLS and then unweight them in a WLS. Thus it's just as possible that the weighted fit will reduce the VIF as increase it.