Solved – Is residuals autocorrelation always a problem

autocorrelationdiagnosticregression

I read that OLS underestimates variance when residuals are autocorrelated. I see why autocorrelation would be a problem in time series analysis, in the sense that the coefficient are not efficient because we're not including all the potential predictors. But is there a mathematical problem as well?

For example, we want to predict the used-cars sales margins. The data set includes each vehicle make, model, mileage p/gallon, price, options etc. and the final sale price. For some reason the catalog has been sorted by car make and year/model, so adjacent observations will likely have similar sale numbers. Is autocorrelation a problem in this case?

Best Answer

Correlated residuals in time series analysis may imply far worse than low efficiency: if the structure of autocorrelation implies integrated or near-integrated data, then any inferences about levels, means, variances, etc. may be spurious (with unknown direction of bias) because the population mean is undefined and the population variance is infinite (so, for example, the finite values $\bar{x}$ and $s_{x}$, and quantities derived from these are always false estimates of the corresponding population statistics).

That's not a problem that can be resolved by increasing sample size to offset inefficiency.

If autocorrelated errors obtain in OLS, I would say that the same issues may be present (it depends on the data generating process). Again: not an issue of efficiency.

The critical caveat is whether ordering of your data is meaningful: if the order has meaning in that it relates to the data generating process then you're in trouble.

Related Question