Solved – Lag Length from a VAR and Vector Error Correction Model (VECM)

lagsvector-autoregressionvector-error-correction-model

A colleague wrote a paper and I am reviewing it for him to make sure it is good. In the paper, the author estimated a VAR to determine the optimal lag length based on the Schwartz Criterion. Then later in the paper the author estimated a Vector Error Correction Model for men and women. For men, the error correction terms had two lags while for women it is three lags for the error correction terms.

I suspect that the lags from the VAR would be independent of the lags of the error correction terms and the lag of 3 for the women specification provided a better fit of the VECM. In other words, the lag lengths from the VAR and VECM would be independent of each other.

Thoughts?

Best Answer

This post gives you a cookbook recipe for building a VECM, including the step of determining the appropriate lag length according to information criteria. Actually, this particular step is not very elaborate in that post; fortunately, you can find it discussed more explicitly in the vignette (see especially p. 17-18) for the "vars" package in R. I will briefly add that use of BIC is generally fine for explanatory modelling, while AIC could be used if the model is intended for forecasting.

Are the lag lengths supposed to be independent in the VAR model for levels and the VECM model? No, just the opposite. There is a clear relationship which you will also find in the vignette mentioned above, which is that the lag order of the VECM should be one less than the order of the VAR model for levels. See also Lutkepohl "New Introduction to Multiple Time Series Analysis" and perhaps Pfaff "Analysis of Integrated and Cointegrated Time Series with R".

Finally, the error correction term is included in each equation of the VECM only once. It is either lagged by one or by $p$ where $p$ is the lag order of the VECM; the corresponding representations of the VECM are known as long-run and transitory; it is still the same model, just different representations -- pick the one you like. (I think it is possible to have the error correction term lagged by any $m$ between 1 and $p$, but this is not common practice unless $m$ equals 1 or $p$.) Meanwhile, you should not have the error correction term in multiple lags as you will face the problem of perfect multicollinearity.