Solved – How to select GARCH lag for forecasting purpose (AIC+likelihood ratio)

aicgarchlagslikelihood-ratiomodel selection

I modified this question to be more specific.

I'm fitting a ARIMA-GARCH model to my hedge fund index daily log return series. I used ACF, PACF, Ljung-Box test and Archtest to check for autocorrelation and conditional heteroskedasticity. As the ACF and PACF for return itself don't show significant autocorrelation, (but they do for squared return), also suggested by the Ljung-Box test with h=0, so I exclude the autocorrelation in the mean process. But to double check, I use an ARIMA(1,0,1), and for both coefficients for AR and MA terms are not statistically significant. So I exclude them and go for only a GARCH model.

I did check the other similar questions about garch lag selection (for example, here) and it seems like that when it comes to the function of predicting, it's better to choose the one with lowest AIC rather than BIC. So I first compare the AIC then I further check using likelihood ratio test.

Here are my results

enter image description here

I highlighted the lowest two for AIC and BIC.
And so I use likelihood ratio test to compare those highlighted models, at the end it indicates that GARCH(5,4) is the best at 95% significant level for each comparison, So my questions are:

  1. Though I only tried the lags until 5, it seems like for now I will have lower AIC if I further increase the lags, so should I try testing on more lags? Also I have concern that more lags is not suitable for a model aiming at forecasting. That leads to my second question.

  2. I'm new to this field but as far as I know, for parsimony, it is not favorable to have too many parameters in the model, as it might result in more error if we use the model to forecast.In that case, should I just go for a garch(1,1), like suggested by the answer from the link I share above? Or should I restrict to some lag number, say, I choose only until lag 2, if so that would give the choice of garch(2,2). And yet, how should I choose which lag number as the bound?

  3. Regarding the use of AIC BIC tests, I'm not sure whether I should use the whole model: ARMA(p,q)-GARCH(p,q) or just the GARCH(p,q)? In this case there's no arma process so I guess it is sufficient to use only Garch. In other time series with also arma process, I did try both, and it showed that with whole model I get lower number of AIC and BIC,but the ranking did not change. So, does it matter whether to use the entire model to test for aic and bic or not?

Here are the results from matlab, with ARMA(0,0)-GARCH(5,4), just to provide more information

enter image description here

I would appreciate if anyone could help me with them! References are welcomed as well:)

Best Answer

In general, building an ARMA-GARCH model in a stepwise fashion based on diagnostics such as ACF, PACF and Ljung-Box is problematic because the latter do not have standard null distributions when applied on returns when the conditional variance is nonconstant; or on squared returns when the conditional mean is nonconstant. Thus the following will not work exactly as you expect it to (but hopefully the distortion will not be too large and you could still trust the results to some extent):

I'm fitting a ARIMA-GARCH model to my hedge fund index daily log return series. I used ACF, PACF, Ljung-Box test and Archtest to check for autocorrelation and conditional heteroskedasticity. As the ACF and PACF for return itself don't show significant autocorrelation, (but they do for squared return), also suggested by the Ljung-Box test with h=0, so I exclude the autocorrelation in the mean process. But to double check, I use an ARIMA(1,0,1), and for both coefficients for AR and MA terms are not statistically significant. So I exclude them and go for only a GARCH model.

Going forward,

I did check the other similar questions about garch lag selection (for example, here) and it seems like that when it comes to the function of predicting, it's better to choose the one with lowest AIC rather than BIC. So I first compare the AIC then I further check using likelihood ratio test.

AIC, BIC and LR all address different questions and serve different goals. You should not expect all of them to point to the same direction, and you should choose the appropriate one based on your modelling goal. If the goal is forecasting, AIC is the most relevant choice.


Regarding your Q1, experience in finance tells us that high-order GARCH models do not tend to beat low-order GARCH models. I would stick to a relatively parsimonious model unless I had reasons to believe the time series is somehow special and unlike other financial time series. I do not see a sound theoretical reason to select a model that has higher AIC than another model (when there are not that many models being compared, like in your case), but the experience in finance points to a different solution.

Regarding your Q2, see above.

Regarding your Q3, it does matter that you consider the full model. Considering only part of the model does not make sense. (You could construct examples where you choose really poor models over much better models only because you happen to look at part of the picture instead of the whole picture.)