GARCH – Different Outcomes with Different Conditional Distributions in GARCH Model (rugarch)

garchmodel selection

I am using an external regressor in the variance equation of a GARCH model (rugarch). However, the value of the GARCH coefficient is coming out to be zero with a p-value of 1 and the coefficient of external regressor to be significant, when I am using the ged distribution. When I am using the std distribution, the GARCH coefficient remains significant and the external regressor insignificant. Opposite outcome, just by the change of the conditional distribution. My question is how to choose the distribution then? (this is the original post).

I will need to estimate a GARCH model, first without any exogenous variables, 2nd with exogenous variable ex1 and 3rd with exogenous variable ex2. Please note the table below:

Model specs AIC using ged AIC using std
GARCH without exog regressor 2.3743 2.3705
GARCH with 1st exog regressor ex1 2.3746 2.3708
GARCH with 2nd exog regressor ex2 2.3140 2.3689

In the above AIC is lower when using ged for ex2, compared to std. But the problem is that they are all part of the same process, and will need to be estimated simultaneously. Can I select two conditional distributions for estimating the three models of the same process? That is for GARCH without exog regressor I use std, for GARCH with 1st exog regressor ex1, I use std and for GARCH with 2nd exog regressor I use ged.

enter image description here

Best Answer

Such tiny differences between AIC would normally mean all the models are roughly equally good (or equally bad). However, the table contains AIC-per-observation, not AIC, as rugarch has got the nomenclature wrong. To obtain the actual AIC, you would have to multiply it by the sample size $T$. If $T$ is 100 or more, the differences are quite salient. There seems to be a clear winner at 2.314.

(I would also check whether the reported AIC-per-observation is has not been multiplied by -1; then the winner would be the highest value, i.e. 2.3746, and it would be not such a clear-cut case unless $T$ is about 666 or more. I use 666 to yield a difference of 2 between the AIC values of the best and the next best model. But according to p. 28 of the rugarch vignette, the signs are not mixed up, so then there is no need to multiply by -1.)