Solved – ARIMA Cross Validation

Tags: arima, forecasting, prediction, time series, validation

I work with R and have some questions regarding my ARIMA model. Specifically, I have yearly data ranging from 1946 to 2019 and would like to do a basic two-step-ahead ARIMA forecast for 2020 and 2021. Plotting the time series and the ACF/PACF, as well as performing a KPSS test, reveals a trend and non-stationary data (see plots below). After taking first differences, the KPSS test no longer rejects stationarity. I therefore assume first differencing (d = 1) is required in the model.
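The stationarity check described above can be sketched as follows. This is a minimal illustration, not the poster's actual code: the series here is simulated (a random walk with drift standing in for the real data), and the function names are from the tseries and forecast packages.

```r
library(tseries)   # kpss.test()
library(forecast)  # ndiffs()

# Simulated stand-in for the 1946-2019 yearly series (74 observations)
set.seed(1)
y <- ts(cumsum(rnorm(74, mean = 1)), start = 1946)

kpss.test(y)        # small p-value: reject level stationarity
kpss.test(diff(y))  # large p-value: first differences look stationary

# ndiffs() automates this and should agree that d = 1
ndiffs(y, test = "kpss")
```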

[Plots: time series with trend, ACF and PACF]

Applying the auto.arima() function to the full sample (1946-2019) gives a first intuition about the functional form of the time series. auto.arima() suggests an ARIMA(1,1,0) with drift. However, the model with the lowest AICc does not necessarily have to be a good model for forecasting, as Hyndman and Athanasopoulos (2018) write. 1
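The full-sample model selection step might look like this sketch (again on simulated stand-in data; `trace = TRUE` is what lets you see every candidate model that auto.arima() considers, which is relevant for the comparison below):

```r
library(forecast)

set.seed(1)
y <- ts(cumsum(rnorm(74, mean = 1)), start = 1946)  # stand-in for the real series

# Default selection by AICc; on trending data this typically includes a drift term
fit <- auto.arima(y)
summary(fit)

# Print every candidate model and its AICc during the search
auto.arima(y, trace = TRUE)
```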

Hence I do cross validation: I split the data into a training set (1946-2014) and a test set (2015-2019), run auto.arima() on the training set, and record all the suggested models, including the one with the lowest AICc, which again is an ARIMA(1,1,0) with drift. Next I produce forecasts with all of these models (9 in total) and compare their test-set MAE/RMSE. The model with the lowest MAE/RMSE is an ARIMA(1,1,1) with drift, but it still gives a very inaccurate (yet best?) forecast of the test set. I assume this is because it is not really trained on the strong upward trend at the upper end of the series. In the plot, the orange line shows the real data and the blue line with prediction intervals shows the forecast. The residual plot looks fine to me, and the Ljung-Box test with p = 0.31 does not indicate residual autocorrelation.

[Plot: test-set forecast (blue, with prediction intervals) vs. actual data (orange)]
[Plot: residual diagnostics]
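The train/test comparison could be sketched like this (simulated stand-in data; only two of the nine candidate models are shown, and `Arima()`, `accuracy()`, and `checkresiduals()` are from the forecast package):

```r
library(forecast)

set.seed(1)
y <- ts(cumsum(rnorm(74, mean = 1)), start = 1946)

train <- window(y, end = 2014)
test  <- window(y, start = 2015)

# Two of the candidate specifications from the question
fit110 <- Arima(train, order = c(1, 1, 0), include.drift = TRUE)
fit111 <- Arima(train, order = c(1, 1, 1), include.drift = TRUE)

# Forecast over the test horizon and compare training- vs. test-set MAE/RMSE
h <- length(test)
accuracy(forecast(fit110, h = h), test)
accuracy(forecast(fit111, h = h), test)

# Residual diagnostics, including the Ljung-Box test
checkresiduals(fit111)
```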

Here is the thing I struggle with, I separated it into two questions:

1) I am not quite sure whether the ARIMA(1,1,1) with drift is a good model to fit to the whole series (1946-2019) and use for the 2020 and 2021 forecasts, since it is still so inaccurate in cross validation. Would it be better to simply stick with the ARIMA(1,1,0) with drift (as suggested, based on AICc, by auto.arima() on both the full data set and the training set)? Or do I need an entirely different approach?

2) auto.arima() suggests different ARIMA specifications depending on the size of the training set. Since there are changes in the trend behavior of the series, it is not intuitive to me how many observations to include in the training set. If I make the training set smaller, the test-set forecasts become even more inaccurate, because the trend changes at the upper end of the series. If I make it larger, there are fewer forecast values to compare against. What would be a good approach in my case?

Thank you for your help!

Best, Theo Ruben


1 Hyndman, R. J. and Athanasopoulos, G. (2018). Forecasting: Principles and Practice, 2nd edition. OTexts: Melbourne, Australia

Best Answer

Your test set is very small: model comparisons based on five observations are unlikely to be trustworthy. You could instead do time series cross validation based on rolling forecast origins.* Even so, I would lean towards AIC rather than time series cross validation, given the discussion in the thread "AIC versus cross validation in time series: the small sample case", though the case is not clear cut.

*See the last section of Rob J. Hyndman's blog post "Why every statistician should know about cross-validation".
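Rolling-origin evaluation is implemented in the forecast package as tsCV(). A minimal sketch, again on simulated stand-in data (the order c(1,1,0), the horizon, and `initial = 30` are illustrative choices, not recommendations):

```r
library(forecast)

set.seed(1)
y <- ts(cumsum(rnorm(74, mean = 1)), start = 1946)

# Forecast function refit at every origin; order held fixed here for speed
fc <- function(x, h) {
  forecast(Arima(x, order = c(1, 1, 0), include.drift = TRUE), h = h)
}

# Rolling-origin forecast errors; initial = 30 keeps at least
# 30 observations in every training window
e <- tsCV(y, fc, h = 2, initial = 30)

# RMSE at each forecast horizon (columns of e are h = 1, 2)
sqrt(colMeans(e^2, na.rm = TRUE))
```

Because every origin contributes an error, this uses far more of the data for evaluation than a single 5-observation test set does.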