Solved – Is every ARIMA(1,1,0) model equivalent to an AR(2) model

arimartime series

Assume I have a time series $ x_t $ that I want to fit using an ARIMA(1,1,0) model of the form:

$ \Delta x_t = \alpha \Delta x_{t-1} + w_t $

This could be rewritten as:

$ x_t – x_{t-1} = \alpha ( x_{t-1} – x_{t-2} )+ w_t $

$ x_t = ( 1 + \alpha)x_{t-1} – \alpha x_{t-2} + w_t $

The last equation describes an AR(2) model with coefficients $1+\alpha$ and $-\alpha$. I recognize that, depending on $\alpha$, this AR(2) model might be non-stationary. However, if I was taking a diff to begin with, then the series I am modeling shouldn't be stationary.

I know that if the model is non-stationary, a diff should be used. But how would the results differ if I used a AR(2) model vs an ARIMA(1,1,0) model? I assume (as hinted by R) that it has an issue with convergence. However, when I ask R to perform the fits, it will do both of them, and the coefficients are (mostly) consistent with my observations above. The forecasts are definitely different, though.

If anyone could shed some light on this, or point me to a good reference, I would appreciate it.

Here is the R code I used to generate both models.

> set.seed(2)
> x <- arima.sim(n = 1000, model=list(order=c(1,1,0), ar=c(0.3)))
> plot(x)
> arima(x, order=c(1,1,0))

Call:
arima(x = x, order = c(1, 1, 0))

Coefficients:
         ar1
      0.3291
s.e.  0.0298

sigma^2 estimated as 1.03:  log likelihood = -1433.91,  aic = 2871.81
> arima(x, order=c(2,0,0))

Call:
arima(x = x, order = c(2, 0, 0))

Coefficients:
         ar1      ar2  intercept
      1.3290  -0.3294    50.9803
s.e.  0.0298   0.0299    35.9741

sigma^2 estimated as 1.03:  log likelihood = -1438.93,  aic = 2885.86
Warning messages:
1: In log(s2) : NaNs produced
2: In log(s2) : NaNs produced
3: In log(s2) : NaNs produced
4: In arima(x, order = c(2, 0, 0)) :
  possible convergence problem: optim gave code = 1

Best Answer

The forecast for the ARIMA(1,1,0) enforces the restriction that $d=1$.

It is maybe even easier to see in the AR(1) vs. ARIMA(0,1,0) case: The latter is just $$ \Delta y_t=\epsilon_t $$ whose optimal forecasts are 0 at all horizons (we expect $\epsilon_t$ to take the value zero). If we aim to forecast $y_t$ itself, we take the last in sample value and just accumulate the forecast changes of $y_t$. Basically, we expect the value tomorrow to be today's value plus the expected change from today to tomorrow.

So, as we do not expect any changes here, the optimal forecast for such a random walk is $y_T$ ($T$ being the last in sample observation) for all $h=T+1,\ldots$.

If, on the other hand, we fit an AR(1) model, we obtain an estimate $\hat\alpha$ and produce the optimal forecasts from an AR(1) model as $$ y_{T+h}=\hat\alpha^hy_T $$ If estimation errors (as they generally will in finite samples) are such that $\hat\alpha$ differs from the true value of 1, the forecasts will differ.

Related Solutions

Solved – Simulation of ARIMA (1,1,0) series

If you want to simulate ARIMA you can use arima.sim in R, there is no need to do it by hand. This will generate the series you want.

e <- rnorm(100,0,0.345) 
arima.sim(n=100,model=list(ar=-0.7048,order=c(1,1,0)),start.innov=4.1,n.start=1,innov=2.1+e)

You can look at the code of how this is achieved by typing arima.sim in R command line. Alternatively if you do it yourself, the function you are probably looking is diffinv. It computes the inverse of lagged differences.

For recursive sequences R has a nice function filter. So instead of using loop

z <- rep(NA,100)
z[1] <- 4.1
for (i in 2:100) z[i]=cons+phi*z[i-1]+e[i]

you can write

filter(c(4.1,2.1+e),filter=-0.7048,method="recursive")

This will give the identical result to arima.sim example above:

diffinv(filter(c(4.1,2.1+e),filter=-0.7048,method="recursive")[-1])

Solved – time series forecasting using auto.arima and exponential smoothing

Seasonality is probably not very strong. Different algorithms will give different results, unless seasonality is glaringly obvious.
The best measure is always to compare forecast accuracy on a holdout set: hold back the last $n$ observations, fit your models to all other observations, forecast into the last $n$ time periods with both models, then compare forecast accuracy using your error measure of choice (see 5 below).
Yes, this is a common complaint. I don't think there is an easy way to get the in-sample fit. But you can get the residuals: auto.arima(WWWusage)$residuals. Best to look into the code of auto.arima() to see whether you need to add or subtract them from the original series to get the fit. I'd say you have to subtract ("actuals=model+residuals"), but better check.
I recommend a good forecasting textbook. This is a very good start. Otherwise, read through the help pages.
The appropriate error measure will depend on your personal loss function. Is your pain symmetric, and will it increase more strongly with larger errors? Then use MSE. Is your pain proportional to absolute errors? Then use MAE. Best to look at multiple error measures.

One tip: averaging forecasts will usually improve accuracy. Consider taking the average of your two models' forecasts per future time bucket.
auto.arima() apparently fits no drift, even if you allow it.