Solved – Prediction interval for lasso regression with time series data

lasso, machine learning, mathematical-statistics, prediction interval, time series

I am currently working with time series data. My objective is to predict a certain value at time t given some other variables that we will know the same day (but prior to our target variable). After trying several models, I have managed to obtain a relatively good prediction using lasso regression.

However, given the importance of the problem, I would like to have some kind of confidence interval for my predictions; it would be very important to understand how accurate my prediction would be at a given probability level.

One solution I have thought about is using a certain number of past prediction errors (e.g., recent MAE values) to compute a standard deviation and, assuming the errors have a normal distribution, compute a 95% confidence interval as the prediction ±2 standard deviations.
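A minimal sketch of that idea in Python, with hypothetical values standing in for the real out-of-sample errors and lasso point forecast (the names `errors` and `y_hat` are illustrative, not from any particular library):

```python
import numpy as np

# Hypothetical recent out-of-sample errors (actual - predicted) from the lasso model
errors = np.array([1.2, -0.8, 0.5, -1.5, 0.9, 0.3, -0.4, 1.1])
y_hat = 42.0  # hypothetical point forecast for day t

sigma = errors.std(ddof=1)     # sample standard deviation of past errors
lower = y_hat - 2 * sigma      # ~95% interval under a normality assumption
upper = y_hat + 2 * sigma

print(f"Approximate 95% interval: [{lower:.2f}, {upper:.2f}]")
```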

One important consideration is that my dependent variable does not behave the same across the years; it is not stationary.

Would this be a robust way of computing these intervals, or are there better alternatives?

Best Answer

As suggested in the comments, I am turning my comment into an answer so the question will have an answer in the system.

Original:

Check RMSEP (Root Mean Square Error of Prediction). This value tracks your model's ability to predict out-of-sample values. You can calculate RMSEP over different time periods to determine how well you are predicting, which gives you a "standard error" of prediction that could be used in the calculation of a rough confidence interval.
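A rough sketch of how one might compute RMSEP from a rolling, time-ordered evaluation; the simulated data, lasso penalty, and number of splits below are purely illustrative:

```python
import numpy as np
from sklearn.linear_model import Lasso
from sklearn.model_selection import TimeSeriesSplit

# Hypothetical data: X holds the same-day predictors, y the target series
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 5))
y = X @ np.array([1.0, 0.5, 0.0, 0.0, -0.3]) + rng.normal(scale=0.5, size=300)

# Rolling-origin splits keep the test fold strictly after the training fold
errors = []
for train_idx, test_idx in TimeSeriesSplit(n_splits=5).split(X):
    model = Lasso(alpha=0.1).fit(X[train_idx], y[train_idx])
    errors.extend(y[test_idx] - model.predict(X[test_idx]))

errors = np.asarray(errors)
rmsep = np.sqrt(np.mean(errors ** 2))   # root mean square error of prediction
print(f"RMSEP: {rmsep:.3f}")
```

Computing RMSEP separately per split (rather than pooled, as here) is one way to see whether predictive accuracy drifts over time, which matters for a non-stationary target.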

I suppose a bit of an explanation of this answer is needed, since only a limited number of characters are allowed per comment. First, as @StephanKolassa noted in his comment, there is a difference between prediction intervals and confidence intervals. This is an important distinction. When calculating RMSEP, one is attempting to understand how well a model can predict, and one can compare different models by the relative magnitudes of their RMSEPs. The RMSEP by itself may not be extremely informative (in the way $R^2$ is for simple regression), but it can be enlightening for model comparison.

Additionally, building on @ChrisHaug's comment, you may want to look at the out-of-sample errors you use to calculate your RMSEP measure. They provide an empirical distribution that can shed some light on how well your model is predicting. For example, if your errors have an extremely heavy tail, it may indicate that your model does not do a great job if you are looking to avoid extremely unlikely but costly tail events (like an asset manager attempting to avoid exposure to recession-level events).
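A sketch of how the empirical error distribution could be turned into an interval and inspected for heavy tails; here simulated heavy-tailed errors stand in for the real out-of-sample errors collected above:

```python
import numpy as np
from scipy.stats import kurtosis

# Hypothetical out-of-sample errors (actual - predicted); a Student-t draw
# is used here only to illustrate a heavy-tailed error distribution
rng = np.random.default_rng(1)
errors = rng.standard_t(df=3, size=500)

# Empirical 95% interval around a hypothetical point forecast, no normality assumption
lo, hi = np.quantile(errors, [0.025, 0.975])
y_hat = 42.0
print(f"Empirical 95% interval: [{y_hat + lo:.2f}, {y_hat + hi:.2f}]")

# A quick check of tail heaviness relative to a normal distribution
print(f"Excess kurtosis of errors: {kurtosis(errors):.2f}")  # ~0 for normal, large for heavy tails
```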
