The best approach for regression is to start with FITNET, accepting as many defaults as possible. The default I-H-O node topology contains Nw = (I+1)*H+(H+1)*O unknown weights. Ntrn training examples yield Ntrneq = Ntrn*O training equations with Ntrndof = Ntrneq-Nw training degrees of freedom. The average variance of the training target examples is MSEtrn00 = mean(var(target')). For Ntrndof > 0, obtaining a mean-square error lower than MSEtrngoal = 0.01*Ntrndof*MSEtrn00/Ntrneq yields a normalized, DOF-adjusted MSE of NMSEtrna <= 0.01 and a corresponding adjusted training R-squared of R2trna = 1-NMSEtrna >= 0.99. That is interpreted as successfully modeling at least 99% of the variance in the target.
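The quantities above can be computed directly from the data matrices. A minimal sketch (the variable names input, target, and the candidate value H = 10 are illustrative assumptions, not part of any toolbox API):

```matlab
% Assumed layout: input is I x Ntrn, target is O x Ntrn (column = one example)
[I, Ntrn] = size(input);
[O, ~]    = size(target);
H = 10;                                 % hypothetical candidate hidden-node count

Nw      = (I+1)*H + (H+1)*O;            % number of unknown weights
Ntrneq  = Ntrn*O;                       % number of training equations
Ntrndof = Ntrneq - Nw;                  % training degrees of freedom

MSEtrn00   = mean(var(target'));        % average variance of the target rows
MSEtrngoal = 0.01*Ntrndof*MSEtrn00/Ntrneq;   % goal for R2trna >= 0.99 (needs Ntrndof > 0)
```

Note that var(target') takes the variance down each column of target', i.e. across the Ntrn examples of each output row, and the mean then averages over the O outputs.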
The training objective is to minimize H subject to the constraint R2trna >= 0.99. This is usually achieved by trial and error over a double for loop: an outer loop over hidden node candidate values h = Hmin:dH:Hmax and an inner loop over i = 1:Ntrials random weight initializations. I have posted many, many examples. Search the NEWSGROUP and ANSWERS using
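The double-loop search can be sketched as follows. This is only an outline, not a definitive implementation: Hmin, dH, Hmax, and Ntrials are values you choose yourself, the data layout assumptions from above are reused, and tr.best_perf is taken here as the training-set MSE at the stopping epoch.

```matlab
rng('default')                              % initialize the RNG so runs can be duplicated
Hvec = Hmin:dH:Hmax;                        % candidate hidden-node counts (your choice)
R2trna = zeros(Ntrials, numel(Hvec));

for j = 1:numel(Hvec)
    h       = Hvec(j);
    Nw      = (I+1)*h + (h+1)*O;            % weights for this candidate topology
    Ntrndof = Ntrneq - Nw;                  % DOF for this candidate topology
    for i = 1:Ntrials                       % Ntrials random weight initializations
        net = fitnet(h);                    % accept the remaining defaults
        [net, tr] = train(net, input, target);
        MSEtrn    = tr.best_perf;           % training MSE at the best epoch
        % DOF-adjusted normalized MSE: SSEtrn/Ntrndof scaled by MSEtrn00
        NMSEtrna    = Ntrneq*MSEtrn/(Ntrndof*MSEtrn00);
        R2trna(i,j) = 1 - NMSEtrna;
    end
end
% Smallest h whose best trial achieves R2trna >= 0.99 wins
```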
If Ntrneq < ~2*Nw, the net is overfit; validation stopping and/or regularization should be used to mitigate overtraining.
The best approach to avoid overtraining is to use BOTH validation set stopping AND regularization.
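One way to combine the two in a sketch: FITNET's defaults (TRAINLM with DIVIDERAND) already give you validation stopping, and setting the regularization parameter of the MSE performance function adds a weight-decay term. The value 0.1 below is an arbitrary example, not a recommendation:

```matlab
net = fitnet(h);                          % defaults: trainlm + dividerand
                                          % => validation stopping is already active
net.performParam.regularization = 0.1;    % example value; adds weight penalty to mse
[net, tr] = train(net, input, target);
```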
HOWEVER, FOR SOME STRANGE REASON, using validation stopping with TRAINBR is NOT AVAILABLE in the NN TOOLBOX!
Your choice of TRAINBR instead of FITNET is not wrong. However, you have made numerous errors, especially by not accepting as many defaults as possible.
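If you do stay with TRAINBR, the minimal form is to hand it to FITNET and otherwise accept defaults. Since TRAINBR cannot use a validation set, it is common to assign all data to training; this is a sketch, not your exact setup:

```matlab
net = fitnet(h, 'trainbr');               % Bayesian regularization training
net.divideFcn = 'dividetrain';            % no validation stopping with trainbr,
                                          % so use all data for training
[net, tr] = train(net, input, target);
```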
Why not just use the syntax in
with the double loop approach?
Don't forget to initialize the RNG before the first loop so that you can duplicate results.
Hope this helps.
Thank you for formally accepting my answer
Greg