MATLAB: Neural Network NAR-based time-series prediction starts failing after several timesteps

Tags: closeloop, Deep Learning Toolbox, narnet, neural networks, time series, tutorial

I am starting to experiment with NAR-based time-series prediction. I've followed several tutorials to write a small, simple script to predict a plain sinusoid, cos(t). The resulting prediction is quite good (as expected) at the beginning, but as time progresses the network starts failing catastrophically. Is there anything I am doing wrong?
Here is the code I am using:
DELAY=1:100;                                  % feedback delays
HIDDEN=[10];                                  % one hidden layer with 10 nodes
t=linspace(1,100,1000);
prueba=cos(t);                                % target time series
datos=prueba;
net = narnet(DELAY,HIDDEN);
[Xs,Xi,Ai,Ts] = preparets(net,{},{},num2cell(datos));
net = train(net,Xs,Ts,Xi,Ai);                 % open-loop training
net = closeloop(net);                         % close the loop for multistep prediction
[Xs,Xi,Ai,Ts] = preparets(net,{},{},num2cell(prueba));
y = net(Xs,Xi,Ai);                            % closed-loop prediction
plot(prueba(DELAY(end)+1:end),'k')            % target (black)
hold on
plot(cell2mat(y),'r')                         % prediction (red)
The results I am getting are shown in the figure below (target in black, prediction in red).

Best Answer

After a not-so-quick look, I have the following comments:
1. cos(t) has a period of 2*pi and satisfies a 2nd-order homogeneous difference equation. Therefore
a. Only two delays are strictly necessary; in practice, no more than eight to sixteen delays per period should be needed.
b. No hidden layer is necessary, and one hidden layer with a single hidden node is rarely an improvement. One hidden layer with H = 2 hidden nodes should be more than sufficient (see the first sketch after this list).
2. The autocorrelation function of cos(t) with N = 1000 and dt = 0.1 has 859 positive-lag points with significant correlations, i.e. absolute values greater than 0.046 (a way to check this is sketched after the list).
3. The default divideFcn of narnet is 'dividerand'. However, random sampling of a uniformly sampled time series destroys the beneficial effect of the correlations. 'divideblock' or 'divideind', or even 'divideint' with dt = 0.3, should work much better (see the division sketch after the list).
4. With I = O = 1, N = 1000, Ntrn = 700, Nval = 150, Ntst = 150, NFD = 100 and H = 10, there are
Nw = (NFD+1)*H + (H+1)*O = 1021 unknown weights
but only
Ntrneq = Ntrn*O = 700 training equations.
Therefore the net is severely overfit, and overtraining mitigation via a large validation set and/or MSE regularization should be instituted (the count is reproduced in a sketch after the list).
5. Run the original data through the closed-loop configuration and separately tabulate trn, val, and tst performance before testing on new data (see the last sketch after the list).
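Regarding item 1, here is a minimal sketch of the smaller configuration (two feedback delays, H = 2 hidden nodes), reusing the cos(t) series from the question; the exact numbers of delays and hidden nodes are starting points to tune, not definitive choices:
DELAY  = 1:2;                               % two feedback delays are enough for a sinusoid
HIDDEN = 2;                                 % H = 2 hidden nodes
t      = linspace(1,100,1000);
datos  = cos(t);
net = narnet(DELAY,HIDDEN);
[Xs,Xi,Ai,Ts] = preparets(net,{},{},num2cell(datos));
net = train(net,Xs,Ts,Xi,Ai);               % open-loop training
netc = closeloop(net);
[Xc,Xic,Aic,Tc] = preparets(netc,{},{},num2cell(datos));
yc = netc(Xc,Xic,Aic);                      % multistep closed-loop prediction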
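Regarding item 2, a rough base-MATLAB sketch of how to count the significant autocorrelation lags yourself; the 0.046 threshold is simply the value quoted above, and the exact count depends on the autocorrelation estimator used:
x  = cos(linspace(1,100,1000));
x  = x - mean(x);
N  = numel(x);
ac = zeros(1,N);                            % biased autocorrelation estimate
for k = 0:N-1
    ac(k+1) = sum(x(1:N-k).*x(1+k:N))/N;
end
ac = ac/ac(1);                              % normalize so that lag 0 equals 1
nsig = sum(abs(ac(2:end)) > 0.046)          % number of significant positive lags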
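Regarding item 3, the division function is a single property assignment; a sketch using the smaller net from item 1 and the usual 0.70/0.15/0.15 ratios (adjust to taste):
net = narnet(1:2,2);
net.divideFcn = 'divideblock';              % contiguous trn/val/tst blocks preserve the correlations
net.divideParam.trainRatio = 0.70;
net.divideParam.valRatio   = 0.15;
net.divideParam.testRatio  = 0.15;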
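Regarding item 4, the weight and equation counts for the original configuration can be reproduced directly:
NFD  = 100;                                 % number of feedback delays in the question
H    = 10;                                  % hidden nodes
O    = 1;                                   % outputs
Ntrn = 700;                                 % nominal 70 percent of N = 1000
Nw     = (NFD+1)*H + (H+1)*O                % 1021 unknown weights
Ntrneq = Ntrn*O                             % only 700 training equations
% Nw >> Ntrneq, hence the severe overfitting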
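Regarding item 5, a sketch of tabulating closed-loop performance on the original series, continuing from the variables in the question and assuming the training call is changed to return the training record tr:
[net,tr] = train(net,Xs,Ts,Xi,Ai);          % keep the training record for the division indices
netc = closeloop(net);
[Xc,Xic,Aic,Tc] = preparets(netc,{},{},num2cell(prueba));
yc = netc(Xc,Xic,Aic);
e  = cell2mat(Tc) - cell2mat(yc);           % closed-loop errors on the original series
msetrn = mean(e(tr.trainInd).^2)            % training-subset MSE
mseval = mean(e(tr.valInd).^2)              % validation-subset MSE
msetst = mean(e(tr.testInd).^2)             % test-subset MSE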
Hope this helps.
Thank you for formally accepting my answer
Greg