MATLAB: NARX OPTIMUM HIDDEN NODES NUMBER

Deep Learning Toolboxhubnarxnetneuralneural network

I used the code below to try to get the optimum number of hidden nodes for a narx network….I would like any suggestions concerning if I am implementing the code correctly. THANKS

        close all,clear all, clc, plt=0;
          tic
         load 'ProjectD'
        X = GasProduced; %[1x1500] cell

        T = OilRate;     %[1x1500] cell
         x  = cell2mat(X);
         t  = cell2mat(T);
         [ I N ] = size(X);           % [ 1 1500]
         [ O N ] = size(T);
        MSE00 = mean(var(t',1)) % 1.1197e+07
        MSE00a = mean(var(t',0)) %1.12041e+07
        %Normalization
        zx = zscore(cell2mat(X), 1);
        zt = zscore(cell2mat(T), 1);
            Ntrn = N-2*round(0.15*N)  
            trnind = 1:Ntrn
            Ttrn = T(trnind)
        Neq     = prod(size(Ttrn))     % 1500
        %Significant Lags were determined using the code at :
        <http://www.mathworks.com/matlabcentral/newsreader/view_thread/341287#935393>
        %sigilag95: [0:517 520 521 524 525 636:1255 1345:1384] %Significant Input Lag
        %sigflag95: [0:348 411:1207 1321:1401]  %significant Feedback lag
        rng('default')
        % %  

        % %  
FD   = 1:20; %Random Selection of sigflag subset
ID   = 1:20; %Random selection of sigilag subset
   NFD  = length(FD)   % 
   NID  = length(ID)  %
MXFD  = max(FD)      
   MXID = max(ID)
Ntrneq = prod(size(t))
   %  Nw =  ( NID*I + NFD*O + 1)*H + ( H + 1)*O
   Hub     =  -1+ceil( (Ntrneq-O) / ((NID*I)+(NFD*O)+1))  
    Hmax    =  floor(Hub/10) %  
    Hmax = 2 ==>  Nseq >>Nw :
           Hmin    = 0
           dH      = 1
           Ntrials = 25
           j=0
           rng(4151941)
           for h = Hmin:dH:Hmax
              j = j+1
              if h == 0
                  net = narxnet( ID, FD, [] );
                  Nw =  ( NID*I + NFD*O + 1)*O
              else
                  net = narxnet( ID, FD, h );
                  Nw =  ( NID*I + NFD*O + 1)*h + ( h + 1)*O
              end
              Ndof            = Ntrn-Nw
              [ Xs Xi Ai Ts ] = preparets(net,X,{},T);
              ts              = cell2mat(Ts);
              xs              = cell2mat(Xs);
              MSE00s          = mean(var(ts',1))
              MSE00as         = mean(var(ts'))
              MSEgoal         = 0.01*Ndof*MSE00as/Neq
              MinGrad         = MSEgoal/10
              net.trainParam.goal      =  MSEgoal;
              net.trainParam.min_grad  =  MinGrad;
              net.divideFcn            =  'dividetrain';
              for i = 1:Ntrials
                  net            =  configure(net,Xs,Ts);
                  [ net tr Ys ]  =  train(net,Xs,Ts,Xi,Ai);
                  ys             =  cell2mat(Ys);
                  stopcrit{i,j}  = tr.stop;
                  bestepoch(i,j) = tr.best_epoch;
                  MSE            = mse(ts-ys);
                  MSEa           = Neq*MSE/Ndof;
                  R2(i,j)        = 1-MSE/MSE00s; 
                  R2a(i,j)       = 1-MSEa/MSE00as;
              end
           end
           stopcrit   =  stopcrit    %Min grad reached (for all).
           bestepoch  =  bestepoch
           R2         =  R2
           R2a        =  R2a
           Totaltime  =  toc

Best Answer

I get the same results as you.

1. However, there are some code inconsistencies including:

a. Using Ntrn = 70 with 'dividetrain'
b. Not taking lags into account when counting equations... the equation count should be based on ttrns.

2. I now find using o (instead of s) and c as subscripts for openloop and closeloop to be more natural.

3. Sorry for not suggesting

a. less trivial dataset examples like simpleseries and pollution.
b. Using ID = 0

3. Remember, DIVIDETRAIN is only recommended for estimating the minimum values of ID, FD, and H that will yield acceptable performance. To get reliable estimates of performance on unseen data use data division with Ntrn as small as possible.

Hope this helps.

Greg

Related Solutions

MATLAB: Narx delays problem & multistep ahead predictions

The version of the code you are using is both dated and error prone. Check both the NEWSGROUP and ANSWERS for the latest version.

Also: You are mistaking correlation values for correlation lags.

Hope this helps.

Thank you for formally accepting my answer

Greg

MATLAB: Finding Optimal ID, FD and Hidden Nodes for NARXNET

> Hi, After a lengthy research, I have finally have better Understanding about ID and FD. I have then put some code together to find the optimal ID and FD and then using these ID and FD to find optimal hidden node for my NARXNet using simplenarx dataset. While I am convince that it is correct but I am not very Confident if this is the correct way of doing things so I would really appreciate any comments/correction if any.

> Some additional question are:

1) Should I use data division such as 60/20/20 in the double for loop?

 You have used TRAINBR for which Nval = 0 and performFcn = msereg
 HOWEVER,you have imposed performFcn = mse and DIVIDETRAIN for which Ntst = 0 .
 VERY CONFUSING!

2) I used intersect command to find the subset of lags, is this correct way of doing it?

 NO.
 GEH1 'WHAT IS THE FOLLOWING COMMAND FOR?'

if true % code end

 GEH2 = 'USE ONLY TRAINING DATA TO DETERMINE DELAYS' 
 GEH3 = 'REMAINING POST STRICTLY VALID ONLY FOR I = O = 1 '
       ' MODIFICATIONS NEEDED FOR MULTIVARIABLE DATA'
 GEH4 = 'CANNOT USE FD = 0 WILL GET ERROR'

> Using Fixed ID and FD to Find Optimal Number of Hidden Node

subset_ID_FD = intersect(sigflag95, sigilag95)

GEH5 =  '0     3     4     5    10'

Opti_ID_FD = max(subset_ID_FD);

GEH6 = 'Opti_ID_FD = 10 NOT NECESSARILY OPTIMAL!'

Ntrn = N-2*round(0.15*N) % default 0.7/0.15/0.15 trn/val/tst ratios

GEH7 = 'ABOVE NOT VALID FOR TRAINBR  0.85/0/0.15'

%ID = 1:2 %default for Prediction ID = 1:Opti_ID_FD; % 0:2 % Regression (default)

 GEH8 = 'ZERO DELAY IS NOT A MATLAB DEFAULT'

Hub = floor((Ntrneq-O)/(NFD*O+O+1)) % 5

 GEH9 = 'Hub = (Ntrneq-O)/(NID*I+NFD*O+1)= 3.29'
 Hmax = Hub; % 2 is sufficient to get R2=0.999
dH =1;
Hmin = 1;
Ntrials = 10;
%

trainFcn = 'trainbr'
%
rng('default')
j=0
for h = Hmin:dH:Hmax
  j=j+1
  if h==0
      neto = narxnet(ID,FD,[],'open',trainFcn);
      Nw = (NID*I+NFD*O+1)*h+(h+1)*O;
 GEH10 = 'Nw = (NID+1)*O'

neto.divideFcn = 'dividetrain'; % No data division

 GEH11 = 'NEED NONTRAINING DATA FOR UNBIASED PREDICTION !!'

neto.performFcn = 'mse';

 GEH12 ' TRAINBR USES MSEREG WITH NO VAL SET'
 [Xo Xoi Aoi To ] = preparets(neto,X,{},T);
 to = cell2mat(To);
 MSE00o = var(to,1)
 MSE00oa = var(to,0)
 MSEgoal = 0.005*max(Ndof,0)*MSE00oa/Ntrneq
 GEH13 = 'ONLY USE TRAINING DATA TO COMPUTE TRAINING PARAMETERS'
 GEH14 = SUGGEST LOOKING AT TRAINING RECORD tro.

Hope this helps.

 *Thank you for formally accepting my answer*

Greg

Best Answer

Related Solutions

MATLAB: Narx delays problem & multistep ahead predictions

MATLAB: Finding Optimal ID, FD and Hidden Nodes for NARXNET

Related Question