MATLAB: Are the sample input and output matrices stored in the model itself (net.inputs{1}.exampleInput) in Neural Network Toolbox 6.0 (R2008a)?

Deep Learning Toolbox

When initializing a neural network with the NEWFF function, I noticed that the sample input and output matrices are stored in the model itself (net.inputs{1}.exampleInput). With huge training data sets, this would consume too much memory. The data and the model should be decoupled and not stored together.

Best Answer

NEWFF needs sample data to create an initialized network that can also be reinitialized properly with INIT. The sample data is used to configure data ranges, processing function settings, etc.

You should use a small subset of the data to create the network and then use the rest of the data to train it.

Changing or removing the sample data will automatically change the pre- and post-processing settings that are calibrated by that data, and is therefore not recommended.

The processing settings are also updated for any changes to the list of processing functions or parameters and having the data stored in the network object allows this to happen reliably.

Currently, the sample input and target data are stored in the network so that the network's input and output processing functions can have their processing settings automatically reconfigured if you make changes to those processing functions or their parameters before training.
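To see what is actually stored, the relevant network properties can be inspected directly. The following is a minimal sketch, assuming the R2008a-era NEWFF(P,T,S) calling form and the property names of that release; the toy data and the hidden layer size of 5 are made up for illustration.

```matlab
% Hypothetical toy data: 3 input rows, 1 target row, 20 samples.
inputs  = rand(3, 20);
targets = sum(inputs, 1);

% Create the network from the sample data (5 hidden neurons, chosen arbitrarily).
net = newff(inputs, targets, 5);

% The sample input is kept inside the network object.
storedSize = size(net.inputs{1}.exampleInput);
disp(storedSize)

% Processing functions and the settings calibrated from the sample data.
disp(net.inputs{1}.processFcns)
disp(net.inputs{1}.processSettings)

% Change the list of processing functions. Per the answer above, the
% settings are then recalibrated from the stored sample data.
net.inputs{1}.processFcns = {'mapstd'};
disp(net.inputs{1}.processSettings)
```

The memory cost discussed in the question comes from exampleInput (and the corresponding output property) growing with the number of columns passed to NEWFF.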

If you want to limit the amount of data stored in the network, for better memory efficiency, the recommended workaround is to create the network using a subset of the data (i.e. only supply a subset of the columns of inputs and targets). As long as the subset is still representative of the value ranges and of the presence of NaN values (only a concern in applications where unknown inputs are being used), the network will still train well.
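One way to pick such a subset is sketched below. It is only an illustration under stated assumptions: the data is synthetic, the subset keeps every 100th column plus the columns where a row reaches its minimum or maximum, and there are no NaN values. Training on the full data afterwards is also an assumption; you could equally pass only the remaining columns.

```matlab
% Hypothetical data set: 3 input rows, 1 target row, 5000 samples.
inputs  = rand(3, 5000);
targets = sum(inputs, 1);
N = size(inputs, 2);

% Columns where an input or target row attains its minimum or maximum.
[unusedA, iMinX] = min(inputs,  [], 2);
[unusedB, iMaxX] = max(inputs,  [], 2);
[unusedC, iMinT] = min(targets, [], 2);
[unusedD, iMaxT] = max(targets, [], 2);

% Add an evenly spaced sample of the columns (spacing chosen arbitrarily).
spaced = 1:100:N;
idx = unique([iMinX; iMaxX; iMinT; iMaxT; spaced(:)]);

subInputs  = inputs(:, idx);
subTargets = targets(:, idx);

% The subset should reproduce the value ranges of the full data.
sameRanges = isequal(minmax(subInputs),  minmax(inputs)) && ...
             isequal(minmax(subTargets), minmax(targets));

% Create the network from the subset (10 hidden neurons, arbitrary),
% then train on the full data.
net = newff(subInputs, subTargets, 10);
net = train(net, inputs, targets);
```

Only the columns in idx end up stored in the network object, so its size no longer grows with the full training set.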

In applications where there are no unknown input values, the ranges of inputs and targets could be used instead, alongside a third vector. (Having at least 2 vectors in inputs and targets is important to distinguish calls which supply input and target data from old calls to NEWFF that only supplied input ranges followed by layer sizes, etc.)

    inputs2 = [minmax(inputs) inputs(:,1)];
    targets2 = [minmax(targets) targets(:,1)];
    Net = newff(inputs2,targets2, …)

This workaround is only needed if memory efficiency of the network object is a concern.
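As a quick sanity check of the range-based workaround, the sketch below builds inputs2 and targets2 from made-up data and confirms that they have three columns each and preserve the ranges of the full data. The data is hypothetical and NEWFF itself is not called here.

```matlab
% Hypothetical data: 3 input rows, 1 target row, 1000 samples.
inputs  = rand(3, 1000);
targets = sum(inputs, 1);

% Range columns from minmax plus one real sample as the third vector.
inputs2  = [minmax(inputs)  inputs(:,1)];
targets2 = [minmax(targets) targets(:,1)];

% Three columns each, regardless of the number of samples.
disp(size(inputs2))
disp(size(targets2))

% The ranges of the reduced data match those of the full data.
rangesKept = isequal(minmax(inputs2),  minmax(inputs)) && ...
             isequal(minmax(targets2), minmax(targets));
disp(rangesKept)
```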

Related Solutions

MATLAB: How to change the activation function in ANN model created using toolbox

You are approaching the problem in exactly the wrong way.

The multilayer perceptron with one hidden layer is a universal approximator. The only reason to use more than one hidden layer is to reduce the total number of unknown weights by reducing the total number of hidden nodes (i.e., H1+H2 < H).

Ntrn training pairs of I-dimensional inputs and O-dimensional output targets yield Ntrneq = Ntrn*O training equations. The best way to obtain a robust design that tends to be resistant to noise, interference, and measurement and transcription errors is to MINIMIZE the number of unknown weights, Nw, that yield an acceptable solution. If possible, Nw << Ntrneq is desirable.
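The counting argument can be made concrete with a few lines of arithmetic. This is a sketch under assumptions not stated in the answer: a single hidden layer with H nodes and a bias on every hidden and output node, so that Nw = (I+1)*H + (H+1)*O. The sizes are made up, and Hub is used here as the name for the largest H that keeps Nw <= Ntrneq.

```matlab
I    = 4;     % input dimension (hypothetical)
O    = 3;     % output dimension (hypothetical)
Ntrn = 105;   % number of training pairs (hypothetical)
H    = 10;    % candidate number of hidden nodes

Ntrneq = Ntrn * O;                 % number of training equations
Nw     = (I + 1)*H + (H + 1)*O;    % weights and biases, one hidden layer

% Solving (I+1)*H + (H+1)*O <= Ntrneq for H gives an upper bound.
Hub = floor((Ntrneq - O) / (I + O + 1));

fprintf('Ntrneq = %d, Nw = %d, Hub = %d\n', Ntrneq, Nw, Hub);
```

With these numbers Nw is well below Ntrneq, which is the regime the answer recommends; values of H close to Hub leave no margin.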

 1. Use FITNET (calls FEEDFORWARDNET) for regression and curve-fitting.
 2. Use PATTERNNET (calls FEEDFORWARDNET) for classification and pattern-recognition.
 3. You have a classification problem. Start with the simple code in

        help patternnet
        doc patternnet

 4. If there are c classes, the target matrix columns should be columns of eye(c): O = c.
 5. The relationship between the true class indices 1:c and the target columns is

        target           = ind2vec(trueclassindices);
        trueclassindices = vec2ind(target);

 6. Before starting the design, get a "feel" for the data. This may include
     a. plot inputs
     b. plot targets
     c. plot targets vs inputs
     d. standardize inputs to zero mean and unit variance using zscore or mapstd
     e. repeat a and c
     f. remove or modify errors and outliers
 7. Start simple with the example used in the help and doc documentation.

        help patternnet
        doc patternnet

 8. You only have to vary 2 things:
     a. Number of hidden nodes (want as small as feasible)
     b. Initial random weights
 9. This can be accomplished with a double for loop, as I have illustrated in zillions of examples in the NEWSGROUP and ANSWERS. Search results:

        NEWSGROUP HITS
        greg patternnet Ntrials            8

        ANSWERS HITS
        greg patternnet Ntrials           60
        greg patternnet Ntrials  Hmax     22
        greg patternnet Ntrials  Hub      17
        greg patternnet Ntrials  Hub Hmax 10
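The steps above can be sketched roughly as follows. This is not the code from the posts referred to; it is a minimal illustration with synthetic three-class data, and Hmax, Ntrials, the class centers, and the choice of validation error as the selection criterion are all assumptions.

```matlab
rng(0);                                   % reproducible random numbers

% Hypothetical toy data: c = 3 classes, 50 samples each, 2 input rows.
c = 3;
trueclassindices = repmat(1:c, 1, 50);
N = numel(trueclassindices);
centers = [0 4 8];
x = [randn(1, N) + centers(trueclassindices); randn(1, N)];

% Targets are columns of eye(c).
t = full(ind2vec(trueclassindices));

% Standardize each input row to zero mean and unit variance.
xn = mapstd(x);

Hmax    = 5;      % largest number of hidden nodes to try (assumed)
Ntrials = 3;      % random weight initializations per H (assumed)
bestErr = Inf;
bestH   = NaN;

for H = 1:Hmax
    for trial = 1:Ntrials
        net = patternnet(H);                 % new, unconfigured network
        net.trainParam.showWindow = false;   % suppress the training GUI
        [net, tr] = train(net, xn, t);       % configures and trains from random weights

        % Error rate on the validation columns chosen by the data division.
        y = net(xn(:, tr.valInd));
        [unusedMax, assigned] = max(y, [], 1);
        errVal = mean(assigned ~= trueclassindices(tr.valInd));

        % Strict "<" keeps the smallest H among equally good designs.
        if errVal < bestErr
            bestErr = errVal;
            bestH   = H;
            bestNet = net;
        end
    end
end
fprintf('Best H = %d, validation error rate = %.3f\n', bestH, bestErr);
```

Because the outer loop runs from small to large H and ties are not replaced, the sketch favors the smallest number of hidden nodes that reaches the best validation error.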

Hope this helps.

Thank you for formally accepting my answer

Greg

MATLAB: Error in newff neural network training function

Odd, that routine has been part of the neural network toolbox for a long time. Try

restoredefaultpath

If that does not work, then you probably need to reinstall the toolbox.
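Before reinstalling, a few standard commands can show whether NEWFF is visible on the path at all. This is a diagnostic sketch, not a guaranteed fix; note that restoredefaultpath drops any custom path entries for the current session.

```matlab
restoredefaultpath        % reset the search path to the factory default
rehash toolboxcache       % refresh the cached list of toolbox files

which newff -all          % list every newff found on the path, if any
found = exist('newff', 'file');   % nonzero if a file named newff is on the path
disp(found)

ver                       % list installed products and their versions
```

If found is 0 and the toolbox does not appear in the ver output, the toolbox is most likely missing from the installation.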
