MATLAB: Why do I get different results for a neural network when I explicitly specify the number of inputs and the input size?


Assume that I create two neural networks with the same structure (for example, a single hidden layer with 10 neurons) and a dataset 'data.mat' containing the variables 'x' and 't', corresponding to the features and targets respectively.
The only difference between the two networks is that in the first network I explicitly specify the number and size of the inputs, while in the second network I do not.
For a concrete example, consider the following MATLAB script:
% load data
load('data.mat');
trainFcn = 'traincgf';
n = 10; % number of hidden-layer neurons
% create and configure net1
net1 = fitnet(n,trainFcn);
net1 = configure(net1,'outputs',t);
% specify inputs and connections manually
net1.numInputs = 1;                % one input source
net1.inputs{1}.size = 20;          % each input vector has 20 elements (features)
net1.biasConnect = [1; 1];         % biases on both layers
net1.inputConnect = [1; 0];        % the input feeds layer 1 only
net1.layerConnect = [0 0; 1 0];    % layer 1 feeds layer 2
net1.outputConnect = [0 1];        % layer 2 produces the network output
% specify data split
net1.divideFcn = 'divideind';
[trainInd,valInd,testInd] = divideint(50, 0.7, 0.15, 0.15); % 50 samples, 70/15/15 interleaved split
net1.divideParam.trainInd = trainInd;
net1.divideParam.valInd = valInd;
net1.divideParam.testInd = testInd;
% train net1
[net1,tr1] = train(net1,x,t);   % optionally add 'CheckpointFile','ProteomicYield1000.mat'
y1 = net1(x);
% create and configure net2
net2 = fitnet(n, trainFcn);
net2 = configure(net2,'outputs',t);
% Don't specify inputs or connections manually
% use the same data split as net1
net2.divideFcn = 'divideind';
net2.divideParam.trainInd = trainInd;
net2.divideParam.valInd = valInd;
net2.divideParam.testInd = testInd;
% train net2
[net2,tr2] = train(net2,x,t);
y2 = net2(x);
Here 'net1' and 'net2' are two shallow neural networks with identical structure. For 'net1', I explicitly specify the number and size of the inputs after configuring the network; for 'net2', I skip that step and simply configure and train the network.
I run the above code on the same data, ensuring that the training, validation, and test sets are exactly the same for both networks.
Why do I get very different results for 'net1' and 'net2', with 'net2' consistently more accurate than 'net1'?

Best Answer

The weights for the first network, 'net1', are not being initialized properly. Specifically, 'net1' has two sets of weights, stored in the properties 'IW' (input weights) and 'LW' (layer weights). When you call the 'configure' function as follows:
net1 = configure(net1, 'outputs', t);
the layer weights in 'LW' are initialized, because the network has enough information to determine their size. However, the input weights in 'IW' are not initialized, because MATLAB does not yet know the input size. This is expected behavior.
Subsequently, when you set the size of the inputs:
net1.inputs{1}.size = 20;
MATLAB now has enough information to initialize the input weights 'IW'. However, no input data is available at that point, so 'IW' is initialized to zeros by default.
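As a quick sanity check (a sketch that reproduces the behavior in isolation, assuming 'data.mat' holds a 20-by-N 'x' and a 1-by-N 't' as in your script), you can inspect the weight matrices at each stage:
load('data.mat');
net = fitnet(10, 'traincgf');
net = configure(net, 'outputs', t);
isempty(net.IW{1,1})       % true: input size still unknown, so IW is not initialized
size(net.LW{2,1})          % 1-by-10: layer weights are already sized and initialized
net.inputs{1}.size = 20;   % set the input size without providing any input data
size(net.IW{1,1})          % 10-by-20: IW now exists ...
all(net.IW{1,1}(:) == 0)   % ... but every entry is zero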
Initializing the weight matrix of a neural network to zeros is the root cause of the poor accuracy in the first script: with all-zero input weights, the hidden layer initially ignores the input entirely, and the training algorithm has to recover from that degenerate starting point rather than from the usual random initialization.
There are two ways to fix this issue:
1) You can re-configure the network using the inputs before training, as follows:
net1 = configure(net1, x, t);
This will initialize the input weights correctly.
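For example (a minimal sketch continuing from the script above):
net1 = configure(net1, x, t);   % sizes and initializes IW from the actual data
any(net1.IW{1,1}(:) ~= 0)       % true: the input weights now have non-zero initial values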
2) Alternatively, you can delete the following line:
net1.inputs{1}.size = 20;
This leaves the input weights 'IW' uninitialized, and calling the 'train' function will then initialize them to appropriate values.
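In that case, the relevant part of the first script reduces to something like this (a sketch; 'train' infers the input size from 'x'):
net1 = fitnet(n, trainFcn);
net1 = configure(net1, 'outputs', t);
% no manual setting of net1.numInputs or net1.inputs{1}.size
net1.divideFcn = 'divideind';
net1.divideParam.trainInd = trainInd;
net1.divideParam.valInd = valInd;
net1.divideParam.testInd = testInd;
[net1, tr1] = train(net1, x, t);   % train sizes and initializes IW from x before training
y1 = net1(x);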
With either fix, 'net1' and 'net2' should perform similarly, since both now start from properly initialized weights.
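If you want to confirm this numerically, you can compare the two trained networks on the same data (a sketch; 'perform' evaluates the network's performance function, which is MSE for fitnet):
y1 = net1(x);
y2 = net2(x);
perform(net1, t, y1)   % should now be in the same range as ...
perform(net2, t, y2)   % ... the performance of net2
The two values will still differ slightly from run to run, because each network starts from its own random initial weights.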