MATLAB: The gradient of mini batches

deep learningDeep Learning Toolboxgradientmini batchtraining loop

Hi there.

I need your confimation or rejection for this question…

In following code, if the minibatch size is h,

[grad,loss] = dlfeval(@modelGradients,dlnet,dlX_miniBatch,Y_miniBatch);

the grad is the average of gradients of loss over h samples? Does it calculate dradients automatically and at the end with:

grad = 1/h * sum_i=1:h (\nabla loss(y_i,yHat_i)) ??

Following this question, for computing the total loss and geadient (for a full batch), does we should take avarage of losses and averages of gradients (averaging with the number of batches, say 1000 batches each with h size)??

Best Answer

The function dlfeval evaluate the custom deep learning models. The loss are calculated based on what has been defined in modelGradients function. So if you are calculating the average loss in this function, then it will return the averaged one. For example, consider this modelGradient function, it is calculating the average cross entropy loss, so it will return the average loss. The gradients are calculated with respect to the loss function defined in for the network.

Best Answer

Related Solutions

MATLAB: 畳み込みニューラルネ​ットワークの学習にお​けるミニバッチの精度​とは何ですか？

MATLAB: Size of Mini-Batch in deep learning

Related Question

MATLAB: 畳み込みニューラルネットワークの学習におけるミニバッチの精度とは何ですか？