"size of predictions and target values must match" when using custom loop training for multi-input and multi-output

Question

0 개 추천

Dear all,

I build a deep learning model with 2 input and 2 output and use a custom loop training. But I have an error when call dlfeval(): "size of predictions and target values must match" . What should I fix? Here is my code:

[grad,state,loss] = dlfeval(@modelGradientsMulti, dlnet, input1, input2, output1, output2);
%support function
function [gradients,state,loss] = modelGradientsMulti(dlnet,in1,in2,out1,out2)
[pre1,pre2,state] = forward(dlnet,in1,in2,'Outputs',["fc1" "fc2"]);
loss1 = mse(pre1,output1);
loss2 = mse(pre2,output2);
loss = loss1 + loss2;
gradients = dlgradient(loss,dlnet.Learnables);
end

Thank you for reading my question. I hope someone expert will give a help.

댓글 수: 2
없음 표시 없음 숨기기

Katja Mogalle 2022년 1월 21일

MATLAB Online에서 열기

From the error message my first guess would be that pre1 and output1 don't have the same size, or that pre2 and output2 don't have the same size. Can you put a breakpoint at the line where you compute mse for the first time and check the sizes of pre1, output1, pre2, and output2? If there are inconsistencies between the sizes we need to look further if the data or the network is set up incorrectly.

To better understand the sizes of activations in the network, I always suggest using analyzeNetwork:

>> analyzeNetwork(dlnet)

Let me know how you get on and what you find.

Van Vy 2022년 1월 24일

@Katja Mogalle: Thank you for your reply, I checked my network and there was no problem. Now I'm trying to re-organize my code in another way because I stucked in this point long time already.

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Follow Question

Answer 1

Jaimin 2024년 12월 3일

MATLAB Online에서 열기

0 개 추천

Hi @Van Vy

To resolve the error "size of predictions and target values must match" in a deep learning model with two inputs and two outputs using a custom training loop, I have created a simplified demo code.

Kindly refer to it for understanding.

% Create dummy data 
input1 = dlarray(randn(10, 32), 'CB'); % 10 features, 32 samples 
input2 = dlarray(randn(10, 32), 'CB'); % 10 features, 32 samples 
output1 = dlarray(randn(3, 32), 'CB'); % 3 outputs, 32 samples 
output2 = dlarray(randn(3, 32), 'CB'); % 3 outputs, 32 samples 
% Custom training loop 
numEpochs = 5; 
learningRate = 0.01; 
for epoch = 1:numEpochs 
    % Evaluate the model gradients and loss using dlfeval 
    [gradients, state, loss] = dlfeval(@modelGradientsMulti, dlnet, input1, input2, output1, output2); 
    % Update the network parameters using SGD 
    dlnet = dlupdate(@(w, g) w - learningRate * g, dlnet, gradients); 
    % Display the loss 
    fprintf('Epoch %d, Loss: %.4f\n', epoch, loss); 
end 
 
% Support function for computing gradients and loss 
function [gradients, state, loss] = modelGradientsMulti(dlnet, in1, in2, out1, out2) 
    % Forward pass 
    [pre1, pre2, state] = forward(dlnet, in1, in2, 'Outputs', ["output1", "output2"]); 
    % Compute losses 
    loss1 = mse(pre1, out1); 
    loss2 = mse(pre2, out2); 
    loss = loss1 + loss2; 
    % Compute gradients 
    gradients = dlgradient(loss, dlnet.Learnables); 
end