How to use Matlab trainnet to train a network without an explicit output layer (R2024a)

Question

Michael Solonenko on 8 Aug 2024

Link to this question: https://kr.mathworks.com/matlabcentral/answers/2144094-how-to-use-matlab-trainnet-to-train-a-network-without-an-explicit-output-layer-r2024a

Edited: Matt J on 9 Aug 2024

I've attempted to train a CNN with the goal of assigning N numeric values to different input images, depending on image characteristics. It looked like the network's output layer could be a fully-connected layer with N outputs (because I have not found a linear output layer in Deep Network Designer). I am not sure if I can use a non-linear output layer instead, because this is fundamentally a regression task.

However, when I use a fully connected layer in place of an output layer, trainnet repeatedly throws errors indicating that I must have an output layer.

So basically, I have two questions:

1) Is it possible to use trainnet with a network that has no output layer? It is difficult to imagine that a built-in training function has an oversight like this. Do I really need to construct a custom training loop if my network has no output layer?

2) Are there any alternatives? In essence, all I am looking for is an output layer that is either a) linear or b) does not change the previous layer's output. Just anything that is compatible with a regression task.

If any clarification is needed on my issue or network construction, I would be happy to provide it.

Thank you so much for your help!

Deep Learning Toolbox Version 24.1 (R2024a), trainnet function, MATLAB R2024a.

2 Comments

Matt J on 9 Aug 2024

I can't reproduce that. Here is an example of training a simple network whose final layer is a fully connected layer. No error messages:

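% One 3x3 input image paired with one scalar target
ds = combine(arrayDatastore(rand(3),IterationDimension=3), ...
             arrayDatastore(rand(1),IterationDimension=3));

% The network ends in a fully connected layer, with no output layer
layers = [imageInputLayer([3,3,1]), fullyConnectedLayer(1)];

trainnet(ds,layers,'mse',trainingOptions('adam',TargetDataFormats="CB"))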

    Iteration    Epoch    TimeElapsed    LearnRate    TrainingLoss
    _________    _____    ___________    _________    ____________
            1        1       00:00:00        0.001        0.073747
           30       30       00:00:01        0.001        0.036938
Training stopped: Max epochs completed

ans =

  dlnetwork with properties:

         Layers: [2x1 nnet.cnn.layer.Layer]
    Connections: [1x2 table]
     Learnables: [2x3 table]
          State: [0x3 table]
     InputNames: {'imageinput'}
    OutputNames: {'fc'}
    Initialized: 1

  View summary with summary.

Michael Solonenko on 9 Aug 2024

Use of 'mse' does the trick. I figured that out already, and thank you!


Answer 1

Aditya on 8 Aug 2024

Link to this answer: https://kr.mathworks.com/matlabcentral/answers/2144094-how-to-use-matlab-trainnet-to-train-a-network-without-an-explicit-output-layer-r2024a#answer_1496769

Edited: Aditya on 8 Aug 2024

Hi @Michael Solonenko,

To address your query:

1) To my knowledge, the "trainnet" function cannot be used without an explicit output layer. It expects a complete network architecture, including an output layer, to define the loss function and perform backpropagation during training.

2) Use a fully connected layer with N outputs as the last layer and set the loss function to "mse", since this is a regression task. I am not sure why you get the error you mention when doing this. It would help if you could post the code you are using (the layer architecture and trainingOptions).
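A minimal sketch of point 2, for readers who want something to run. The random placeholder data, the 28x28x1 image size, N = 3, and all variable names are illustrative assumptions, not taken from the question. The target layout (observations by responses) is also an assumption about how trainnet reads numeric array targets.

```matlab
% Placeholder data: 64 grayscale 28x28 images with N = 3 numeric targets each.
N = 3;
X = rand(28,28,1,64,"single");   % height x width x channels x observations
T = rand(64,N,"single");         % observations x responses (assumed layout)

% The network ends in a fully connected layer; there is no output layer.
layers = [
    imageInputLayer([28 28 1])
    convolution2dLayer(3,8,Padding="same")
    reluLayer
    fullyConnectedLayer(N)];

options = trainingOptions("adam",MaxEpochs=2,Verbose=false);

% The loss is passed as an argument instead of being a layer.
net = trainnet(X,T,layers,"mse",options);

% Predict on the same images and inspect the size to confirm the layout.
Y = minibatchpredict(net,X);
disp(size(Y))
```

With real data, replace X and T with your images and your N values per image; the layer array and the trainnet call stay the same.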

You could also refer to the MATLAB documentation example "Train Convolutional Neural Network for Regression":

https://www.mathworks.com/help/deeplearning/ug/train-a-convolutional-neural-network-for-regression.html

Hope this helps!

3 Comments

Michael Solonenko on 8 Aug 2024

Hello @Aditya,

Thank you for the response!

However, I do have one question. In your response to 1) you explain that "trainnet" cannot function without an explicit output layer, but the link provided at the end has an example of just this.

Here, you have a network terminating in a fully connected layer:

layers = [
    imageInputLayer([28 28 1])
    convolution2dLayer(3,8,Padding="same")
    batchNormalizationLayer
    reluLayer
    averagePooling2dLayer(2,Stride=2)
    convolution2dLayer(3,16,Padding="same")
    batchNormalizationLayer
    reluLayer
    averagePooling2dLayer(2,Stride=2)
    convolution2dLayer(3,32,Padding="same")
    batchNormalizationLayer
    reluLayer
    convolution2dLayer(3,32,Padding="same")
    batchNormalizationLayer
    reluLayer
    fullyConnectedLayer(numResponses)];

And this network would be trained as such:

miniBatchSize  = 128;
validationFrequency = floor(numel(anglesTrain)/miniBatchSize);
options = trainingOptions("sgdm", ...
    MiniBatchSize=miniBatchSize, ...
    InitialLearnRate=1e-3, ...
    LearnRateSchedule="piecewise", ...
    LearnRateDropFactor=0.1, ...
    LearnRateDropPeriod=20, ...
    Shuffle="every-epoch", ...
    ValidationData={XTest,anglesTest}, ...
    ValidationFrequency=validationFrequency, ...
    Plots="training-progress", ...
    Metrics="rmse", ...
    Verbose=false);
net = trainnet(XTrain,anglesTrain,layers,"mse",options);

Is there something in the options that allows this? How does this work?

Aditya on 9 Aug 2024

Edited: Aditya on 9 Aug 2024

Hi @Michael Solonenko,

Yes. When you call trainnet with the "mse" loss function, the mean squared error is computed directly between the output of the final fully connected layer and the targets during training, so no separate regression output layer is needed.

You can also look at the MATLAB documentation for regressionLayer: https://in.mathworks.com/help/deeplearning/ref/regressionlayer.html

That page recommends using "trainnet" with the "mse" loss instead of regressionLayer.
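As a rough sketch of that recommendation, the older regressionLayer workflow is shown as a comment next to a trainnet call. The data are random placeholders, and numResponses, XTrain, and TTrain are illustrative names, so treat this as an outline under those assumptions.

```matlab
% Illustrative layers; numResponses and the input size are placeholders.
numResponses = 1;
layers = [
    imageInputLayer([28 28 1])
    fullyConnectedLayer(numResponses)];

% Random placeholder data standing in for real images and targets.
XTrain = rand(28,28,1,32,"single");
TTrain = rand(32,numResponses,"single");
options = trainingOptions("sgdm",MaxEpochs=1,Verbose=false);

% Older workflow: append regressionLayer and call trainNetwork.
%   net = trainNetwork(XTrain,TTrain,[layers; regressionLayer],options);

% trainnet workflow: the same layers without regressionLayer, loss given by name.
net = trainnet(XTrain,TTrain,layers,"mse",options);

% The network output is the last layer in the array, here the fully connected layer.
disp(net.OutputNames)
```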

Hope this clarifies the doubt!

Michael Solonenko on 9 Aug 2024

Thank you!


Answer 2

Matt J on 9 Aug 2024

Link to this answer: https://kr.mathworks.com/matlabcentral/answers/2144094-how-to-use-matlab-trainnet-to-train-a-network-without-an-explicit-output-layer-r2024a#answer_1497229

Edited: Matt J on 9 Aug 2024

1) Is it possible to use trainnet in a network without an output layer? It is difficult to imagine that a built-in training function has an oversight like this.

trainnet is always used without an output layer. The loss function is specified through the lossFcn input argument:

netTrained = trainnet(images,net,lossFcn,options)

2) Are there any alternatives? In essence, all I am looking for is an output layer that is either a) linear or b) does not change the previous layer's output. Just anything that is compatible with a regression task.

lossFcn can be a custom loss function that you supply. From the documentation:

Function handle with the syntax loss = f(Y1,...,Yn,T1,...,Tm), where Y1,...,Yn are dlarray objects that correspond to the n network predictions and T1,...,Tm are dlarray objects that correspond to the m targets.
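A small sketch of the function-handle form with one prediction and one target (n = m = 1). The data, layer sizes, and the choice of Huber loss with a transition point of 0.5 are illustrative assumptions; any function that returns a scalar loss from Y and T could take its place.

```matlab
% Placeholder regression data: 32 observations, 4 features, 2 targets.
X = rand(32,4,"single");   % observations x features
T = rand(32,2,"single");   % observations x responses (assumed layout)

% The network ends in a fully connected layer; there is no output layer.
layers = [
    featureInputLayer(4)
    fullyConnectedLayer(8)
    reluLayer
    fullyConnectedLayer(2)];

% Custom loss: Huber loss with a non-default transition point.
% Y (predictions) and T (targets) are passed in by trainnet as dlarray objects.
lossFcn = @(Y,T) huber(Y,T,TransitionPoint=0.5);

options = trainingOptions("adam",MaxEpochs=2,Verbose=false);
net = trainnet(X,T,layers,lossFcn,options);
```

The handle only wraps an existing loss here, but the same pattern applies to a loss written from scratch, as long as it uses operations that dlarray supports so that gradients can be computed.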
