Huber loss in feedforwardnet

Question

Abdulaziz Almershed 2021년 2월 17일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/748342-huber-loss-in-feedforwardnet

답변: Shashank Gupta 2021년 2월 22일

Hello,

I have read that Huber loss as an error function is more effective in datasets with many outliers. However, I do not know how to use it in training my fitnet. my fitnet is

net=fitnet([5 5], 'trainlm')

Thank you

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Shashank Gupta 2021년 2월 22일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/748342-huber-loss-in-feedforwardnet#answer_630059

MATLAB Online에서 열기

Hi Abdulaziz,

There are multiple ways to do it. First and easy way is to shift to deep neural network framework rather than using shallow neural network. It gives more fleibility to write custom function. Check out this example to create a deep network. Now coming to custom loss function so you can either define a custom output layer and include Huber loss for the same. Check out this example, this will help you. Or if you don't wish to create a separate layer then you can write a custom training loop and implement the loss function on your own. check this documentation.

If you still wants to use shallow neural network i.e. fitnet then implement the following steps.

% define fitnet.
net=fitnet([5 5], 'trainscg');
% define a custom function 'newFcn';
net.performFcn = 'newFcn';
net = train(net,x,t);

Now you have to create a new package "+newfcn" with following template.

1) newfcn.m - Same as mse.m

2) +newfcn/apply.m - The main performance calculation

3) +newfcn/apply.m

function perfs = apply(t,y,e,param)
Calculate performance for each target individually so 'perfs' is same size as t, y and e.

4) +newfcn/backprop.m - Backpropagation derivatives

function dy = backprop(t,y,e,param)
Return 'dperf/dy', the derivative of performance with respect to each output 'y'.

5) +newfcn/forwardprop.m - Forward propagate derivatives

function dperf = forwardprop(dy,t,y,e,param)
Return 'dperf/dwb', given 'dy/dwb'. (wb - weights and biases)

In this way you have to define everything wrt custom function. Both the method which I suggested to similar but I would prefer the first one because it is easy to implement and you will find enough resources to explore. The second shallow network approach is slightly non trivial and there aren't many resources available. So try shifting to first approach.

I hope this helps.

Cheers