Issue with batch normalization layer of saved CNN

Kjetil Kveim on 21 Mar 2019
Answered: Sam Leeney on 15 Dec 2022
When I load a previously trained CNN, I run into a problem with the batch normalization layers. Inspecting the loaded model shows that the trained mean and variance are empty:
Name: 'batchnorm_1'
TrainedMean: []
TrainedVariance: []
So the checkpoint doesn't seem to save these parameters. Is there a workaround for this issue? I am using MATLAB R2018b.
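To see which layers of a loaded network are affected, one can loop over the layers and test for empty statistics. The following is a minimal sketch, not part of the original post: the layer array is a hypothetical stand-in for `net.Layers` of a loaded checkpoint (an untrained `batchNormalizationLayer` also reports empty `TrainedMean` and `TrainedVariance`), and it assumes Deep Learning Toolbox is installed.

```matlab
% Hypothetical stand-in for net.Layers of a network loaded from a checkpoint.
% An untrained batch normalization layer has empty statistics as well.
layers = [
    imageInputLayer([28 28 1])
    convolution2dLayer(3, 8)
    batchNormalizationLayer('Name', 'batchnorm_1')
    reluLayer];

% Collect the names of batch normalization layers whose statistics are empty.
missingStats = {};
for i = 1:numel(layers)
    layer = layers(i);
    if isa(layer, 'nnet.cnn.layer.BatchNormalizationLayer') && ...
            (isempty(layer.TrainedMean) || isempty(layer.TrainedVariance))
        missingStats{end+1} = layer.Name; %#ok<SAGROW>
    end
end
disp(missingStats)
```

With a real checkpoint, replacing `layers` by `net.Layers` should list every batch normalization layer that would need its statistics restored.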
  Comments: 1
Wes Baldwin on 29 Jul 2020
Doesn't this mean that checkpoints are useless for networks with a batchnorm layer? That is kind of a big deal for long training runs! You could lose days or weeks of training with no option but to start from the beginning.


Accepted Answer

Vivek Singh on 22 Mar 2019
We were able to reproduce the issue. We will inform you once the issue is fixed.
Because TrainedMean and TrainedVariance are calculated after training has finished, as a workaround you can explicitly save and load the model with the code below.
% Save the trained network held in the variable "net" to demoModel.mat
save('demoModel','net')
% load returns a struct, so the network ends up in the field net1.net
net1 = load('demoModel.mat','net');
net1.net.Layers(n).TrainedMean % n is the index of the batch normalization layer
  Comments: 4
ramin nateghi on 5 Nov 2020
Edited: ramin nateghi on 5 Nov 2020
Hello,
I ran into this problem too. When a model is saved with the "save" function, everything is fine and the model contains all the information (TrainedMean and TrainedVariance) of the trained batch normalization layers. But when the model is saved as a checkpoint during training, both TrainedMean and TrainedVariance are empty. This is a bug in the checkpoint feature.
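One possible way to recover the statistics from a checkpoint, sketched here under assumptions rather than taken from this thread, is to resume training from the checkpoint's layers: the network that `trainNetwork` returns at the end of training is finalized, so its batch normalization statistics should be populated. The toy data, layer sizes, and temporary checkpoint folder below are hypothetical, and the `net` variable name inside the checkpoint file follows the MATLAB documentation on resuming training. For a DAG network, `layerGraph(ckpt.net)` would take the place of `ckpt.net.Layers`.

```matlab
% Toy data (hypothetical): 64 random 28x28 grayscale images, two classes.
XTrain = rand(28, 28, 1, 64);
YTrain = categorical(repmat([1; 2], 32, 1));

layers = [
    imageInputLayer([28 28 1])
    convolution2dLayer(3, 4)
    batchNormalizationLayer('Name', 'batchnorm_1')
    reluLayer
    fullyConnectedLayer(2)
    softmaxLayer
    classificationLayer];

% Train briefly while writing checkpoints to a fresh temporary folder.
ckptDir = tempname;
mkdir(ckptDir);
opts = trainingOptions('sgdm', 'MaxEpochs', 2, 'MiniBatchSize', 32, ...
    'Verbose', false, 'CheckpointPath', ckptDir);
trainNetwork(XTrain, YTrain, layers, opts);

% Load one checkpoint; the file stores the network in the variable "net".
files = dir(fullfile(ckptDir, 'net_checkpoint__*.mat'));
ckpt = load(fullfile(ckptDir, files(end).name), 'net');

% Resume training from the checkpoint's layers, without new checkpoints.
resumeOpts = trainingOptions('sgdm', 'MaxEpochs', 1, 'MiniBatchSize', 32, ...
    'Verbose', false);
net = trainNetwork(XTrain, YTrain, ckpt.net.Layers, resumeOpts);

% The finalized network should now carry the batch normalization statistics.
bn = net.Layers(3);
disp(~isempty(bn.TrainedMean) && ~isempty(bn.TrainedVariance))
```

In this sketch the resumed run lasts one epoch; how short a resumed run can be while leaving the learned weights essentially unchanged depends on the training options and is not settled by this thread.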
Yi Wei on 30 Nov 2020
Does this mean that saving will take too long if a network contains normalization layers and the training data set is large (e.g. 8T)?


More Answers (1)

Sam Leeney on 15 Dec 2022
For anyone else who is stuck, there is a fix here: https://uk.mathworks.com/matlabcentral/answers/423588-how-to-classify-with-dag-network-from-checkpoint
