Why do the training curve fall sharp suddenly?

Question

Saugata Bose 2019년 8월 31일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/478348-why-do-the-training-curve-fall-sharp-suddenly

댓글: Saugata Bose 2019년 8월 31일

I am training a CNN classifier on a binary balanced dataset. The dataset has 4500 numbers of tweet data along with the class of the tweet. During training, I am applying, GLOVE embedding of 300 dimensions, 'adam' solver to run the model for 33 times of epochs. Besides, the sequence length I have considered is 31.

I have applied 200 filters which include a number of convolution2dlayers,batch normalization layers, relu layers, dropout layers and max-pooling layers. The drop out I have considered is 0.2 and the max pool layer is of size [1 sequence length].

The training curve is approaching smoothly until the end period where it has fallen sharply. Here, I have attached the training plot I receive:

Would you please explain to me why does this sudden fall occur? And how could I get rid of this?

Thanks,

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Matt J 2019년 8월 31일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/478348-why-do-the-training-curve-fall-sharp-suddenly#answer_389873

From the page Monitor Deep Learning Training Progress:

The final validation metrics are labeled Final in the plots. If your network contains batch normalization layers, then the final validation metrics are often different from the validation metrics evaluated during training. This is because batch normalization layers in the final network perform different operations than during training.

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Saugata Bose 2019년 8월 31일

Matt hi. thanks for your response. Yes, removing batch normalization has solved the problem. But it does not mean that using batch normaliation will always create such anamaly. Because, I am using batch normalization layer in few of my works but I never did experence such thing. Does this anamaly by any chance relate to the dataset the model is working on or the hyperparameters of the model?

thanks,

댓글을 달려면 로그인하십시오.

Why do the training curve fall sharp suddenly?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

Why do the training curve fall sharp suddenly?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기