sgdmupdate() is a single step or multiple steps until convergence?

조회 수: 1 (최근 30일)
robinho robinho
robinho robinho 2022년 7월 12일
댓글: Ben 2023년 9월 18일
Is calling sgdmupdate() once training a net over a single step or multiple steps until convergence? If the latter, how is convergence judged?

답변 (1개)

Akash
Akash 2023년 9월 15일
편집: Akash 2023년 9월 15일
Hi Robinho,
I understand that you are seeking clarification on whether calling "sgdmupdate" function during training involves a single step or multiple steps until convergence. Additionally, you would like to know how convergence is determined, specifically the criteria used for judging convergence.
The "sgdmupdate" function utilizes multiple steps or epochs to achieve convergence during training. Each step represents a single update of the parameters using "Stochastic gradient descent with momentum (SGDM)".
To determine convergence, you can specify the total number of training epochs using the "numEpochs" parameter. This parameter indicates the number of times the model goes through the entire training dataset. Convergence is typically judged based on the desired level of accuracy or a predefined stopping criterion.
Additionally, the convergence can also be determined by monitoring the "Stop" property of the "TrainingProgressMonitor" object. If the "Stop" property is set to true, it indicates that the training process should be stopped. This property can be triggered, for example, by clicking the "Stop" button during training.
To gain more information about "sgdmupdate" function, including its usage and convergence criteria, I recommend referring to the documentation provided at the below provided link.
Thanks,
Akash.
  댓글 수: 2
feynman feynman
feynman feynman 2023년 9월 17일
thanks so much. So sgdmupdate runs multiple steps instead of 1?
Ben
Ben 2023년 9월 18일
Each call to sgdmupdate performs one step of the update algorithm specified at the bottom of the documentation page: https://uk.mathworks.com/help/deeplearning/ref/sgdmupdate.html
Typically you use sgdmupdate in custom training loops, which will typically end up calling sgdmupdate multiple times as the loop progresses, and control of convergence is up to the user to manage in the custom loop.

댓글을 달려면 로그인하십시오.

카테고리

Help CenterFile Exchange에서 Sequence and Numeric Feature Data Workflows에 대해 자세히 알아보기

태그

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by