How to manage the timing of network weight updates in DRL

Views: 3 (last 30 days)
MOHAMMADREZA
MOHAMMADREZA on 10 Mar 2025
Answered: Jack on 10 Mar 2025
Hi, I am trying to write a DRL agent. I do not need to update the weights of the neural networks at each step, only every n steps. How can I manage that? In particular, I do not know when the weight update happens. Is it after exiting the step function?

Answers (1)

Jack
Jack on 10 Mar 2025
You can control when the network weights update by decoupling the weight-update routine from the per-step logic. If you are writing your own training loop, maintain a counter that increments on each step and call the weight-update function only every n steps. For example, in a custom loop:
counter = 0;
n = 10;                          % update every 10 steps
while training
    % Take an action, observe the next state and reward, etc.
    counter = counter + 1;
    % Store the experience, perform other step-related tasks
    % Update the weights only every n steps
    if counter == n
        updateNetworkWeights();  % your function that performs a training update
        counter = 0;
    end
end
If you are using a built-in agent from MATLAB's Reinforcement Learning Toolbox, check the agent options instead. For example, for a DQN agent there is a property called LearnStepPeriod in rlDQNAgentOptions that lets you specify the number of environment steps between network training updates.
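As a rough sketch, the options object might be configured like this. The LearnStepPeriod name is taken from the description above and may differ by toolbox release, so check your version's documentation; TargetUpdateFrequency is a separate, standard option that controls how often the target network is synchronized, not how often gradient updates run.
```matlab
% Configure a DQN agent to perform a learning update only every 10 steps.
opts = rlDQNAgentOptions;
opts.LearnStepPeriod = 10;       % steps between training updates (name assumed, verify per release)
opts.TargetUpdateFrequency = 4;  % target-network sync period (a different setting)
% Pass opts when constructing your rlDQNAgent.
```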
Regarding "when" the weights are updated: the update typically happens at the end of the step processing, after the environment returns the next state and reward. That ordering ensures the experience from the current step is stored before any update draws on it.
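To make that ordering concrete, here is a minimal sketch of one iteration of a custom training loop. The env, agent, buffer, and the selectAction, storeExperience, and updateNetworkWeights functions are placeholders for your own code, not toolbox APIs.
```matlab
% One iteration: store the experience first, then (conditionally) update,
% so the update always sees the current step's data.
action = selectAction(agent, state);              % e.g. epsilon-greedy policy
[nextState, reward, isDone] = env.step(action);   % environment transition
storeExperience(buffer, state, action, reward, nextState, isDone);
stepCount = stepCount + 1;
if mod(stepCount, n) == 0                         % only every n steps
    updateNetworkWeights(agent, buffer);          % gradient update on a minibatch
end
state = nextState;
```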
Follow me so you can message me anytime with future questions. If this helps, please accept the answer and upvote it as well.
