The definition of the Target update frequency in Reinforcement Learning Designer.

Question

Xian Zheng Hong 2024년 3월 7일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2091631-the-definition-of-the-target-update-frequency-in-reinforcement-learning-designer

댓글: Xian Zheng Hong 2024년 3월 16일

채택된 답변: UDAYA PEDDIRAJU

In DDPG Agent, there are four networks. Online policy, Target policy, Online Q and Target Q.

The [Target update frequency] is used to the Target policy and Target Q in Reinforcement Learning Designer.

Are the Update frequency of the Online policy and Online Q same as the [Target update frequency] ?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

UDAYA PEDDIRAJU 2024년 3월 12일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2091631-the-definition-of-the-target-update-frequency-in-reinforcement-learning-designer#answer_1424086

Hi Xian,

No, the update frequency of the Online Policy and Online Q networks is not the same as the Target Update Frequency. The Target Update Frequency specifically applies to how often the Target Policy and Target Q networks are updated, which is typically less frequent or managed differently to ensure stability in learning.

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Xian Zheng Hong 2024년 3월 16일

Thanks for answering. Here is my another question.

Are the Online policy and Online Q updated at every time step in Reinforcement Learning Designer Toolbox?

댓글을 달려면 로그인하십시오.

The definition of the Target update frequency in Reinforcement Learning Designer.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

The definition of the Target update frequency in Reinforcement Learning Designer.

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기