The definition of the Target update frequency in Reinforcement Learning Designer.

조회 수: 8 (최근 30일)
In DDPG Agent, there are four networks. Online policy, Target policy, Online Q and Target Q.
The [Target update frequency] is used to the Target policy and Target Q in Reinforcement Learning Designer.
Are the Update frequency of the Online policy and Online Q same as the [Target update frequency] ?

채택된 답변

UDAYA PEDDIRAJU
UDAYA PEDDIRAJU 2024년 3월 12일
Hi Xian,
No, the update frequency of the Online Policy and Online Q networks is not the same as the Target Update Frequency. The Target Update Frequency specifically applies to how often the Target Policy and Target Q networks are updated, which is typically less frequent or managed differently to ensure stability in learning.
  댓글 수: 1
Xian Zheng Hong
Xian Zheng Hong 2024년 3월 16일
Thanks for answering. Here is my another question.
Are the Online policy and Online Q updated at every time step in Reinforcement Learning Designer Toolbox?

댓글을 달려면 로그인하십시오.

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 Deep Learning Toolbox에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by