How I can access the action output of the actor network in DDPG during training?

Question

Maha Mosalam 2021년 12월 2일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1601730-how-i-can-access-the-action-output-of-the-actor-network-in-ddpg-during-training

답변: Yash 2024년 12월 24일

I want to access the action output of the actor network in DDPG during training since I want to change it by force function to other action optimized from sepeate function to accelerate training and improve learning effeciecncy for actor , if any help for that? I wil be thankful

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Yash 2024년 12월 24일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1601730-how-i-can-access-the-action-output-of-the-actor-network-in-ddpg-during-training#answer_1556358

You can use the function getAction which returns action from agent, actor or policy object given environment observations. You can write a custom loss function that directly uses getAction and dlgradient within it, and then use dlfeval and dlaccelerate with your custom loss function. For an example, see Train Reinforcement Learning Policy Using Custom Training Loop and Custom Training Loop with Simulink Action Noise.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

How I can access the action output of the actor network in DDPG during training?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

How I can access the action output of the actor network in DDPG during training?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기