Do MBPO agents not support recurrent neural networks for the environment model, the base off-policy agent, or both?


Kundan Panta, 5 May 2024


https://kr.mathworks.com/matlabcentral/answers/2115186-do-mbpo-agents-not-support-recurrent-neural-networks-for-the-environment-model-the-base-off-policy

Commented: Kundan Panta, 7 May 2024

TD3, SAC, and similar agents support recurrent layers when used on their own. Would these recurrent base agents still not work with MBPO?

Could this limit be circumvented by using a custom training loop for the environment model and for the base agents?

Comments: 2

Naren Raman, 6 May 2024

Thank you for your question. No, MBPO agents do not currently support recurrent networks, as noted in the documentation. Yes, a custom training loop gives you more flexibility, and you should be able to use one to build a custom MBPO agent with recurrent neural networks.
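As a rough illustration of what such a custom loop would have to handle, here is a minimal conceptual sketch in plain Python. It is not Reinforcement Learning Toolbox code, and every name in it (`ToyRecurrentModel`, `policy`, `model_rollout`, `mbpo_style_iteration`) is hypothetical. It assumes the hidden state of the recurrent model is stored alongside each real observation, so that short model rollouts can branch from replayed states and carry the hidden state forward. The toy dynamics and placeholder policy are stand-ins for learned networks.

```python
import random
from collections import deque


class ToyRecurrentModel:
    """Stand-in for a learned recurrent dynamics model (hypothetical)."""

    def __init__(self, decay=0.5):
        self.decay = decay

    def initial_state(self):
        return 0.0

    def step(self, hidden, obs, action):
        # The hidden state is a running summary of past inputs.
        hidden = self.decay * hidden + (1.0 - self.decay) * (obs + action)
        next_obs = obs + 0.1 * action + 0.01 * hidden
        reward = -abs(next_obs)
        return hidden, next_obs, reward


def policy(obs):
    # Placeholder for the base agent's policy: push the observation toward zero.
    return -1.0 if obs > 0 else 1.0


def model_rollout(model, start_obs, start_hidden, horizon):
    """Generate `horizon` synthetic transitions, carrying the hidden state forward."""
    obs, hidden = start_obs, start_hidden
    transitions = []
    for _ in range(horizon):
        action = policy(obs)
        hidden, next_obs, reward = model.step(hidden, obs, action)
        transitions.append((obs, action, reward, next_obs))
        obs = next_obs
    return transitions


def mbpo_style_iteration(model, real_buffer, model_buffer, n_rollouts, horizon, rng):
    """Branch short model rollouts from real (observation, hidden state) pairs."""
    for _ in range(n_rollouts):
        start_obs, start_hidden = rng.choice(real_buffer)
        model_buffer.extend(model_rollout(model, start_obs, start_hidden, horizon))
    # A full implementation would now update the base agent on a mix of
    # real and model-generated transitions; that step is omitted here.
    return len(model_buffer)


rng = random.Random(0)
model = ToyRecurrentModel()
# Each real sample keeps the hidden state it was observed with (an assumption of this sketch).
real_buffer = [(0.5, model.initial_state()), (-0.3, model.initial_state())]
model_buffer = deque(maxlen=1000)
size = mbpo_style_iteration(model, real_buffer, model_buffer, n_rollouts=4, horizon=3, rng=rng)
print(size)  # 12 synthetic transitions (4 rollouts of 3 steps each)
```

The point of the sketch is the bookkeeping: with a recurrent model or agent, the loop must track hidden state per rollout, which a feedforward MBPO setup never has to do.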

Kundan Panta, 7 May 2024

Thank you for the timely response. To confirm that recurrent networks are unsupported for the base agents as well as the environment model, I combined the "Create MBPO Agent" and "Create TD3 Agent with Recurrent Neural Networks" examples, and the combination did throw an error.
