How Does MATLAB Internally Format Actions as dlarray in DDPG with Recurrent Networks (LSTM)?

Question

Farid on 10 Mar 2025

https://kr.mathworks.com/matlabcentral/answers/2174967-how-does-matlab-internally-format-actions-as-dlarray-in-ddpg-with-recurrent-networks-lstm

Commented: Farid on 13 Mar 2025

In MATLAB's Reinforcement Learning Toolbox, when using DDPG with LSTM-based actors/critics, the conversion of actions to dlarray is handled automatically. Since users cannot directly control this process:

Are actions formatted with 'T' (time) or 'C' (channel) dimensions when passed between the actor and critic networks?

How does MATLAB structure actions for compatibility with recurrent layers (e.g., aligning sequences for LSTM time steps)?


Answer 1

praguna manvi on 13 Mar 2025

https://kr.mathworks.com/matlabcentral/answers/2174967-how-does-matlab-internally-format-actions-as-dlarray-in-ddpg-with-recurrent-networks-lstm#answer_1561724


Hi @Farid,

In the functions "getAction" and "getValue", used for the "actor" and "critic" networks respectively, the inputs/observations are reshaped and formatted as "CBT" (channel, batch, time) when the network takes sequence inputs, for example when it contains an LSTM layer ("lstmLayer"). This puts the data in the format that such networks generally expect. To explore this further, you can open the example below:

openExample('rl/CreateDDPGAgentUsingRecurrentNeuralNetworksExample')

Stepping into these functions while running the example gives more insight into how the data is structured and processed within these networks.
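To make the "CBT" labeling concrete, here is a minimal sketch using only "dlarray" from Deep Learning Toolbox. It does not reproduce what the toolbox does internally. The sizes and variable names are hypothetical, chosen only to show what a channel-by-batch-by-time array looks like once the labels are attached.

```matlab
% Hypothetical sizes, chosen only for illustration.
numActions = 2;   % C: channels (action dimension)
batchSize  = 4;   % B: batch (number of independent sequences)
seqLength  = 5;   % T: time steps per sequence

% Raw numeric data laid out as channel-by-batch-by-time.
rawActions = rand(numActions, batchSize, seqLength, 'single');

% Attach dimension labels so sequence layers can tell which axis is time.
dlActions = dlarray(rawActions, 'CBT');

disp(dims(dlActions))   % dimension labels of the array
disp(size(dlActions))   % size along each labeled dimension

% Find which dimension carries the 'T' label, then take the
% final time step for every channel and batch element.
tDim = finddim(dlActions, 'T');
lastStep = dlActions(:, :, end);
```

In this layout the action dimension sits on the "C" axis and the sequence position on the "T" axis, so each slice along the third dimension holds one time step for every sequence in the batch.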


Farid on 13 Mar 2025

Thank you for your time and your help.
