![photo](/responsive_image/150/150/0/0/0/cache/matlabcentral/profiles/15495701_1557886988205_DEF.jpg)
Kundan Panta
Followers: 0 Following: 0
Feeds
질문
Confusion in agent and trainFromData options when using RNN/LSTM
My dataset contains numTraj trajectories, each containing numSteps time-steps. I filled the experience buffer with my data in a ...
8개월 전 | 답변 수: 1 | 0
1
답변질문
Do MBPO agents not support recurrent neural networks for the environment model, the base off-policy agent, or both?
Since TD3, SAC, etc. agents support using recurrent layers by themselves, would using these recurrent base agents still not work...
10개월 전 | 답변 수: 0 | 0