Problems in reinforcement learning training

조회 수: 12 (최근 30일)

ye 2024년 9월 2일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2149489-problems-in-reinforcement-learning-training

댓글: Shantanu Dixit 2024년 9월 2일

The effect of matlab reinforcement learning in the training process is better, but the reason for the poor effect after saving the agent is, or how to save the good effect in the training process

댓글 수: 3
이전 댓글 1개 표시이전 댓글 1개 숨기기

ye 2024년 9월 2일

That is, in the training process will encounter a good effect of the agent, this time to stop training and save the agent, but with the saved agent to run, the effect and training process is very different

Shantanu Dixit 2024년 9월 2일

Assuming you're experiencing different training process before and after loading the saved agent, this could be due to following factors:

Experience Buffer: By default, the experience buffer isn't saved with some agents like DDPG and DQN. If you plan to continue training the saved agent, consider setting 'SaveExperienceBufferWithAgent' to true to preserve the experience buffer.
Non-Determinism and Exploration Strategy: Training may involve stochastic elements, causing the agent to explore different trajectories after being reloaded, which could result in a different training process.

Additionaly you can refer to 'SaveAgentCriteria' and 'SaveAgentValue' to save agents that meet specific performance criteria.

Refer to the below MathWorks documentation for different saving strategies:

Saving Agents: www.mathworks.com/help/reinforcement-learning/ug/train-reinforcement-learning-agents.html

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.