My agent finds a good solution but it does not repeat it and turns back to bad behavior again (Reinforcement Learning )

Question

Soheil Khoshboo 2022년 9월 6일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1796300-my-agent-finds-a-good-solution-but-it-does-not-repeat-it-and-turns-back-to-bad-behavior-again-reinf

답변: Emmanouil Tzorakoleftherakis 2023년 1월 25일

2022-09-06 15_31_29-Reinforcement Learning Episode Manager.png

My agent finds a good solution but it does not repeat it and turns back to bad behavior again. I dont know why is that the case! it happens a lot and everytime even after finishig the training it does not learn. Can anyone tell me why is that?

Attached you can see an example. it happend very early and it was almost why expected for but it didnt follow it anymore.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Emmanouil Tzorakoleftherakis 2023년 1월 25일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1796300-my-agent-finds-a-good-solution-but-it-does-not-repeat-it-and-turns-back-to-bad-behavior-again-reinf#answer_1156410

This picture shows progress for a very small number of episodes. You should wait for a few hundred episodes before you can evaluate the agent. Even then, there is no guarantee that episode reward will increase monotinically. Please see this answer. If after training for some time, the agent is still not learning anything, you need to go back and evaluate the problem formulation. You may need to increase exploration, adjust neural net architectures, pick different sample time, tune agent hyperparameters, pick a different reward and so on. These are a few suggestions, but as you can see this is unfortunately a trial and error process

Hope this helps

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

My agent finds a good solution but it does not repeat it and turns back to bad behavior again (Reinforcement Learning )

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

My agent finds a good solution but it does not repeat it and turns back to bad behavior again (Reinforcement Learning )

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기