Why the Reinforcement Learning seems do not learn anything?

Question

HUNG JUI CHIU 2021년 3월 31일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/788839-why-the-reinforcement-learning-seems-do-not-learn-anything

답변: Tarunbir Gambhir 2021년 5월 27일

Is reward not converge to a certain value show that the RL agent does no learn anything?

The result shows that every training the agent does the different choices, it won't learn something good from the previous one.

Although the reward is good and has the good result, next training it won't keep at that good choices, it will try the other choice then get the bad result.

How can I deal with this problem?

Thank for helping.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Tarunbir Gambhir 2021년 5월 27일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/788839-why-the-reinforcement-learning-seems-do-not-learn-anything#answer_710240

If the agent is not taking good choices at later episodes, it is likely that the exploration epsilon factor is still high. You can try increasing the "agentOptions.EpsilonGreedyExploration.EpsilonDecay" parameter to encourage the agent to exploit the previously learned Q-values at later episodes.

You can refer this documentation page for more information on the importance of parameters for the epsilon-greedy exploration concept.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

Why the Reinforcement Learning seems do not learn anything?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

Why the Reinforcement Learning seems do not learn anything?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기