Questions about Reinforcement Learning

Question

Averill Law 2020년 5월 12일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/524772-questions-about-reinforcement-learning

답변: Averill Law 2020년 5월 12일

I would greatly appreciate it if someone could answer my questions on the Reinforcement Learning examples. I would particularly like to know why the examples don't converge. Thank you.

Averill M. LAW

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Emmanouil Tzorakoleftherakis 2020년 5월 12일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/524772-questions-about-reinforcement-learning#answer_432046

Hi Averill,

Can you please let me know which examples do not converge? I will share this information with the development team.

In the meantime, please have a look at this post. Every release comes with new features and optimizations under the hood, so the training plots may differ. The examples should still converge though (again please let me know if you find any examples that don't converge).

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

Answer 2

Averill Law 2020년 5월 12일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/524772-questions-about-reinforcement-learning#answer_432076

Thank you for your response. I asked several questions around May 9th. First of all, I ran the "Rocket Lander" example for the recommended 20,000 episodes and hyperparameters, and it was still having violent crashes. I have a Dell mobile work station (CPU) with a fast processor (2nd fastest available two years ago) and the example literally ran for 50 hours, rather than a few as stated in the literature.

I also could not get the "Stochastic Waterfall Grid World" example to converge, despite running it for thousands of episodes and many choices of the hyperparameters.

Averill M. LAW

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

Questions about Reinforcement Learning

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (2개)

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

Questions about Reinforcement Learning

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (2개)

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기