How can I improve the episode reward when training a DDPG agent?

hunson yang on 26 Feb 2020
Commented: Guoge Tan on 25 May 2020
I'm training a DDPG agent with the Reinforcement Learning Toolbox. But as you can see, my episode reward never changes. I have tried many ways to fix this problem, such as changing the network, the gradient threshold, and the learning rate, but the result is always the same. I checked my reward function: when the relevant condition is met, the agent should receive a reward or a penalty. Yet the episode reward always stays the same.
Is there a problem with my conditions? Or are my results not being fed into the model? I don't know what else to try.
  2 Comments
Emmanouil Tzorakoleftherakis on 28 Feb 2020
How did you set the IsDone flag? If it is raised too early, it may lead to premature episode termination.
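To illustrate the point about premature termination, here is a minimal toy sketch in plain Python. It is not Reinforcement Learning Toolbox code, and every name in it (`run_episode`, `buggy_is_done`, `correct_is_done`, `TARGET`) is hypothetical. It assumes a 1-D environment where the done condition is checked before each step. If that condition is already true in the initial state, the episode ends after zero steps and the episode reward is identical for every policy, which would look like a reward curve that never changes.

```python
TARGET = 5.0  # hypothetical goal position for this toy environment


def run_episode(policy, is_done, max_steps=10):
    """Roll out one episode; return (total_reward, steps_taken)."""
    position = 0.0
    total_reward = 0.0
    steps = 0
    while steps < max_steps:
        # The done flag is checked before the action is applied.
        if is_done(position):
            break
        position += policy(position)
        total_reward += -abs(TARGET - position)  # closer to the target is better
        steps += 1
    return total_reward, steps


def buggy_is_done(position):
    # Bug: measures distance to the origin, so it is true at the start state.
    return abs(position) < 0.5


def correct_is_done(position):
    # Intended condition: stop once the target has been reached.
    return abs(position - TARGET) < 0.5


def good_policy(position):
    return 1.0   # always move toward the target


def bad_policy(position):
    return -1.0  # always move away from the target


if __name__ == "__main__":
    for name, done_fn in [("buggy", buggy_is_done), ("correct", correct_is_done)]:
        print(name, run_episode(good_policy, done_fn), run_episode(bad_policy, done_fn))
```

With the buggy flag both policies produce the same zero-step episode, so nothing the agent does can change the reward. A quick check in the same spirit for a real model is to log how many steps each episode lasts: if it is always the same very small number, the termination condition is a likely suspect.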
Guoge Tan on 25 May 2020
Hi, sorry to bother you, but has your problem been solved? I'm working on a path planning problem using the Reinforcement Learning Toolbox in MATLAB R2020a, and I have run into a problem similar to yours.


Answers (0)
