DQN training getting worse
조회 수: 5 (최근 30일)
이전 댓글 표시
Hello all,
I am trying to train a very simple model with DQN. When I increase the trainning steps, the average/episode reward gest worse and worse. The agent gives deseriable result at a much earlier episode than a later ones.
This is what I had for average reward and epsidoe reward. I know for sure this amount of trainning is way more than sufficient.
This is the parameters I had for this training:
Can anyone help to explain why that happens and what can to do to avoid that from happening?
Thanks
댓글 수: 0
답변 (0개)
참고 항목
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!