is actor-critic agent learning?
조회 수: 1 (최근 30일)
이전 댓글 표시
I built a actor critic agent for microgrid energy management. it has to decide the discharging/charging energy among a set of action
in total 9 action can be taken for 7008 time steps. I am training the agent over 2000 episodes. But I notice when the agent start learning at a cetain episodes, at the next episodes it fall completely down. I tattached the training for the first 250 episodes.
I wonder if there something wrong in my code.
댓글 수: 0
답변 (1개)
Ahmed R. Sayed
2022년 10월 4일
From your figure, the discounted reward value is very large. try to rescale it to a certain value [-10, 10] in the environment. For example, r(t) = 10 * Microgrid operational cost (t) / MaxCost , where MaxCost is the maximum possible cost per time step.
Another point is you can use another agent.
I hope these suggestions can solve your concerns.
댓글 수: 0
참고 항목
카테고리
Help Center 및 File Exchange에서 Agents에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!