Reinforcement learning agent stops training unexpectedly?

when I use DDPG agent to train a model, the agent stops training at 199th episode. However, the maximum episode number I set is 500. In addition, the average reward 143.9382 is far lower than the termination condition value 500. The final result shows "traning finished after all agents reached stop training criteria". I am confused what does it mean?

답변 (1개)

Emmanouil Tzorakoleftherakis
Emmanouil Tzorakoleftherakis 2023년 1월 26일

0 개 추천

The picture also mentioned why training stopped in the last two rows. In this case, it seems you have a criterion that stops training when the average steps in an episode reach 500.
The max episode number kicks in if no other criterion is satisfied by the time training reaches the specified episode number.

카테고리

도움말 센터File Exchange에서 Reinforcement Learning Toolbox에 대해 자세히 알아보기

제품

릴리스

R2022a

질문:

lab
2022년 8월 15일

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by