Pausing reinforcement learning by forcing

Question

SHromaneko 2024년 1월 2일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2065726-pausing-reinforcement-learning-by-forcing

편집: SHromaneko 2024년 1월 10일

I'm running reinforcement learning, but I think there are times when I think it's not going well and want to stop it.

If you stop today's learning agent executed with the code below with Training stopped, simulink will freeze and stop working.

Repeatedly hitting the escape key doesn't work either.

so that you can stop it properly

Is there something wrong with the code?

numObs = 9;

obsInfo = rlNumericSpec([numObs 1]);

obsInfo.Name = "observations";

mdl = "kineticmodel_wIAVFBC_IncS2Ea_NH3FBC";

open_system(mdl)

numAct = 1;

actInfo = rlNumericSpec([numAct 1],LowerLimit=0,UpperLimit=1);

actInfo.Name = "NH3";

blk = mdl + "/RL agent/RL Agent";

env = rlSimulinkEnv(mdl,blk,obsInfo,actInfo);

Ts = 1

agent = createDDPGAgent(numObs,obsInfo,numAct,actInfo,Ts);

maxEpisodes = 2000;

Tf = 1240*3

maxSteps = floor(Tf/Ts);

trainOpts = rlTrainingOptions(...

MaxEpisodes=maxEpisodes,...

MaxStepsPerEpisode=maxSteps,...

ScoreAveragingWindowLength=250,...

Verbose=false,...

Plots="training-progress",...

StopTrainingCriteria="EpisodeCount",...

StopTrainingValue=maxEpisodes,...

SaveAgentCriteria="EpisodeCount",...

SaveAgentValue=maxEpisodes);

doTraining = true;

if doTraining

% Train the agent.

trainingStats = train(agent,env,trainOpts);

else

% Load a pretrained agent for the selected agent type.

if strcmp(AgentSelection,"DDPG")

load("rlWalkingBipedRobotDDPG.mat","agent")

else

load("rlWalkingBipedRobotTD3.mat","agent")

end

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Emmanouil Tzorakoleftherakis 2024년 1월 9일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2065726-pausing-reinforcement-learning-by-forcing#answer_1386366

The proper way to stop it would be through the Episode Manager (top right of the window). Does this not work for you?

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

SHromaneko 2024년 1월 10일

편집: SHromaneko 2024년 1월 10일

It's just a bug, I re-installed and fixed it.

Thanks a lot.

댓글을 달려면 로그인하십시오.

Pausing reinforcement learning by forcing

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

Pausing reinforcement learning by forcing

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기