why agent failed to get accelerated after training?

Question

Kun Cheng 2023년 4월 18일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1949243-why-agent-failed-to-get-accelerated-after-training

답변: Piyush Dubey 2023년 6월 2일

Hi,

I trained an pre-trained agent in the same environment. I expect that, model should converge faster but it did not happen.

first pic: first training

second pic: with trained agent

it seems agent do the same training once again. My question is why the second one not faster?

agent setting:

agentOpts=rlDQNAgentOptions(...
    'UseDoubleDQN',true,...
    'MiniBatchSize', 64, ...
    'SaveExperienceBufferWithAgent',true);
'rlDQNAgentOptions' requires Reinforcement Learning Toolbox.
agentOpts.EpsilonGreedyExploration.EpsilonDecay=1e-3;
agentOpts.EpsilonGreedyExploration.Epsilon=0.9;
agentOpts.CriticOptimizerOptions.LearnRate=0.01;
agentOpts.CriticOptimizerOptions.GradientThreshold=1;
Train_Old_Model = true; % Set to true, to use pre-trained
agentOpts.ResetExperienceBufferBeforeTraining = not(Train_Old_Model);
if Train_Old_Model
    % Load experiences from pre-trained agent    
    load("XYAgent.mat",'agent');
    
else
    %new DQN Agent
    agent = rlDQNAgent(critic,agentOpts);
end

traning setting

maxEpisodes = 1300;
maxStepsPerEpisode = 20;
trainOpts = rlTrainingOptions(...
    MaxEpisodes=maxEpisodes, ...
    MaxStepsPerEpisode=maxStepsPerEpisode, ...
    Verbose=false, ...
    ScoreAveragingWindowLength=100,...
    Plots="training-progress",...
    StopTrainingCriteria="EpisodeCount",...
    StopTrainingValue=maxEpisodes);
plot(env)
%train
doTraining = true;
if doTraining
    % Train the agent.
    trainingStats = train(agent,env,trainOpts);
    save("XYAgent.mat","agent")
else
    % Load the pretrained agent for the example.
    load("XYAgent.mat","agent")
end

Thank you!

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Piyush Dubey 2023년 6월 2일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/1949243-why-agent-failed-to-get-accelerated-after-training#answer_1249159

Hi Kun,

There are various reasons because of which an agent may take longer to converge. Various ways by which a model can be saved, and the training can be resumed can be found in the documentation below:

https://www.mathworks.com/help/reinforcement-learning/ug/train-reinforcement-learning-agents.html

The reasons why a pre-trained agent can take longer in the same environment are:

Overfitting a specific set of data
Different objectives of the agent
Architectural difference of the neural networks used in the agent
Exploration vs Exploitation tradeoff
Incorrectly initialized hyperparameters

Above pointers can be used for diagnosing reasons of a slower convergence of the agent.

Hope this helps.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

why agent failed to get accelerated after training?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

why agent failed to get accelerated after training?

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기