Reinforcement learning and parallel computation

Views: 2 (last 30 days)
ryunosuke tazawa on 13 April 2022
Commented: Takeshi Takahashi on 18 April 2022
I am conducting reinforcement learning with a SAC agent.
I tried to use a GPU and parallel computation, but when I use parallel computation, the training result changes.
Overall, learning is much worse than without parallel processing.
Do you know what causes this?
%% AGENT setting
agentOptions = rlSACAgentOptions;
agentOptions.SampleTime = Ts;
agentOptions.DiscountFactor = 0.90;
agentOptions.TargetSmoothFactor = 1e-3;
agentOptions.ExperienceBufferLength = 500;
agentOptions.MiniBatchSize = 256;
agentOptions.EntropyWeightOptions.TargetEntropy = -2;
agentOptions.NumStepsToLookAhead = 1;
agentOptions.ResetExperienceBufferBeforeTraining = false;
agent = rlSACAgent(actor,[critic1 critic2],agentOptions);
%% Learning setting
maxepisodes = 10000;
maxsteps = 1e6;
trainingOptions = rlTrainingOptions(...
'MaxEpisodes',maxepisodes,...
'MaxStepsPerEpisode',maxsteps,...
'StopOnError','on',...
'Verbose',true,...
'Plots','training-progress',...
'StopTrainingCriteria','AverageReward',...
'StopTrainingValue',Inf,...
'ScoreAveragingWindowLength',10);
trainingOptions.UseParallel = true;
trainingOptions.ParallelizationOptions.Mode = 'async';
trainingOptions.ParallelizationOptions.StepsUntilDataIsSent = 32;
trainingOptions.ParallelizationOptions.DataToSendFromWorkers = 'Experiences';
Comments: 1
Takeshi Takahashi on 18 April 2022
agentOptions.ExperienceBufferLength seems too short, which may indirectly affect the parallel training. Can you increase ExperienceBufferLength to 1e6 or more?
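A minimal sketch of the suggested change, assuming the agent setup from the question is otherwise unchanged (the 1e6 value is the one proposed above, not a verified optimum):

```matlab
%% AGENT setting (revised)
agentOptions = rlSACAgentOptions;
agentOptions.SampleTime = Ts;
agentOptions.DiscountFactor = 0.90;
agentOptions.TargetSmoothFactor = 1e-3;
% Enlarge the replay buffer so experiences collected by the parallel
% workers are not overwritten before they can be sampled. With a buffer
% of only 500 and MiniBatchSize = 256, each gradient step draws from a
% tiny, highly correlated window of recent data, which can destabilize
% off-policy training, especially in async parallel mode.
agentOptions.ExperienceBufferLength = 1e6;
agentOptions.MiniBatchSize = 256;
agentOptions.EntropyWeightOptions.TargetEntropy = -2;
agent = rlSACAgent(actor,[critic1 critic2],agentOptions);
```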


Answers (0)
