Reinforcement Learning Agents generating zero episode

Question

Hello MATLAB community,

I ran into an issue while training a multi-agent problem in MATLAB/Simulink. I am trying to solve a very simple problem; however, training stops at episode 1.

Suppose that we have three discrete variables A=[1 2 3], B=[1 2 3], C=[1 2 3].

Reward function = A*B*C;

Observation = A+B+C;
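As a sanity check on the problem as stated, the search space is small enough to enumerate directly. The sketch below assumes each of A, B, and C takes a single value from [1 2 3] per step; it is not taken from the attached file, and the variable names are illustrative only.

```matlab
% Enumerate every combination of the three discrete variables.
vals = [1 2 3];
[A, B, C] = ndgrid(vals, vals, vals);

reward = A .* B .* C;   % reward for each combination
obs    = A + B + C;     % observation for each combination

% Find the combination with the highest reward.
[bestReward, idx] = max(reward(:));
fprintf('Best reward %d at A=%d, B=%d, C=%d (observation %d)\n', ...
    bestReward, A(idx), B(idx), C(idx), obs(idx));
```

There are 27 combinations, and the largest reward is 27 at A = B = C = 3, so a trained set of agents would be expected to settle on that choice.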

I tried different parameters, but it didn't work. I have attached the sample file for reference. I would very much appreciate it if you could suggest a potential solution to the convergence issue.

Thanks for your time 😊

Answer 1 (accepted)

Ari Biswas, 4 October 2022

There is an issue with the way you specified the reset function. Your function resetRobots should accept a Simulink.SimulationInput object as an input argument and return that object as its output. For example, the correct function signature would be:

function in = resetRobots(in, var1, var2, var3)
% write reset code
end

See this example for how reset functions can be defined:

https://www.mathworks.com/help/reinforcement-learning/ug/water-tank-reinforcement-learning-environment-model.html#WaterTankEnvironmentModelExample-5
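As a rough illustration of the signature above, here is a minimal sketch of how such a reset function might be attached to the environment. It assumes env was already created with rlSimulinkEnv, and that the model reads its initial values from workspace variables; the names A0, B0, and C0 and the use of randi are hypothetical and not taken from the attached file.

```matlab
% Assumption: env already exists, created with rlSimulinkEnv for your model.
% The environment calls ResetFcn with one input, so the extra arguments
% are bound inside an anonymous function.
env.ResetFcn = @(in) resetRobots(in, randi(3), randi(3), randi(3));

function in = resetRobots(in, var1, var2, var3)
    % in is a Simulink.SimulationInput object. Modify it and return it.
    % A0, B0, C0 are hypothetical variable names used by the model.
    in = setVariable(in, 'A0', var1);
    in = setVariable(in, 'B0', var2);
    in = setVariable(in, 'C0', var3);
end
```

The important part is that the function returns the same in object it receives, after modifying it, rather than returning nothing or some other value.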

Comments: 1

Hamid Fazeli, 19 October 2022

Hi Ari,

Thank you so much for the suggestion. The problem was the reset function. It is fixed now :)
