Reinforcement Learning Agents generating zero episode

Hamid Fazeli on 1 Oct 2022
Commented: Hamid Fazeli on 19 Oct 2022
Hello MATLAB community,
I ran into an issue while training a multi-agent problem in MATLAB/Simulink. The problem is very simple, yet training stops at episode 1.
Suppose that we have three discrete variables A=[1 2 3], B=[1 2 3], C=[1 2 3].
Reward function = A*B*C;
Observation= A+B+C;
I tried different parameters, but it didn't work. I have attached a sample file for reference. I would very much appreciate it if you could suggest a potential solution to this convergence issue.
Thanks for your time 😊

Accepted Answer

Ari Biswas on 4 Oct 2022
There is an issue with the way you specified the reset function. Your function resetRobots should return a Simulink.SimulationInput object, which is also an input argument to the function. For example, the correct function signature would be:
function in = resetRobots(in, var1, var2, var3)
% write reset code
end
See this example for how reset functions can be defined.
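To make the signature above more concrete, here is a minimal sketch of what such a reset function might look like. It assumes the Simulink model reads three workspace variables; the names 'var1', 'var2', and 'var3' are placeholders for illustration, not taken from the attached file.

```matlab
function in = resetRobots(in, var1, var2, var3)
% resetRobots  Sketch of a reset function for a Simulink RL environment.
%   in is a Simulink.SimulationInput object. setVariable returns a
%   modified copy, so each result must be assigned back to in.
%   The variable names below are hypothetical; use the names your
%   model actually reads.
    in = setVariable(in, 'var1', var1);
    in = setVariable(in, 'var2', var2);
    in = setVariable(in, 'var3', var3);
end
```

The function could then be attached to the environment with something like env.ResetFcn = @(in) resetRobots(in, 1, 2, 3); where env is the object returned by rlSimulinkEnv and the values 1, 2, 3 are example initial values. The key point is that the function both receives and returns in, so the training loop gets back a valid simulation input for each episode.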
Comments: 1
Hamid Fazeli on 19 Oct 2022
Hi Ari,
Thank you so much for the suggestion. The problem was the reset function. It is fixed now :)


Release

R2021b
