Reinforcement Learning Agents generating zero episode
์ด์ ๋๊ธ ํ์
Hello Matlab community,
I faced an issue for training a multiple agent problem in Matlab/Simulink. I tried to solve a very simple problem; however, training stops at episode 1.
Suppose that we have three discrete variables A=[1 2 3], B=[1 2 3], C=[1 2 3].
Reward function = A*B*C;
Observation= A+B+C;
I tried different parameters but it didnโt work. I attached the sample file for the reference. Very appreciate it if you can suggest potential solution to solve convergence issue.
Thanks for your time ๐

์ฑํ๋ ๋ต๋ณ
์ถ๊ฐ ๋ต๋ณ (0๊ฐ)
์นดํ ๊ณ ๋ฆฌ
๋์๋ง ์ผํฐ ๋ฐ File Exchange์์ Reinforcement Learning์ ๋ํด ์์ธํ ์์๋ณด๊ธฐ
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!