I see a zero mean reward for the first agent in multi-agent RL Toolbox

Question

ali farid 2023년 9월 11일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2019651-i-see-a-zero-mean-reward-for-the-first-agent-in-multi-agent-rl-toolbox

답변: TARUN 2025년 4월 22일

Hello, I have extended the PPO Coverage coverage path planning example of the Matlab for 5 agents. I can see now that always, I have a reward for the first agent, and the problem is always, I see a zero mean reward in the toolbox for the first agent like the following image which is not correct. Do you have any idea what is happening there?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

TARUN 2025년 4월 22일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/2019651-i-see-a-zero-mean-reward-for-the-first-agent-in-multi-agent-rl-toolbox#answer_1563953

Hi @ali faridi,

I understand that you are experiencing an issue with the reward for the first agent in your multi-agent PPO setup.

Here are a few things you can check to resolve the issue:

Reward Function: Inspect your environment's step function. Ensure that the reward vector (or structure) includes a non-zero value for the first agent (“rlPPOAgent”).
Agent Configuration: Make sure “rlPPOAgent” is correctly associated with its environment and policy.
Environment Setup: You can double-check the environment setup to make sure all agents are interacting with it as intended.
Training Parameters: Review the training parameters specific to the first agent, like the learning rate and discount factor.

These are some of the ways that might help you to fix the problem. If not, please provide the code that you are working with so that I can take a deeper look.

Feel free to refer this documentation on “Agents”:

https://www.mathworks.com/help/releases/R2022a/reinforcement-learning/agents.html?searchHighlight=agents&searchResultIndex=1

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

I see a zero mean reward for the first agent in multi-agent RL Toolbox

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

I see a zero mean reward for the first agent in multi-agent RL Toolbox

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

제품

릴리스

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기