DRL, handing two or more observations in Step function

Question

0 개 추천

Hi, I am having a problem in RL environment. I have two types of input (observations). I efined it as:

obsInfo = [rlNumericSpec([1,total_obs]) rlNumericSpec([1,total_obs])] ;

Now, I want to set them to some values in the step function. I do not know how to do that. I have one variable State as env. property. Does anyone knows how to manage it? Thanks

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Follow Question

Answer 1

Jack 2025년 3월 10일

MATLAB Online에서 열기

0 개 추천

When you define multiple observations using an array of rlNumericSpec, your environment’s step function should return a cell array containing each observation. For example, if you have defined:

obsInfo = [rlNumericSpec([1, total_obs]) rlNumericSpec([1, total_obs])];

Then in your step function you need to split your state (or otherwise create two observation vectors) and return them as a cell array. For instance, if your environment property State is a vector that combines both observation parts, you could do something like:

function [Observation, Reward, IsDone, LoggedSignals] = stepFunction(this, Action)
    % Update the state based on Action, etc.
    % Assume this.State is a 1-by-(2*total_obs) vector:
    obs1 = this.State(1:total_obs);
    obs2 = this.State(total_obs+1:end);
    % Return observations as a cell array matching obsInfo:
    Observation = {obs1, obs2};
    % Compute reward, check if done, and assign logged signals
    Reward = ...;
    IsDone = ...;
    LoggedSignals = ...;
end

This way, MATLAB’s RL framework will correctly map each cell to its corresponding rlNumericSpec from obsInfo.

Follow me so you can message me anytime with future questions. If this helps, please accept the answer and upvote it as well.

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

DRL, handing two or more observations in Step function

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

카테고리

제품

태그

Community Treasure Hunt

DRL, handing two or more observations in Step function

댓글 수: 0 이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0 이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

카테고리

제품

태그

참고 항목

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기