Mixed DataType in action space

Question

Sania Gul on 13 May 2023

Direct link to this question:

https://kr.mathworks.com/matlabcentral/answers/1962479-mixed-datatype-in-action-space

Commented: Shaik on 15 May 2023

For an action space with mixed data types, which should be used: rlFiniteSetSpec or rlNumericSpec? I have seen a query about mixing them in the observation space, but does the same hold for the action space too?

https://www.mathworks.com/matlabcentral/answers/706758-a-mix-of-rlnumericspec-and-rlfinitesetspec-objects-observation-for-a-rl-environment

Answer 1

Shaik on 13 May 2023

Direct link to this answer:

https://kr.mathworks.com/matlabcentral/answers/1962479-mixed-datatype-in-action-space#answer_1234234

Hi,

The choice between rlFiniteSetSpec and rlNumericSpec for the action space depends on the nature of the action you want to define in your reinforcement learning (RL) environment.

rlFiniteSetSpec: Use this specification when the action space is a finite set of discrete actions, such as "move forward," "turn left," or "turn right." Each action is identified by a unique numeric value (or numeric array) in the set.
rlNumericSpec: Use this specification when the action is continuous, that is, when it can take any numerical value within a range, optionally bounded by lower and upper limits. Examples are the speed or position command of a robotic arm.
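
As a minimal sketch of the two specification types (requires Reinforcement Learning Toolbox), the lines below create one discrete and one continuous action specification. The numeric codes 1, 2, 3, the names "maneuver" and "speed", and the limits of -1 and 1 are illustrative assumptions, not values from the question.

```matlab
% Discrete action: three numeric codes standing in for
% "move forward" (1), "turn left" (2), and "turn right" (3).
% The codes themselves are an arbitrary, hypothetical choice.
discreteActInfo = rlFiniteSetSpec([1 2 3]);
discreteActInfo.Name = "maneuver";

% Continuous action: a scalar speed command bounded to [-1, 1].
% The bounds are assumed for illustration only.
continuousActInfo = rlNumericSpec([1 1], 'LowerLimit', -1, 'UpperLimit', 1);
continuousActInfo.Name = "speed";
```

The Elements property of the first object lists the allowed actions, while the second object only records the dimension and the limits of the continuous range.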

When defining the action space in an RL environment, choose the specification that matches the characteristics of the actions you want to represent. A specification can in principle combine rlFiniteSetSpec and rlNumericSpec objects if your environment requires both discrete and continuous actions. Whether a given agent accepts such a mixed action specification depends on the agent type and the toolbox release, so check the agent's documentation before relying on it.
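
If the agent you use turns out to accept only a single action channel, one possible workaround is to discretize the continuous component and enumerate every combination as one finite set. This is a sketch under that assumption and is not taken from the thread; the variable names torqueLevels and thrustLevels and their values are hypothetical.

```matlab
% Hypothetical discrete component: three torque levels.
torqueLevels = [-1 0 1];

% Hypothetical continuous component, discretized into five levels on [0, 1].
thrustLevels = linspace(0, 1, 5);

% Enumerate all (torque, thrust) pairs, one pair per row.
[T, F] = ndgrid(torqueLevels, thrustLevels);
combos = [T(:) F(:)];              % 15-by-2 matrix

% rlFiniteSetSpec accepts a cell array of equally sized numeric arrays,
% so each 1-by-2 row becomes one allowed action.
elements = num2cell(combos, 2);    % 15-by-1 cell array of 1-by-2 vectors
actInfo = rlFiniteSetSpec(elements);
actInfo.Name = "torque_and_thrust";
```

The trade-off is that the number of actions grows as the product of the level counts, so a fine grid on the continuous component quickly enlarges the action set.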

Regarding the link you provided, it refers to a query about the observation space, but the same concept applies to the action space as well. The rlFiniteSetSpec and rlNumericSpec objects can be used to define both observation and action spaces in an RL environment, depending on the type of data they represent.

In summary, consider the nature of your action space (discrete or continuous) and use either rlFiniteSetSpec or rlNumericSpec accordingly to define the appropriate specification for your action space in the RL environment.

Comments (6; 4 older comments hidden)

Shaik on 14 May 2023

If you want to keep the rotational forces discrete while keeping the states 'x' and 'y' continuous, one possibility is to modify the state representation. One approach is to represent the angle θ as an integer bin index over its range (in degrees or radians) rather than using sin(θ) and cos(θ).

Here's an example of how you could modify the state representation:

1. Define a minimum and maximum angle, θ_min and θ_max, in degrees or radians, depending on your preference. For this example, assume that θ varies between -180 and 180 degrees.
2. Represent the rotation angle θ as an integer bin index between 1 and num_bins. You can use the discretize function in MATLAB to obtain the index. For example:

theta_bins = linspace(-180, 180, num_bins + 1); % num_bins + 1 edges divide the range [-180, 180] into num_bins equally spaced intervals

theta = discretize(atan2(sin_theta, cos_theta)*180/pi, theta_bins); % bin index from 1 to num_bins

Here, num_bins is the number of intervals you want to divide the range into. The atan2 function recovers the angle in radians from sin_theta and cos_theta, and the factor 180/pi converts it to degrees.

3. Update the state representation to use the new rotation angle θ instead of sin(θ) and cos(θ). You would need to modify the observation space to include the new state variable, and also the action space so that each discrete action maps to the corresponding continuous control input.

You can experiment with different values of num_bins to see how it affects the discretization of the rotation angle. A larger value of num_bins will result in a finer discretization of the angle, at the cost of increased computational complexity and training time.
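
The binning steps above can be tried on a few sample angles in base MATLAB. This is a sketch with assumed values: num_bins = 8 and the test angles are chosen only for illustration. Note that linspace needs num_bins + 1 edges to produce num_bins intervals.

```matlab
num_bins = 8;                                    % assumed number of angle bins
theta_bins = linspace(-180, 180, num_bins + 1);  % 9 edges -> 8 bins of 45 degrees
bin_width = 360 / num_bins;                      % resolution of the discretization

% Sample angles in degrees, chosen away from the bin edges.
theta_true = [-100 -30 10 100 170];
sin_theta = sind(theta_true);
cos_theta = cosd(theta_true);

% Recover the angle in degrees from sin/cos, then map it to a bin index.
theta_deg = atan2(sin_theta, cos_theta) * 180/pi;
theta_idx = discretize(theta_deg, theta_bins);

disp(theta_idx)   % 2 4 5 7 8
```

Each bin here covers 45 degrees; doubling num_bins halves bin_width, which is the finer discretization, and the larger discrete state set, that the paragraph above refers to.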

Sania Gul on 15 May 2023

I have already done it :-)

Shaik on 15 May 2023

Thanks Sania! :)
