Epsilon greedy policy for DQN

Question

Akash, 16 August 2023

Direct link to this question:

https://kr.mathworks.com/matlabcentral/answers/2009102-epsilon-greedy-policy-for-dqn

Answered: Emmanouil Tzorakoleftherakis, 25 September 2023

Accepted answer: Emmanouil Tzorakoleftherakis

Hello,

I have created a DQN agent with epsilon-greedy exploration. It has 4 discrete actions and 10 observations.

My exploration settings are:

Epsilon = 0.9;

EpsilonDecay = 1e-3;

EpsilonMin = 0.01;

I want to plot the Epsilon value over the episodes during training, or at least find out how Epsilon changes over the course of training. However, I can only see the values listed above, even after training has finished.

If you have any idea how to plot or determine the epsilon value for particular episodes, please let me know.

Thanks


Answer 1

Emmanouil Tzorakoleftherakis, 25 September 2023

Direct link to this answer:

https://kr.mathworks.com/matlabcentral/answers/2009102-epsilon-greedy-policy-for-dqn#answer_1317952

You can use the formula here to calculate the epsilon value.
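As a rough illustration of reconstructing epsilon after the fact, the sketch below assumes the decay rule described in the rlDQNAgentOptions documentation: at the end of each training time step, while Epsilon is greater than EpsilonMin, Epsilon = Epsilon*(1-EpsilonDecay). Because the update happens per step rather than per episode, you need the number of steps in each episode. The episodeSteps vector here is made-up placeholder data, and clamping at EpsilonMin with max is a simplification, so treat the result as an approximation rather than the exact value the agent used.

```matlab
% Values from the question
epsilon0     = 0.9;
epsilonDecay = 1e-3;
epsilonMin   = 0.01;

% Hypothetical data: number of agent steps taken in each training episode.
% If you kept the output of train (e.g. trainingStats = train(agent,env,opts)),
% its EpisodeSteps field could be used here instead (assumption: your release
% provides that field).
episodeSteps = 200 * ones(1, 50);

% Cumulative number of training steps completed at the end of each episode
totalSteps = cumsum(episodeSteps);

% Assumed rule: Epsilon is multiplied by (1 - EpsilonDecay) once per step
% until it reaches EpsilonMin. The max() clamp approximates that stopping point.
epsAtEpisodeEnd = max(epsilonMin, epsilon0 * (1 - epsilonDecay).^totalSteps);

% Plot the reconstructed epsilon against the episode index
plot(1:numel(episodeSteps), epsAtEpisodeEnd);
xlabel('Episode');
ylabel('Epsilon at end of episode');
```

With per-step decay, episodes of different lengths reduce epsilon by different amounts, which is why the curve is computed from the cumulative step count instead of the episode number.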
