dani ansari
Followers: 0 Following: 0
Feeds
질문
solve critic overestimate and how to explore specific action range
hello im using a ddpg agent to tune a robot controller.all of my rewards are negetive and my critic learning rate is 0.01 and m...
대략 1년 전 | 답변 수: 0 | 0
0
답변질문
ddpg agent does not learn
hi im using a ddpg alghorithm to learn for tuning a pd like controller (transpose jacobian) for tuning its gains.my gains need t...
1년 초과 전 | 답변 수: 2 | 0