Francisco Serra

Last seen: about 2 years ago | Active since 2024

Followers: 0 Following: 0

Statistics

Feeds

Question

Why is my DDPG agent converging to a state where it is continuously penalized, when there is a state it can reach with zero penalty?
I am training a Reinforcement Learning DDPG agent to drive a vehicle to a reference. The vehicle dynamics are: x_dot = v*cos(...

More than 2 years ago | Answers: 1 | 0

1

Answer