Design a reinforcement learning (RL) based controller to stabilize a quad copter
이전 댓글 표시
Hi everyone, hope all is well at your end.
I need to design a reinforcement learning (RL) based controller as an asignment to stabilize a quad copter from an initial position, orientation and angular velocity. The way I look at the problem is that due to continuous action space i intend to apply a policy gradient actor critic algorithm of RL. Request guide on some Matlab code that utilises policy gradient actor critic algorithm of RL that would help me solve this problem.
Although i am developing a simple quad copter model but if anyone can help me with an existing model that will validate my case ill be grateful for that as well.
Thanking in anticipation...
Regards
답변 (1개)
Emmanouil Tzorakoleftherakis
2019년 4월 2일
1 개 추천
Hello,
Not sure if you are still looking for a solution, but starting in R2019a, you can do deep reinforcement learning directly in MATLAB and Simulink with Reinforcement Learning Toolbox.
This video shows how to use the toolbox to teach a robot how to walk with continuous actions. You could use this as a starting point and make changes as necessary for your quadcopter application.
You can also use some existing quadcopter models if you don't want to build your own from scratch. See for example the following links:
Hope this helps.
댓글 수: 3
Ali Aboubih
2022년 2월 23일
I have been attempting to get this to work on my own for a while, and have tried various agents and network architectures, but I haven't had any luck. Would you be able to assit me? Here is my code for Actor-Critic, the simulink file, and I still get results like this:

Unmanned Aerial and Space Systems
2022년 5월 2일
Hello, did you solve your problem?
Muhammad
2022년 8월 3일
Dear Ali, i believe that reward function is also very important for proper training and convergence to reference values. The proper selection of reward function will help you to get good results.
You need to think of the reward as a useful optimization criterion for your problem.
I hope with proper reward function your problem be solved.
카테고리
도움말 센터 및 File Exchange에서 Quadcopters and Drones에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!