Design a reinforcement learning (RL) based controller to stabilize a quad copter

Question

0 개 추천

Hi everyone, hope all is well at your end.

I need to design a reinforcement learning (RL) based controller as an asignment to stabilize a quad copter from an initial position, orientation and angular velocity. The way I look at the problem is that due to continuous action space i intend to apply a policy gradient actor critic algorithm of RL. Request guide on some Matlab code that utilises policy gradient actor critic algorithm of RL that would help me solve this problem.

Although i am developing a simple quad copter model but if anyone can help me with an existing model that will validate my case ill be grateful for that as well.

Thanking in anticipation...

Regards

댓글 수: 1
이전 댓글 -1개 표시 이전 댓글 -1개 숨기기

Naveed Nekoo 2022년 3월 23일

Any update on this?

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Follow Question

Answer 1

Emmanouil Tzorakoleftherakis 2019년 4월 2일

1 개 추천

Hello,

Not sure if you are still looking for a solution, but starting in R2019a, you can do deep reinforcement learning directly in MATLAB and Simulink with Reinforcement Learning Toolbox.

This video shows how to use the toolbox to teach a robot how to walk with continuous actions. You could use this as a starting point and make changes as necessary for your quadcopter application.

You can also use some existing quadcopter models if you don't want to build your own from scratch. See for example the following links:

https://www.mathworks.com/matlabcentral/fileexchange/63580-quadcopter-simulation-and-control-made-easy

https://www.mathworks.com/examples/aeroblks/mw/aeroblks_product-asbQuadcopterExample-quadcopter-project

https://www.mathworks.com/videos/drone-simulation-and-control-part-4-how-to-build-a-model-for-simulation-1539585112546.html

Hope this helps.

댓글 수: 3
이전 댓글 1개 표시 이전 댓글 1개 숨기기

Unmanned Aerial and Space Systems 2022년 5월 2일

Hello, did you solve your problem?

Muhammad 2022년 8월 3일

Dear Ali, i believe that reward function is also very important for proper training and convergence to reference values. The proper selection of reward function will help you to get good results.

You need to think of the reward as a useful optimization criterion for your problem.

I hope with proper reward function your problem be solved.

댓글을 달려면 로그인하십시오.

Design a reinforcement learning (RL) based controller to stabilize a quad copter

댓글 수: 1
이전 댓글 -1개 표시 이전 댓글 -1개 숨기기

답변 (1개)

댓글 수: 3
이전 댓글 1개 표시 이전 댓글 1개 숨기기

카테고리

태그

Community Treasure Hunt

Design a reinforcement learning (RL) based controller to stabilize a quad copter

댓글 수: 1 이전 댓글 -1개 표시 이전 댓글 -1개 숨기기

답변 (1개)

댓글 수: 3 이전 댓글 1개 표시 이전 댓글 1개 숨기기

카테고리

태그

참고 항목

Community Treasure Hunt

댓글 수: 1
이전 댓글 -1개 표시 이전 댓글 -1개 숨기기

댓글 수: 3
이전 댓글 1개 표시 이전 댓글 1개 숨기기