how to write DDPG Algorithm at case of L2 Regularization, ADAM Optimization and Number-of-steps-to-look-ahead ?
조회 수: 2 (최근 30일)
이전 댓글 표시
I would like to inquiry how to write the DDPG algorithm if L2 Regularization, ADAM optimization and certain number of steps to look ahead exist in the code ?
i want to know the whole sequence in details like :
1- how the transitions are stored in replay buffer based on certain number of steps to look ahead
2- how could be the shape of cost function if L2 Regularization exist
3- how does adam optimizer works at this case
댓글 수: 0
답변 (0개)
참고 항목
카테고리
Help Center 및 File Exchange에서 Statistics and Machine Learning Toolbox에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!