정책 배포

코드 생성 및 훈련된 정책 배포

강화 학습 에이전트를 훈련시킨 후, 코드를 만들어 최적 정책을 배포할 수 있습니다. 예를 들어, MATLAB^® Coder™와 GPU Coder™를 사용하여 C++ 또는 CUDA^® 코드를 생성하고 내장 플랫폼에 신경망 정책을 배포할 수 있습니다.

정책을 기반으로 코드를 생성하는 방법에 대한 개요는 Generate Code from Trained Reinforcement Learning Policies 항목을 참조하십시오. 배포된 정책의 훈련에 대한 개요는 Examine Approaches to Fine Tune a Deployed Policy 항목을 참조하십시오.

함수

`generatePolicyFunction`	Generate MATLAB function that evaluates policy of an agent or policy object
`generatePolicyBlock`	Generate Simulink block that evaluates policy of an agent or policy object (R2022b 이후)
`policyParameters`	Obtain structure of policy parameters to update policy during simulation or deployment (R2025a 이후)
`updatePolicyParameters`	Update policy according to structure of policy parameters given as input argument (R2025a 이후)

강화 학습 정책 (R2022b 이후)

강화 학습 워크플로
문제에 강화 학습을 적용할 때 사용하는 일반적인 워크플로.
Generate Code from Trained Reinforcement Learning Policies
You can generate code for reinforcement learning agents using, for example, GPU Coder or MATLAB Coder.
Examine Approaches to Fine Tune a Deployed Policy
Select the best approach to train a policy in the real world.
Generate Policy Block for Deployment
Generate a policy block to deploy a trained policy.
Train Policy Deployed on Raspberry Pi
Use trainFromData in a MATLAB learning loop to train a policy deployed on a Raspberry Pi board.