벤치마크 예제

강화 학습 에이전트를 비교하기 위한 벤치마크 문제

강화 학습 에이전트는 그 유형이 다양할 뿐만 아니라 각각의 장단점이 각양각색이면서도 상호 보완적인 경우가 많습니다. 다양한 벤치마크 문제를 대상으로 여러 에이전트의 성능을 비교합니다.

튜토리얼

Compare Agents on the Discrete Double-Integrator Environment
Compare default agents on the MATLAB^® discrete action space double-integrator environment.
Compare Agents on the Discrete Cart-Pole Environment
Compare default agents on the MATLAB discrete action space cart-pole environment.
Compare Agents on the Discrete Pendulum Swing-Up Environment
Compare default agents on the Simulink^® discrete action space simple pendulum swing-up environment.
Compare Agents on the Discrete Simscape Cart-Pole Swing-Up Environment
Compare default agents on the Simscape™ Multibody™ discrete action space cart-pole swing-up environment.
Compare Agents on the Discrete Pendulum Swing-Up with Image Environment
Compare default agents on the MATLAB discrete action space pendulum swing-up with image environment.

Compare Agents on the Continuous Double Integrator Environment
Compare default agents on the MATLAB continuous action space double-integrator environment.
Compare Agents on the Continuous Cart Pole
Compare default agents on the MATLAB continuous action space cart-pole environment.
Comparison of Agents on the Continuous Pendulum Swing-Up Environment
Compare default agents on the Simulink continuous action space simple pendulum swing-up environment.
Compare Agents on the Continuous Cart Pole Swing-Up Environment
Compare default agents on the Simscape Multibody continuous action space cart-pole swing-up environment.
Compare Agents on Continuous Pendulum Swing-Up with Image Environment
Compare default agents on the MATLAB continuous action space simple pendulum swing-up with image environment.

Train PG Agent with Custom Networks to Control Discrete Double Integrator
Train a PG agent with a baseline to control a discrete action space double integrator system modeled in MATLAB.

Train LSPI Agent to Balance Discrete Cart-Pole System
Train an LSPI agent to balance discrete action space cart-pole system modeled in MATLAB.
이산 카트-폴 시스템의 균형을 유지하도록 DQN 에이전트 훈련시키기
MATLAB에서 모델링된 이산 행동 공간 카트-폴 시스템의 균형을 유지하도록 DQN 에이전트를 훈련시킵니다.
이산 카트-폴 시스템의 균형을 유지하도록 PG 에이전트 훈련시키기
MATLAB에서 모델링된 이산 행동 공간 카트-폴 시스템의 균형을 유지하도록 PG 에이전트를 훈련시킵니다.
이산 카트-폴 시스템의 균형을 유지하도록 AC 에이전트 훈련시키기
MATLAB에서 모델링된 이산 행동 공간 카트-폴 시스템의 균형을 유지하도록 AC 에이전트를 훈련시킵니다.
Train AC Agent to Balance Discrete Cart-Pole System Using Parallel Computing
Train an AC agent to control a discrete action space cart-pole system using asynchronous parallel computing.
Train MBPO Agent to Balance Continuous Cart-Pole System
A model-based reinforcement learning agent learns a model of its environment that it can use to generate additional experiences for training.

진자가 위쪽으로 똑바로 서서 균형을 유지하도록 DQN 에이전트 훈련시키기
Simulink에서 모델링된 이산 행동 공간 진자가 위쪽으로 똑바로 서서 균형을 유지하도록 DQN 에이전트를 훈련시킵니다.
진자가 위쪽으로 똑바로 서서 균형을 유지하도록 DDPG 에이전트 훈련시키기
Simulink에서 모델링된 연속 행동 공간 진자의 균형을 유지하도록 DDPG 에이전트를 훈련시킵니다.
Train DDPG Agent to Swing Up and Balance Pendulum with Bus Signal
Train a DDPG agent to balance a continuous action space pendulum Simulink model that contains observations in a bus signal.
Train DDPG Agent to Swing Up and Balance Pendulum with Image Observation
Train a DDPG agent using an image-based observation signal.
Create DQN Agent Using Deep Network Designer and Train Using Image Observations
Create a reinforcement learning agent using the Deep Network Designer app from the Deep Learning Toolbox™.

카트-폴 시스템이 위쪽으로 똑바로 서서 균형을 유지하도록 DDPG 에이전트 훈련시키기
Simscape Multibody에서 모델링된 연속 행동 공간 카트-폴 시스템이 위쪽으로 똑바로 서서 균형을 유지하도록 DDPG 에이전트를 훈련시킵니다.

Train PPO Agent for a Lander Vehicle
Train a discrete PPO agent to land a flying vehicle.
Train Discrete Soft Actor Critic Agent for Lander Vehicle
Train a discrete SAC agent to land a flying vehicle.