create markov decision process environment for reinforcement learning -凯发k8网页登录
create markov decision process environment for reinforcement learning
since r2019a
description
a markov decision process (mdp) is a discrete time stochastic control process. it
provides a mathematical framework for modeling decision making in situations where outcomes
are partly random and partly under the control of the decision maker. mdps are useful for
studying optimization problems solved using reinforcement learning. use
rlmdpenv
to create a markov decision process environment for
reinforcement learning in matlab®.
creation
syntax
description
input arguments
properties
object functions
getactioninfo | obtain action data specifications from reinforcement learning environment, agent, or experience buffer |
getobservationinfo | obtain observation data specifications from reinforcement learning environment, agent, or experience buffer |
sim | simulate trained reinforcement learning agents within specified environment |
train | train reinforcement learning agents within a specified environment |
validateenvironment | validate custom reinforcement learning environment |
examples
version history
introduced in r2019a