Interface | Description |
---|---|
EnumerablePolicy |
An interface extension to
Policy for policies that can enumerate their probability distribution over all actions. |
Policy |
An interface for defining a
Policy . |
SolverDerivedPolicy |
An interface for defining policies that refer to a
MDPSolverInterface
objects to defined the policy. |
Class | Description |
---|---|
BoltzmannQPolicy |
This class implements a Boltzmann policy where the the Q-values represent
the components of the Boltzmann distribution.
|
CachedPolicy |
This class can be used to lazily cache the policy of a source policy.
|
EpsilonGreedy |
This class defines a an epsilon-greedy policy over Q-values and requires a QComputable valueFunction to be specified.
|
GreedyDeterministicQPolicy |
A greedy policy that breaks ties by choosing the first action with the maximum value.
|
GreedyQPolicy |
A greedy policy that breaks ties by randomly choosing an action amongst the tied actions.
|
PolicyUtils | |
RandomPolicy |
A uniform random policy for single agent domains.
|