| Class | Description |
|---|---|
| BoltzmannQPolicy |
This class implements a Boltzmann policy where the the Q-values represent
the components of the Boltzmann distribution.
|
| CachedPolicy |
This class can be used to lazily cache the policy of a source policy.
|
| EpsilonGreedy |
This class defines a an epsilon-greedy policy over Q-values and requires a QComputable planner to be specified.
|
| GreedyDeterministicQPolicy |
A greedy policy that breaks ties by choosing the first action with the maximum value.
|
| GreedyQPolicy |
A greedy policy that breaks ties by randomly choosing an action amongst the tied actions.
|