| Interface | Description | 
|---|---|
| EnumerablePolicy | 
 An interface extension to  
Policy for policies that can enumerate their probability distribution over all actions. | 
| Policy | 
 An interface for defining a  
Policy. | 
| SolverDerivedPolicy | 
 An interface for defining policies that refer to a  
MDPSolverInterface
 objects to defined the policy. | 
| Class | Description | 
|---|---|
| BoltzmannQPolicy | 
 This class implements a Boltzmann policy where the the Q-values represent
 the components of the Boltzmann distribution. 
 | 
| CachedPolicy | 
 This class can be used to lazily cache the policy of a source policy. 
 | 
| EpsilonGreedy | 
 This class defines a an epsilon-greedy policy over Q-values and requires a QComputable valueFunction to be specified. 
 | 
| GreedyDeterministicQPolicy | 
 A greedy policy that breaks ties by choosing the first action with the maximum value. 
 | 
| GreedyQPolicy | 
 A greedy policy that breaks ties by randomly choosing an action amongst the tied actions. 
 | 
| PolicyUtils | |
| RandomPolicy | 
 A uniform random policy for single agent domains. 
 |