burlap.behavior.singleagent.planning.commonpolicies

Class Summary
Class	Description
BoltzmannQPolicy	This class implements a Boltzmann policy where the the Q-values represent the components of the Boltzmann distribution.
CachedPolicy	This class can be used to lazily cache the policy of a source policy.
EpsilonGreedy	This class defines a an epsilon-greedy policy over Q-values and requires a QComputable planner to be specified.
GreedyDeterministicQPolicy	A greedy policy that breaks ties by choosing the first action with the maximum value.
GreedyQPolicy	A greedy policy that breaks ties by randomly choosing an action amongst the tied actions.

Package burlap.behavior.singleagent.planning.commonpolicies