burlap.behavior.singleagent.learnfromdemo.mlirl.support

Interface Summary
Interface	Description
DifferentiableRF	An interface for defining differentiable reward functions.
QGradientPlanner	An interface for a value function that can produce Q-value gradients.
QGradientPlannerFactory	A factory for generating `QGradientPlanner` objects.
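As a rough illustration of what a differentiable reward function involves, the sketch below shows a linear reward over a feature vector together with its gradient with respect to the parameters. It does not implement the `DifferentiableRF` interface. The class name `LinearRewardSketch`, its methods, and the use of plain `double[]` feature vectors are all assumptions made for this example.

```java
/**
 * Hypothetical, self-contained sketch (not BURLAP API): a reward that is
 * linear in its parameters, R(features) = theta . features.
 */
final class LinearRewardSketch {

    private final double[] theta; // reward parameters

    LinearRewardSketch(double[] theta) {
        this.theta = theta.clone();
    }

    /** Returns the dot product of the parameters and the feature vector. */
    double reward(double[] features) {
        if (features.length != theta.length) {
            throw new IllegalArgumentException("feature/parameter dimension mismatch");
        }
        double r = 0.0;
        for (int i = 0; i < theta.length; i++) {
            r += theta[i] * features[i];
        }
        return r;
    }

    /** For a linear reward, the gradient with respect to theta is the feature vector itself. */
    double[] gradient(double[] features) {
        if (features.length != theta.length) {
            throw new IllegalArgumentException("feature/parameter dimension mismatch");
        }
        return features.clone();
    }
}
```

A linear form is used here only because its gradient is trivial to verify. Any reward whose partial derivatives can be evaluated would serve the same purpose.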

Class Summary
Class	Description
BoltzmannPolicyGradient	This class provides methods to compute the gradient of a Boltzmann policy.
QGradientPlannerFactory.DifferentiableVIFactory	A `DifferentiableVI` factory.
QGradientTuple	A tuple (triple) for storing the Q-gradient associated with a state and action.
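For orientation, the gradient of a Boltzmann (softmax) policy π(a|s) ∝ exp(β Q(s,a)) with respect to the parameters is β π(a|s) (∇Q(s,a) − Σ_a' π(a'|s) ∇Q(s,a')). The sketch below computes this from plain arrays. It is not the `BoltzmannPolicyGradient` implementation. The class name `BoltzmannGradientSketch`, the method signatures, and the array layout (`qGrad[a][d]` is the partial derivative of action `a`'s Q-value with respect to parameter `d`) are assumptions made for this example.

```java
import java.util.Arrays;

/** Hypothetical, self-contained sketch (not BURLAP API). */
final class BoltzmannGradientSketch {

    /** Boltzmann action probabilities for Q-values q and scaling parameter beta. */
    static double[] policy(double[] q, double beta) {
        // Subtract the largest scaled Q-value before exponentiating for numerical stability.
        double max = Double.NEGATIVE_INFINITY;
        for (double v : q) {
            max = Math.max(max, beta * v);
        }
        double[] p = new double[q.length];
        double sum = 0.0;
        for (int a = 0; a < q.length; a++) {
            p[a] = Math.exp(beta * q[a] - max);
            sum += p[a];
        }
        for (int a = 0; a < q.length; a++) {
            p[a] /= sum;
        }
        return p;
    }

    /** Gradient of pi(action | s) with respect to the parameters. */
    static double[] gradient(double[] q, double[][] qGrad, double beta, int action) {
        double[] p = policy(q, beta);
        int dim = qGrad[action].length;

        // Policy-weighted average of the Q-gradients over all actions.
        double[] expected = new double[dim];
        for (int a = 0; a < q.length; a++) {
            for (int d = 0; d < dim; d++) {
                expected[d] += p[a] * qGrad[a][d];
            }
        }

        double[] grad = new double[dim];
        for (int d = 0; d < dim; d++) {
            grad[d] = beta * p[action] * (qGrad[action][d] - expected[d]);
        }
        return grad;
    }

    public static void main(String[] args) {
        double[] q = {1.0, 1.0};
        double[][] qGrad = {{1.0, 0.0}, {0.0, 1.0}};
        System.out.println(Arrays.toString(gradient(q, qGrad, 2.0, 0)));
    }
}
```

Because the action probabilities sum to one, the gradients returned for all actions in a state sum to the zero vector, which makes a convenient sanity check.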
