Interface | Description |
---|---|
DifferentiableQFunction |
An interface for a valueFunction that can produce Q-value gradients.
|
DifferentiableRF |
An interface for defining differentiable reward functions.
|
DifferentiableValueFunction | |
QGradientPlannerFactory |
A factory for generating
DifferentiableQFunction objects. |
Class | Description |
---|---|
BoltzmannPolicyGradient |
This class provides methods to compute the gradient of a Boltzmann policy.
|
QGradientPlannerFactory.DifferentiableVIFactory |
A
DifferentiableVI factory. |
QGradientTuple |
A tuple (triple) for storing the Q-gradient associated with a state and action.
|