| Interface | Description |
|---|---|
| DifferentiableRF |
An interface for defining differentiable reward functions.
|
| QGradientPlanner |
An interface for a valueFunction that can produce Q-value gradients.
|
| QGradientPlannerFactory |
A factory for generating
QGradientPlanner objects. |
| Class | Description |
|---|---|
| BoltzmannPolicyGradient |
This class provides methods to compute the gradient of a Boltzmann policy.
|
| QGradientPlannerFactory.DifferentiableVIFactory |
A
DifferentiableVI factory. |
| QGradientTuple |
A tuple (triple) for storing the Q-gradient associated with a state and action.
|