public interface QGradientPlanner extends QComputablePlanner
QComputablePlanner.QComputablePlannerHelper
Modifier and Type | Method and Description |
---|---|
java.util.List<QGradientTuple> |
getAllQGradients(State s)
Returns the list of Q-value gradients (returned as
objects ) for each action permissible in the given state. |
QGradientTuple |
getQGradient(State s,
GroundedAction a)
Returns the Q-value gradient (
QGradientTuple ) for the given state and action. |
void |
setBoltzmannBetaParameter(double beta)
Sets this planner's Boltzmann beta parameter used to compute gradients.
|
getQ, getQs
java.util.List<QGradientTuple> getAllQGradients(State s)
objects
) for each action permissible in the given state.s
- the state for which Q-value gradients are to be returned.QGradientTuple getQGradient(State s, GroundedAction a)
QGradientTuple
) for the given state and action.s
- the state for which the Q-value gradient is to be returneda
- the action for which the Q-value gradient is to be returned.void setBoltzmannBetaParameter(double beta)
beta
- the value to which this planner's Boltzmann beta parameter will be set