public class SubDifferentiableMaxOperator extends BellmanOperator implements DifferentiableDPOperator
BellmanOperator
max operator.Constructor and Description |
---|
SubDifferentiableMaxOperator() |
Modifier and Type | Method and Description |
---|---|
FunctionGradient |
gradient(double[] qs,
FunctionGradient[] qGradients)
Returns the gradient of this DP operator, giving the Q-values on which it operates, their gradient.
|
apply
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
apply
public FunctionGradient gradient(double[] qs, FunctionGradient[] qGradients)
DifferentiableDPOperator
gradient
in interface DifferentiableDPOperator
qs
- the q-valuesqGradients
- the gradients of the Q-value