public class DifferentiableSoftmaxOperator extends SoftmaxOperator implements DifferentiableDPOperator
SoftmaxOperator
beta, temp
Constructor and Description |
---|
DifferentiableSoftmaxOperator() |
DifferentiableSoftmaxOperator(double beta) |
Modifier and Type | Method and Description |
---|---|
FunctionGradient |
gradient(double[] qs,
FunctionGradient[] qGradients)
Returns the gradient of this DP operator, giving the Q-values on which it operates, their gradient.
|
apply, getBeta, setBeta
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
apply
public DifferentiableSoftmaxOperator()
public DifferentiableSoftmaxOperator(double beta)
public FunctionGradient gradient(double[] qs, FunctionGradient[] qGradients)
DifferentiableDPOperator
gradient
in interface DifferentiableDPOperator
qs
- the q-valuesqGradients
- the gradients of the Q-value