Interface | Description |
---|---|
DPOperator |
Defines a function for applying a dynamic programming operator (e.g., reducing the Q-values into a state value).
|
Class | Description |
---|---|
BellmanOperator |
The standard Bellman operator: max.
|
SoftmaxOperator |
A softmax/Boltzmann operator.
|