| Interface | Description |
|---|---|
| DPOperator |
Defines a function for applying a dynamic programming operator (e.g., reducing the Q-values into a state value).
|
| Class | Description |
|---|---|
| BellmanOperator |
The standard Bellman operator: max.
|
| SoftmaxOperator |
A softmax/Boltzmann operator.
|