public static class QProvider.Helper
extends java.lang.Object
Modifier and Type | Method and Description |
---|---|
static double | maxQ(QProvider qSource, State s): Returns the optimal state value function for a state given a QProvider. |
static double | policyValue(QProvider qSource, State s, EnumerablePolicy p): Returns the state value under a given policy for a state and QProvider. |
public static double maxQ(QProvider qSource, State s)
Returns the optimal state value function for a state given a QProvider. The optimal value is the max Q-value. If no actions are permissible in the input state, then zero is returned.
public static double policyValue(QProvider qSource, State s, EnumerablePolicy p)
Returns the state value under a given policy for a state and QProvider. The value is the expected Q-value under the input policy's action distribution. If no actions are permissible in the input state, then zero is returned.
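The two computations described above can be illustrated with a minimal, self-contained sketch. It is not the library's implementation and does not use QProvider, State, or EnumerablePolicy. As an assumption made only for illustration, the Q-values and the policy's action probabilities for a single state are held in plain maps keyed by a hypothetical action name, and the class name QValueSketch is likewise invented. The optimal value is taken as the maximum Q(s, a) over actions, the policy value as the sum over actions of pi(a|s) * Q(s, a), and both return zero when no actions are available.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for illustration only: Q-values and policy
// probabilities for one state are stored in maps keyed by action name.
final class QValueSketch {

    // Optimal state value: the largest Q-value, or 0 if there are no actions.
    static double maxQ(Map<String, Double> qValues) {
        if (qValues.isEmpty()) {
            return 0.0;
        }
        double max = Double.NEGATIVE_INFINITY;
        for (double q : qValues.values()) {
            max = Math.max(max, q);
        }
        return max;
    }

    // State value under a policy: sum of pi(a|s) * Q(s, a) over all actions,
    // or 0 if there are no actions. Actions missing from actionProbs are
    // treated as having probability 0 (an assumption of this sketch).
    static double policyValue(Map<String, Double> qValues,
                              Map<String, Double> actionProbs) {
        if (qValues.isEmpty()) {
            return 0.0;
        }
        double sum = 0.0;
        for (Map.Entry<String, Double> e : qValues.entrySet()) {
            double p = actionProbs.getOrDefault(e.getKey(), 0.0);
            sum += p * e.getValue();
        }
        return sum;
    }

    public static void main(String[] args) {
        Map<String, Double> q = new LinkedHashMap<>();
        q.put("north", 1.0);
        q.put("south", 3.0);

        Map<String, Double> pi = new LinkedHashMap<>();
        pi.put("north", 0.5);
        pi.put("south", 0.5);

        System.out.println(maxQ(q));            // prints 3.0
        System.out.println(policyValue(q, pi)); // prints 2.0
    }
}
```

With a uniform policy over the two actions, the policy value (2.0) is lower than the optimal value (3.0), which is expected because the maximum Q-value is an upper bound on any probability-weighted average of the Q-values.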