public interface Heuristic
Modifier and Type | Method and Description |
---|---|
double | h(State s): Returns the estimated amount of reward that will be received when following the optimal policy from the given state. |
double h(State s)
s - the state from which to estimate the future reward.
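
As an illustration of the contract described above, the following is a minimal sketch of one possible implementation. The `State` and `Heuristic` declarations here are local stand-ins so the example compiles on its own, and `LineState` and `DistanceHeuristic` are hypothetical names that are not part of the documented API. The sketch also assumes a domain in which every step costs a reward of -1, so the estimated future reward is the negated distance to the goal.

```java
// Stand-in declarations so this sketch is self-contained; in real use the
// State and Heuristic types would come from the library itself.
interface State {}

interface Heuristic {
    double h(State s);
}

// Hypothetical state: an agent's position on a one-dimensional line.
final class LineState implements State {
    final int x;

    LineState(int x) {
        this.x = x;
    }
}

// Hypothetical heuristic. Assumes each step yields a reward of -1, so the
// reward collected under the optimal policy is -(distance to the goal).
final class DistanceHeuristic implements Heuristic {
    private final int goalX;

    DistanceHeuristic(int goalX) {
        this.goalX = goalX;
    }

    @Override
    public double h(State s) {
        // This sketch only understands LineState; other State types will
        // fail with a ClassCastException.
        LineState ls = (LineState) s;
        return -Math.abs(goalX - ls.x);
    }
}
```

With a goal at position 5, `new DistanceHeuristic(5).h(new LineState(2))` evaluates to `-3.0`, and the goal state itself evaluates to `0.0`. Because the value is an estimate of reward rather than cost, states farther from the goal receive more negative values under this assumption.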