- All Implemented Interfaces:
- Enclosing class:
InvertedPendulum
public static class InvertedPendulum.InvertedPendulumRewardFunction
A default reward function for this domain. Returns 0 everywhere except at fail conditions, which return -1.
A fail condition is defined by the pole's angle being greater than some threshold (default PI/2 radians).
- Author:
James MacGlashan
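The reward rule described above can be illustrated with a minimal, stand-alone sketch. It is not the library's implementation: the class name PendulumRewardSketch, the field maxAbsoluteAngle, and the single-argument reward(double) method are hypothetical names chosen for illustration. It also assumes that the threshold applies to the absolute value of the pole angle in the resulting state and that the comparison is strict; the description does not state either detail.

```java
// Hypothetical sketch of the reward rule; not the library's class or API.
final class PendulumRewardSketch {

    // Assumed: angle magnitude (radians) beyond which the pole counts as fallen.
    private final double maxAbsoluteAngle;

    // Uses the documented default threshold of PI/2 radians.
    PendulumRewardSketch() {
        this(Math.PI / 2.0);
    }

    PendulumRewardSketch(double maxAbsoluteAngle) {
        this.maxAbsoluteAngle = maxAbsoluteAngle;
    }

    // Returns -1 at a fail condition and 0 everywhere else.
    // Assumption: the angle passed in is the pole angle of the resulting state.
    double reward(double nextPoleAngle) {
        return Math.abs(nextPoleAngle) > maxAbsoluteAngle ? -1.0 : 0.0;
    }

    public static void main(String[] args) {
        PendulumRewardSketch rf = new PendulumRewardSketch();
        System.out.println(rf.reward(0.1)); // prints 0.0 (pole near upright)
        System.out.println(rf.reward(2.0)); // prints -1.0 (2.0 rad exceeds PI/2)
    }
}
```

Because the reward is 0 everywhere except at failure, an agent maximizing the discounted return is rewarded only for postponing the fail condition, which is why a single threshold parameter is enough to define the task.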
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
public double reward(State s,
Returns the reward received when action a is executed in state s and the agent transitions to state sprime.
- Specified by:
reward in interface
- Parameters:
s - the state in which the action was executed
a - the action executed
sprime - the state to which the agent transitioned
- Returns:
the reward received when action a is executed in state s and the agent transitions to state sprime.