public static class InvertedPendulum.InvertedPendulumRewardFunction extends java.lang.Object implements RewardFunction
Constructor and Description |
---|
InvertedPendulumRewardFunction() |
InvertedPendulumRewardFunction(double maxAbsoluteAngle)
Initializes with a max pole angle as specified in radians
|
Modifier and Type | Method and Description |
---|---|
double |
reward(State s,
Action a,
State sprime)
Returns the reward received when action a is executed in state s and the agent transitions to state sprime.
|
public InvertedPendulumRewardFunction()
public InvertedPendulumRewardFunction(double maxAbsoluteAngle)
maxAbsoluteAngle
- the maximum pole angle in radians that causes task termination/failure.public double reward(State s, Action a, State sprime)
RewardFunction
reward
in interface RewardFunction
s
- the state in which the action was executeda
- the action executedsprime
- the state to which the agent transitioned