public static class RewardValueProjection.CustomRewardNoTermModel extends CustomRewardModel
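Nested classes/interfaces inherited from interface FullModel: FullModel.Helper
Fields inherited from class CustomRewardModel: model, rewardFunction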
Constructor and Description |
---|
CustomRewardNoTermModel(SampleModel model, RewardFunction rewardFunction) |
Modifier and Type | Method and Description |
---|---|
protected EnvironmentOutcome | modifyOutcome(EnvironmentOutcome eo) |
boolean | terminal(State s): Indicates whether a state is a terminal state (i.e., no further actions occur and zero reward is received from then on). |
Methods inherited from class CustomRewardModel: sample, transitions
public CustomRewardNoTermModel(SampleModel model, RewardFunction rewardFunction)
public boolean terminal(State s)
Description copied from interface: SampleModel
Specified by: terminal in interface SampleModel
Overrides: terminal in class CustomRewardModel
Parameters: s - the input state to test
protected EnvironmentOutcome modifyOutcome(EnvironmentOutcome eo)
Overrides: modifyOutcome in class CustomRewardModel
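
The overrides listed above can be illustrated with a minimal, self-contained sketch. It does not use the library's classes: `Outcome`, `Model`, and `NoTermModel` are hypothetical stand-ins for `EnvironmentOutcome`, `SampleModel`, and `CustomRewardNoTermModel`, with states reduced to strings. The behavior shown (a `terminal` that always returns false, and a `modifyOutcome` that substitutes the custom reward and clears the terminated flag) is an assumption inferred from the class name "NoTerm"; this page does not state it.

```java
import java.util.function.ToDoubleFunction;

public class NoTermSketch {

    // Hypothetical stand-in for EnvironmentOutcome: next state, reward, terminated flag.
    static final class Outcome {
        final String nextState;
        final double reward;
        final boolean terminated;

        Outcome(String nextState, double reward, boolean terminated) {
            this.nextState = nextState;
            this.reward = reward;
            this.terminated = terminated;
        }
    }

    // Hypothetical stand-in for SampleModel.
    interface Model {
        Outcome sample(String state, String action);
        boolean terminal(String state);
    }

    // Wraps a model, swaps in a custom reward, and reports no terminal states.
    static final class NoTermModel implements Model {
        private final Model model;
        private final ToDoubleFunction<String> rewardFunction;

        NoTermModel(Model model, ToDoubleFunction<String> rewardFunction) {
            this.model = model;
            this.rewardFunction = rewardFunction;
        }

        @Override
        public Outcome sample(String state, String action) {
            return modifyOutcome(model.sample(state, action));
        }

        @Override
        public boolean terminal(String state) {
            return false; // assumption: the "NoTerm" variant never reports termination
        }

        // Replaces the wrapped model's reward and clears the terminated flag (assumed behavior).
        Outcome modifyOutcome(Outcome eo) {
            return new Outcome(eo.nextState, rewardFunction.applyAsDouble(eo.nextState), false);
        }
    }

    public static void main(String[] args) {
        // Base model: every action leads to "goal", which it treats as terminal.
        Model base = new Model() {
            @Override
            public Outcome sample(String state, String action) {
                return new Outcome("goal", -1.0, true);
            }

            @Override
            public boolean terminal(String state) {
                return state.equals("goal");
            }
        };
        Model wrapped = new NoTermModel(base, s -> s.equals("goal") ? 1.0 : 0.0);
        Outcome out = wrapped.sample("start", "move");
        // prints "1.0 false false"
        System.out.println(out.reward + " " + out.terminated + " " + wrapped.terminal("goal"));
    }
}
```

In this sketch the wrapped model's own reward (-1.0) and terminal flag are discarded, so a caller only ever sees the custom reward and a non-terminating process.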