public static class RewardValueProjection.CustomRewardNoTermModel extends CustomRewardModel
FullModel.Helpermodel, rewardFunction| Constructor and Description |
|---|
CustomRewardNoTermModel(SampleModel model,
RewardFunction rewardFunction) |
| Modifier and Type | Method and Description |
|---|---|
protected EnvironmentOutcome |
modifyOutcome(EnvironmentOutcome eo) |
boolean |
terminal(State s)
Indicates whether a state is a terminal state (i.e., no more action occurs and zero reward received from there on out)
|
sample, transitionspublic CustomRewardNoTermModel(SampleModel model, RewardFunction rewardFunction)
public boolean terminal(State s)
SampleModelterminal in interface SampleModelterminal in class CustomRewardModels - the input state to testprotected EnvironmentOutcome modifyOutcome(EnvironmentOutcome eo)
modifyOutcome in class CustomRewardModel