Interface | Description |
---|---|
DirectOptionTerminateMapper | If an option deterministically terminates after a fixed number of steps, it can be useful for the option to transition immediately from the state in which it was initiated to its terminal state, rather than simulating each step of execution. |
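
The idea behind a direct terminate mapping can be sketched as follows. This is a minimal illustration only: the `TerminateMapper` interface, its method names, and the integer state encoding are assumptions made for the example and are not taken from the library's actual API.

```java
public class DirectTerminateSketch {

    /** Hypothetical mapper interface; the names are illustrative only. */
    interface TerminateMapper {
        int terminalState(int startState); // state in which the option ends
        int numSteps(int startState);      // primitive steps the option would take
    }

    /** Assumption: the toy option always takes exactly this many steps to the right. */
    static final int STEPS = 5;

    /** Slow path: simulate each primitive step of the option. */
    static int simulate(int startState) {
        int s = startState;
        for (int i = 0; i < STEPS; i++) {
            s += 1; // one deterministic primitive step
        }
        return s;
    }

    /** Fast path: jump from the initiation state straight to the terminal state. */
    static final TerminateMapper MAPPER = new TerminateMapper() {
        @Override
        public int terminalState(int startState) {
            return startState + STEPS;
        }

        @Override
        public int numSteps(int startState) {
            return STEPS;
        }
    };

    public static void main(String[] args) {
        int start = 2;
        System.out.println("simulated: " + simulate(start));
        System.out.println("mapped:    " + MAPPER.terminalState(start)
                + " in " + MAPPER.numSteps(start) + " steps");
    }
}
```

Because the toy option is deterministic and always runs for a fixed number of steps, the mapped terminal state matches the simulated one, and the step count remains available for discounting.
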
Class | Description |
---|---|
EnvironmentOptionOutcome | |
LocalSubgoalRF | Options are typically defined to follow policies to subgoals, and it is often useful to define these policies with a planning or learning algorithm, in which case a subgoal reward function for the option must be specified. |
LocalSubgoalTF | Options are typically defined to follow policies to subgoals, and it is often useful to define these policies with a planning or learning algorithm, in which case a terminal function for the option must be specified in order to learn or plan its policy. |
OptionEvaluatingRF | A reward function that accepts a reward function for primitive actions and returns that function's reward when the queried action is a primitive. |
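
A local subgoal reward function and terminal function of the kind described for `LocalSubgoalRF` and `LocalSubgoalTF` might look like the sketch below. Everything in it is an assumption made for illustration: the integer state encoding, the `APPLICABLE` and `SUBGOAL` predicates, the three reward constants, and the method names do not come from the library.

```java
import java.util.function.IntPredicate;

public class LocalSubgoalSketch {

    // Hypothetical state encoding: an int cell index on a corridor.
    // The option is assumed applicable in cells 0..10, with the subgoal at cell 10.
    static final IntPredicate APPLICABLE = s -> s >= 0 && s <= 10;
    static final IntPredicate SUBGOAL = s -> s == 10;

    // Assumed reward values, chosen only for the example.
    static final double SUBGOAL_REWARD = 0.0;
    static final double STEP_REWARD = -1.0;
    static final double FAIL_REWARD = -100.0;

    /** Reward for arriving in nextState while planning or learning the option's policy. */
    static double reward(int nextState) {
        if (SUBGOAL.test(nextState)) {
            return SUBGOAL_REWARD;  // reached the subgoal
        }
        if (!APPLICABLE.test(nextState)) {
            return FAIL_REWARD;     // left the region where the option applies
        }
        return STEP_REWARD;         // ordinary step cost
    }

    /** The local problem ends at a subgoal or once the option's region is left. */
    static boolean isTerminal(int state) {
        return SUBGOAL.test(state) || !APPLICABLE.test(state);
    }

    public static void main(String[] args) {
        for (int s : new int[] {5, 10, 11}) {
            System.out.println("state " + s + ": reward=" + reward(s)
                    + ", terminal=" + isTerminal(s));
        }
    }
}
```

A planner or learner run against this pair would be pushed toward the subgoal by the step cost and kept inside the option's region by the failure reward.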