public interface DirectOptionTerminateMapper
Option
object
to prevent the option from performing the simulation.
If the options path and number of steps cannot be guaranteed in advance, then this interface
should not be implemented.Modifier and Type | Method and Description |
---|---|
State |
generateOptionTerminalState(State s)
Returns the termination state that will result from applying the option associated with this object
in the given initiation state.
|
double |
getCumulativeReward(State s,
State sp,
RewardFunction rf,
double discount)
Returns the cumulative discounted reward that would be received from applying the option associated with this object
in the given initiation state.
|
int |
getNumSteps(State s,
State sp)
Returns the number of steps that would have occurred for taking the option associated with this object
from state
s and terminating in state sp . |
State generateOptionTerminalState(State s)
s
- the initiation state in which the option associated with this object will be applied.int getNumSteps(State s, State sp)
s
and terminating in state sp
.s
- the initiation state in which the option associated with this object will be applied.sp
- the terminal state the option will reachdouble getCumulativeReward(State s, State sp, RewardFunction rf, double discount)
s
- the initiation state in which the option associated with this object will be applied.sp
- the terminal state the option will reachrf
- the reward function being useddiscount
- the discount factor