public static class BeliefMDPGenerator.BeliefRF extends java.lang.Object implements RewardFunction
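A belief MDP reward function derived from a source POMDP reward function. The input State objects must be instances of classes that implement BeliefState
and EnumerableBeliefState.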
If the POMDP reward function does not depend on the next state, declare this with the srcRFIsNextStateIndependent
flag in the burlap.oomdp.singleagent.pomdp.BeliefMDPGenerator.BeliefRF#BeliefRF(PODomain, RewardFunction, boolean)
constructor. Doing so reduces the computational cost, because the next states do not have to be marginalized over.

| Modifier and Type | Field and Description |
|---|---|
| protected PODomain | podomain: The source POMDP domain |
| protected RewardFunction | pomdpRF: The source POMDP reward function to turn into a belief MDP reward function |
| protected boolean | srcRFIsNextStateIndependent: A boolean flag indicating whether the POMDP reward function is independent of the next state transition. |
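The saving offered by srcRFIsNextStateIndependent can be made explicit with the usual expected-reward form of a belief MDP. This is a sketch in standard notation, not a statement of the exact implementation: the symbols b (belief over hidden states), T (POMDP transition probabilities) and R (POMDP reward) are assumptions introduced here and do not appear in the API.

When the POMDP reward is independent of the next state, only the belief is summed over:

$$
R_b(b, a) = \sum_{s} b(s)\, R(s, a)
$$

When the reward depends on the next state, each hidden state additionally requires a sum over its possible successors:

$$
R_b(b, a) = \sum_{s} b(s) \sum_{s'} T(s' \mid s, a)\, R(s, a, s')
$$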
| Constructor and Description |
|---|
| BeliefMDPGenerator.BeliefRF(PODomain podomain, RewardFunction pomdpRF): Initializes. |
| BeliefMDPGenerator.BeliefRF(PODomain podomain, RewardFunction pomdpRF, boolean srcRFIsNextStateIndependent): Initializes. |
| Modifier and Type | Method and Description |
|---|---|
| double | reward(State s, GroundedAction a, State sprime): Returns the reward received when action a is executed in state s and the agent transitions to state sprime. |
| protected double | saOnlyReward(State s, GroundedAction a): Returns the belief MDP reward when the POMDP reward function is independent from the next state transition. |
| protected double | sasReward(State s, GroundedAction a): Returns the belief MDP reward when the POMDP reward function is dependent on the next state transition. |
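To illustrate the difference between the two protected methods, here is a minimal, library-free sketch of the two expectations. It is not BURLAP's implementation: the class name BeliefRewardSketch, the array-based representation of beliefs, transitions and rewards, and the fixed single action are all assumptions made for this example.

```java
/** Hypothetical, library-free sketch; names and array layout are illustrative assumptions. */
final class BeliefRewardSketch {

    /**
     * Expected reward when the POMDP reward depends only on (s, a):
     * sum over hidden states s of belief[s] * rewardSA[s].
     */
    static double saOnlyReward(double[] belief, double[] rewardSA) {
        double sum = 0.0;
        for (int s = 0; s < belief.length; s++) {
            sum += belief[s] * rewardSA[s];
        }
        return sum;
    }

    /**
     * Expected reward when the POMDP reward depends on (s, a, s'):
     * sum over s of belief[s] * sum over s' of transition[s][s'] * rewardSAS[s][s'].
     * The inner loop is the marginalization over next states.
     */
    static double sasReward(double[] belief, double[][] transition, double[][] rewardSAS) {
        double sum = 0.0;
        for (int s = 0; s < belief.length; s++) {
            if (belief[s] == 0.0) {
                continue; // states with zero belief contribute nothing
            }
            for (int sp = 0; sp < transition[s].length; sp++) {
                sum += belief[s] * transition[s][sp] * rewardSAS[s][sp];
            }
        }
        return sum;
    }

    public static void main(String[] args) {
        double[] belief = {0.25, 0.75};                  // belief over two hidden states
        double[] rewardSA = {2.0, -1.0};                 // R(s, a) for one fixed action
        double[][] transition = {{0.5, 0.5}, {0.0, 1.0}}; // T(s' | s, a) for the same action
        double[][] rewardSAS = {{0.0, 8.0}, {-1.0, -1.0}}; // R(s, a, s')

        System.out.println(saOnlyReward(belief, rewardSA));
        System.out.println(sasReward(belief, transition, rewardSAS));
    }
}
```

The first method does work proportional to the number of hidden states, while the second multiplies that by the number of successor states, which is the extra cost avoided when the reward is declared next-state independent.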
protected PODomain podomain
protected RewardFunction pomdpRF
protected boolean srcRFIsNextStateIndependent

public BeliefMDPGenerator.BeliefRF(PODomain podomain, RewardFunction pomdpRF)
podomain - the source POMDP domain
pomdpRF - the source POMDP reward function

public BeliefMDPGenerator.BeliefRF(PODomain podomain, RewardFunction pomdpRF, boolean srcRFIsNextStateIndependent)
podomain - the source POMDP domain
pomdpRF - the source POMDP reward function
srcRFIsNextStateIndependent - a boolean flag indicating whether the POMDP reward function is independent of the next state transition.

public double reward(State s, GroundedAction a, State sprime)
Specified by: reward in interface RewardFunction
s - the state in which the action was executed
a - the action executed
sprime - the state to which the agent transitioned

protected double saOnlyReward(State s, GroundedAction a)
s - the input belief state
a - the action selected.

protected double sasReward(State s, GroundedAction a)
s - the input belief state
a - the action selected.