public static class BeliefMDPGenerator.BeliefRF extends java.lang.Object implements RewardFunction
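A reward function that turns a source POMDP reward function into a belief MDP reward function. The input State objects must be classes that implement BeliefState and EnumerableBeliefState.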
If the POMDP reward function does not depend on the next state, declare this with the srcRFIsNextStateIndependent flag in the burlap.oomdp.singleagent.pomdp.BeliefMDPGenerator.BeliefRF#BeliefRF(PODomain, RewardFunction, boolean) constructor. Setting the flag reduces the computational cost, because the next states do not have to be marginalized over.

Modifier and Type | Field and Description |
---|---|
protected PODomain | podomain: The source POMDP domain. |
protected RewardFunction | pomdpRF: The source POMDP reward function to turn into a belief MDP reward function. |
protected boolean | srcRFIsNextStateIndependent: A boolean flag indicating whether the POMDP reward function is independent of the next state transition. |
Constructor and Description |
---|
BeliefMDPGenerator.BeliefRF(PODomain podomain, RewardFunction pomdpRF): Initializes. |
BeliefMDPGenerator.BeliefRF(PODomain podomain, RewardFunction pomdpRF, boolean srcRFIsNextStateIndependent): Initializes. |
Modifier and Type | Method and Description |
---|---|
double | reward(State s, GroundedAction a, State sprime): Returns the reward received when action a is executed in state s and the agent transitions to state sprime. |
protected double | saOnlyReward(State s, GroundedAction a): Returns the belief MDP reward when the POMDP reward function is independent of the next state transition. |
protected double | sasReward(State s, GroundedAction a): Returns the belief MDP reward when the POMDP reward function depends on the next state transition. |
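
The two protected methods presumably correspond to the two standard ways of lifting a POMDP reward to a belief state. The formulas below are a sketch of that textbook definition, not a transcription of the implementation. The symbols are assumptions introduced here: b(s) is the belief mass on hidden state s, T(s' | s, a) is the POMDP transition probability, R is the source POMDP reward, and R_b is the resulting belief MDP reward.

```latex
\begin{align*}
\text{next-state independent:}\quad
  R_b(b, a) &= \sum_{s} b(s)\, R(s, a) \\
\text{next-state dependent:}\quad
  R_b(b, a) &= \sum_{s} b(s) \sum_{s'} T(s' \mid s, a)\, R(s, a, s')
\end{align*}
```

The first form has no inner sum over s', which is why declaring next-state independence lowers the cost per evaluation.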
protected PODomain podomain
protected RewardFunction pomdpRF
protected boolean srcRFIsNextStateIndependent
public BeliefMDPGenerator.BeliefRF(PODomain podomain, RewardFunction pomdpRF)
podomain - the source POMDP domain
pomdpRF - the source POMDP reward function

public BeliefMDPGenerator.BeliefRF(PODomain podomain, RewardFunction pomdpRF, boolean srcRFIsNextStateIndependent)
podomain - the source POMDP domain
pomdpRF - the source POMDP reward function
srcRFIsNextStateIndependent - a boolean flag indicating whether the POMDP reward function is independent of the next state transition

public double reward(State s, GroundedAction a, State sprime)
Specified by: reward in interface RewardFunction
s - the state in which the action was executed
a - the action executed
sprime - the state to which the agent transitioned

protected double saOnlyReward(State s, GroundedAction a)
s - the input belief state
a - the action selected

protected double sasReward(State s, GroundedAction a)
s - the input belief state
a - the action selected
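
To make the difference between the two reward computations concrete, here is a minimal, self-contained Java sketch. It does not use BURLAP types: the class name BeliefRewardSketch, the array-based belief, and the transition and reward tables are all hypothetical stand-ins chosen for illustration, and the action is held fixed so it does not appear as a parameter. The method names mirror saOnlyReward and sasReward only to show which case each one is assumed to cover.

```java
// Hypothetical illustration only; not part of BURLAP.
final class BeliefRewardSketch {

    // Expected reward when the reward depends only on the hidden state
    // (the action is held fixed): sum over s of b(s) * R(s).
    static double saOnlyReward(double[] belief, double[] stateReward) {
        double sum = 0.0;
        for (int s = 0; s < belief.length; s++) {
            sum += belief[s] * stateReward[s];
        }
        return sum;
    }

    // Expected reward when the reward also depends on the next state:
    // sum over s of b(s) * sum over s' of T(s'|s) * R(s, s').
    static double sasReward(double[] belief, double[][] transition, double[][] reward) {
        double sum = 0.0;
        for (int s = 0; s < belief.length; s++) {
            if (belief[s] == 0.0) {
                continue; // states with zero belief contribute nothing
            }
            double expected = 0.0;
            for (int sp = 0; sp < transition[s].length; sp++) {
                expected += transition[s][sp] * reward[s][sp];
            }
            sum += belief[s] * expected;
        }
        return sum;
    }

    public static void main(String[] args) {
        double[] belief = {0.25, 0.75};

        // Next-state-independent case: one reward value per hidden state.
        double[] stateReward = {-1.0, 2.0};
        System.out.println(saOnlyReward(belief, stateReward));

        // Next-state-dependent case: transition[s][sp] and reward[s][sp].
        double[][] transition = {{0.5, 0.5}, {0.0, 1.0}};
        double[][] reward = {{0.0, 4.0}, {10.0, -2.0}};
        System.out.println(sasReward(belief, transition, reward));
    }
}
```

The next-state-dependent version needs a nested loop over successor states for every hidden state with nonzero belief, while the independent version is a single pass over the belief. That extra inner loop is the marginalization that the srcRFIsNextStateIndependent flag is documented to avoid.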