public class DiffVFRF
extends DifferentiableRF

A differentiable reward function for use with MLIRL when the true reward function is known, but the value function initialization for leaf nodes is to be learned. This class takes as input the true reward function and a DifferentiableVInit object to form the DifferentiableRF object that MLIRL will use.

**Field Summary**

Modifier and Type | Field and Description
---|---
`protected DifferentiableVInit.ParamedDiffVInit` | `diffVInit`
`protected RewardFunction` | `objectiveRF`

Fields inherited from class DifferentiableRF: `dim`, `parameters`
**Constructor Summary**

Constructor and Description
---
`DiffVFRF(RewardFunction objectiveRF, DifferentiableVInit.ParamedDiffVInit diffVinit)`
**Method Summary**

Modifier and Type | Method and Description
---|---
`protected DifferentiableRF` | `copyHelper()` A helper method for making a copy of this reward function.
`double[]` | `getGradient(State s, GroundedAction ga, State sp)` Returns the gradient of the reward function for the given state transition.
`double` | `reward(State s, GroundedAction a, State sprime)` Returns the reward received when action a is executed in state s and the agent transitions to state sprime.
`void` | `setParameters(double[] parameters)`

Methods inherited from class DifferentiableRF: `copy`, `getParameterDimension`, `getParameters`, `randomizeParameters`, `setParameter`, `toString`
**Field Detail**

`protected RewardFunction objectiveRF`

`protected DifferentiableVInit.ParamedDiffVInit diffVInit`

**Constructor Detail**

`public DiffVFRF(RewardFunction objectiveRF, DifferentiableVInit.ParamedDiffVInit diffVinit)`
**Method Detail**

`public double[] getGradient(State s, GroundedAction ga, State sp)`

Returns the gradient of the reward function for the given state transition.

Specified by: `getGradient` in class `DifferentiableRF`

Parameters:
- `s` - the source state
- `ga` - the action taken in the source state
- `sp` - the resulting state from the action

`protected DifferentiableRF copyHelper()`

A helper method for making a copy of this reward function; the copy's parameters are then set by the `DifferentiableRF.copy()` method.

Specified by: `copyHelper` in class `DifferentiableRF`
`public double reward(State s, GroundedAction a, State sprime)`

Returns the reward received when action a is executed in state s and the agent transitions to state sprime.

Specified by: `reward` in interface `RewardFunction`

Parameters:
- `s` - the state in which the action was executed
- `a` - the action executed
- `sprime` - the state to which the agent transitioned

`public void setParameters(double[] parameters)`

Overrides: `setParameters` in class `DifferentiableRF`
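To illustrate the composition this class performs, here is a simplified, self-contained sketch of the same pattern: the reward is delegated to a fixed objective reward function, while the learnable parameters (and hence the gradient) belong only to the value-function initialization. This is *not* BURLAP code; the `State` type, the linear value initialization, and the gradient behavior below are stubbed assumptions for illustration, with names chosen to mirror the real API.

```java
import java.util.Arrays;

public class DiffVFRFSketch {

    // Stub state: identified by a single feature index (hypothetical, for illustration)
    static final class State { final int feature; State(int f) { this.feature = f; } }

    // The known, fixed reward function (plays the role of objectiveRF)
    interface RewardFunction { double reward(State s, State sprime); }

    // A linear, differentiable value-function initialization (plays the role of
    // DifferentiableVInit.ParamedDiffVInit): V(s) = parameters[s.feature]
    static final class LinearVInit {
        double[] parameters;
        LinearVInit(int dim) { this.parameters = new double[dim]; }
        // Gradient of V(s) with respect to the parameters: a one-hot vector
        double[] gradient(State s) {
            double[] g = new double[parameters.length];
            g[s.feature] = 1.0;
            return g;
        }
    }

    // The DiffVFRF analog: reward is delegated to the fixed objective reward
    // function, which contributes nothing to the gradient; the gradient with
    // respect to the learnable parameters comes from the value initialization.
    static final class DiffVFRF {
        final RewardFunction objectiveRF;
        final LinearVInit diffVInit;
        DiffVFRF(RewardFunction objectiveRF, LinearVInit diffVInit) {
            this.objectiveRF = objectiveRF;
            this.diffVInit = diffVInit;
        }
        double reward(State s, State sprime) { return objectiveRF.reward(s, sprime); }
        double[] getGradient(State s, State sprime) { return diffVInit.gradient(sprime); }
        void setParameters(double[] p) { diffVInit.parameters = p; }
    }

    public static void main(String[] args) {
        // A goal-based objective reward: 1 when reaching feature 2, else 0
        RewardFunction goalReward = (s, sp) -> sp.feature == 2 ? 1.0 : 0.0;
        DiffVFRF rf = new DiffVFRF(goalReward, new LinearVInit(3));
        rf.setParameters(new double[]{0.5, -0.1, 0.0});

        State s = new State(0), sp = new State(2);
        System.out.println(rf.reward(s, sp));                       // fixed objective reward
        System.out.println(Arrays.toString(rf.getGradient(s, sp))); // gradient w.r.t. vinit params
    }
}
```

The key design point mirrored here is the separation of concerns: `setParameters` only touches the value-initialization parameters, so gradient-based MLIRL updates adjust the leaf-node value estimates while the objective reward stays fixed.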