SparseSampling.StateNode

java.lang.Object
- burlap.behavior.singleagent.planning.stochastic.sparsesampling.SparseSampling.StateNode

Enclosing class:

SparseSampling
```
public class SparseSampling.StateNode
extends java.lang.Object
```
A class for state nodes. Includes the state, a value estimate, whether the node has been closed and methods for estimating the Q and V values.

Author:

James MacGlashan

Constructor Summary

Constructors
Constructor and Description

SparseSampling.StateNode(StateHashTuple sh, int height)
Creates a node for the given hased state at the given height

Constructors
Constructor and Description
`SparseSampling.StateNode(StateHashTuple sh, int height)` Creates a node for the given hased state at the given height

Method Summary

Methods
Modifier and Type	Method and Description
`java.util.List<QValue>`	`estimateQs()` Estimates and returns the Q-values for this node.
`double`	`estimateV()` Returns the estimated Q-value if this node is closed, or estimates it and closes it otherwise.
`protected double`	`fullBelmmanQValue(GroundedAction ga)` Computes the exact Q-value using full Bellman update with the actual transition dynamics.
`protected double`	`sampledBellmanQEstimate(GroundedAction ga)` Estimates the Q-value using sampling from the transition dynamics.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Constructor Detail
  - SparseSampling.StateNode
```
public SparseSampling.StateNode(StateHashTuple sh,
                        int height)
```
    Creates a node for the given hased state at the given height
    
    Parameters:
    sh - the hashed state
    height - the height of the node
- Method Detail
  - estimateQs
```
public java.util.List<QValue> estimateQs()
```
    Estimates and returns the Q-values for this node. Q-values and used state samples are forgotten after this call completes.
    
    Returns:
    a List of the estiamted Q-values for each action.
  - sampledBellmanQEstimate
```
protected double sampledBellmanQEstimate(GroundedAction ga)
```
    Estimates the Q-value using sampling from the transition dynamics. This is the standard Sparse Sampling procedure.
    
    Parameters:
    ga - the action for which the Q-value estimate is to be returned
    
    Returns:
    the Q-value estimate
  - fullBelmmanQValue
```
protected double fullBelmmanQValue(GroundedAction ga)
```
    Computes the exact Q-value using full Bellman update with the actual transition dynamics. This procedure will cause Sparse Sampling to compute the exact Q-values and optimal policy for a finite horizon problem. It is reccommened when the number of transitions from any given state is small tractable to compute.
    
    Parameters:
    ga - the action for which the Q-value estimate is to be returned
    
    Returns:
    the exact finite horizon Q-value
  - estimateV
```
public double estimateV()
```
    Returns the estimated Q-value if this node is closed, or estimates it and closes it otherwise.
    
    Returns:
    the estimated Q-value for this node.

Class SparseSampling.StateNode

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Constructor Detail

SparseSampling.StateNode

Method Detail

estimateQs

sampledBellmanQEstimate

fullBelmmanQValue

estimateV